Portfolio

Available for work

J Ay

Other

Seattle, WA·0 yrs exp

Data Engineer with 4 years building reliable data pipelines, warehouses, and analytics infrastructure at scale. I bridge the gap between raw data and business decisions — working closely with analytics, ML, and product teams.

GitHub ↗LinkedIn ↗Twitter ↗

About

“I write pipelines like I write code — modular, tested, and documented. If an analyst can trust the data, the pipeline matters. Observability and data quality checks are first-class citizens, not afterthoughts.”

Data Engineer with 4 years building and maintaining data infrastructure at scale using Python, Airflow, dbt, and Snowflake.

I started in analytics at a mid-size e-commerce company and kept getting pulled into the infrastructure side — fixing broken pipelines, rebuilding unreliable ingestion jobs, and wondering why the data was always wrong. Eventually I made it official. I've worked across retail, fintech, and SaaS, and I've learned that good data engineering is mostly about trust — making sure downstream teams can rely on what you ship.

NowCurrently building PipelineKit and exploring real-time streaming with Flink and Kafka.

Skills

01Pythonlanguages

02SQLlanguages

03Scalalanguages

04Apache Airflowdata engineering

05Apache Sparkdata engineering

06Apache Kafkadata engineering

07dbt (data build tool)data engineering

08Snowflakedata warehousing

Projects

01↗

PipelineKit — Airflow DAG Scaffolding CLI

CLI tool for scaffolding production-ready Apache Airflow DAGs from YAML specs, with built-in retry logic, SLA alerting, and data quality checks via Great Expectations.

PythonApache AirflowGreat ExpectationsClickDocker

02↗

StreamLedger — Real-Time Financial Event Tracker

Reference architecture for real-time financial event processing using Kafka and Flink, with a live React dashboard.

PythonApache KafkaApache FlinkPostgreSQLReactTypeScript

03↗

dbt-audit-macros

A collection of dbt macros for automated audit logging, freshness checks, and row-count reconciliation across Snowflake and BigQuery models.

dbtSQLSnowflakeBigQueryJinja2

Experience

2023-01-01 — Now

Meridian Analytics — Seattle, WA

Seattle, WA

Senior Data Engineer

Senior data engineer on a 6-person data platform team at a Series B fintech company. Own core ingestion pipelines, the Snowflake data warehouse, and data quality infrastructure used by 30+ analysts and 3 ML engineers.

—Redesigned the company's core ELT architecture from ad-hoc Python scripts to a fully orchestrated Airflow + dbt stack, reducing data freshness SLA breaches by 90%.
—Built a data contract framework with Great Expectations that runs on every pipeline run — catching schema drift and volume anomalies before they reach downstream dashboards.
—Led a cross-functional initiative with analytics and product to define and document 80+ core business metrics in dbt, creating a single source of truth for company-wide reporting.

2020-08-01 — 2022-12-31

Novu Commerce — Portland, OR

Portland, OR

Data Engineer

Data engineer on a two-person data team at a mid-size e-commerce company. Built and maintained pipelines for marketing, finance, and operations analytics.

—Migrated 14 legacy cron-based ETL jobs to Apache Airflow DAGs with proper retry logic, SLA monitoring, and Slack alerting — reducing weekly pipeline failures from 8-10 incidents to near-zero.
—Implemented a Snowflake data warehouse from scratch, consolidating data from Shopify, Stripe, Google Ads, and an internal PostgreSQL database into a unified analytics layer.
—Built a near-real-time inventory sync pipeline using AWS Lambda and DynamoDB Streams that reduced inventory discrepancy reports by 70%.

Education

University of Washington — Seattle

B.S. Information Systems

2016-09-01 – 2020-06-01

Udacity

Data Engineering Nanodegree

2020-10-01 – 2021-03-01

Contact

Making Data Teams Actually Trust Their Data

I'm currently open to senior data engineering roles at data-driven companies. If you're looking for someone who can build reliable pipelines, improve data quality, and work closely with analytics and ML teams — feel free to reach out.

ayj909961@gmail.com ↗

Location

Seattle, WA

Elsewhere

LinkedIn ↗GitHub ↗Twitter ↗Website ↗

Portfolio

Available for work

J Ay

Other

Seattle, WA·0 yrs exp

GitHub ↗LinkedIn ↗Twitter ↗

About

Data Engineer with 4 years building and maintaining data infrastructure at scale using Python, Airflow, dbt, and Snowflake.

NowCurrently building PipelineKit and exploring real-time streaming with Flink and Kafka.

Skills

01Pythonlanguages

02SQLlanguages

03Scalalanguages

04Apache Airflowdata engineering

05Apache Sparkdata engineering

06Apache Kafkadata engineering

07dbt (data build tool)data engineering

08Snowflakedata warehousing

Projects

01↗

PipelineKit — Airflow DAG Scaffolding CLI

CLI tool for scaffolding production-ready Apache Airflow DAGs from YAML specs, with built-in retry logic, SLA alerting, and data quality checks via Great Expectations.

PythonApache AirflowGreat ExpectationsClickDocker

02↗

StreamLedger — Real-Time Financial Event Tracker

Reference architecture for real-time financial event processing using Kafka and Flink, with a live React dashboard.

PythonApache KafkaApache FlinkPostgreSQLReactTypeScript

03↗

dbt-audit-macros

A collection of dbt macros for automated audit logging, freshness checks, and row-count reconciliation across Snowflake and BigQuery models.

dbtSQLSnowflakeBigQueryJinja2

Experience

2023-01-01 — Now

Meridian Analytics — Seattle, WA

Seattle, WA

Senior Data Engineer

—Redesigned the company's core ELT architecture from ad-hoc Python scripts to a fully orchestrated Airflow + dbt stack, reducing data freshness SLA breaches by 90%.
—Built a data contract framework with Great Expectations that runs on every pipeline run — catching schema drift and volume anomalies before they reach downstream dashboards.
—Led a cross-functional initiative with analytics and product to define and document 80+ core business metrics in dbt, creating a single source of truth for company-wide reporting.

2020-08-01 — 2022-12-31

Novu Commerce — Portland, OR

Portland, OR

Data Engineer

Data engineer on a two-person data team at a mid-size e-commerce company. Built and maintained pipelines for marketing, finance, and operations analytics.

—Migrated 14 legacy cron-based ETL jobs to Apache Airflow DAGs with proper retry logic, SLA monitoring, and Slack alerting — reducing weekly pipeline failures from 8-10 incidents to near-zero.
—Implemented a Snowflake data warehouse from scratch, consolidating data from Shopify, Stripe, Google Ads, and an internal PostgreSQL database into a unified analytics layer.
—Built a near-real-time inventory sync pipeline using AWS Lambda and DynamoDB Streams that reduced inventory discrepancy reports by 70%.

Education

University of Washington — Seattle

B.S. Information Systems

2016-09-01 – 2020-06-01

Udacity

Data Engineering Nanodegree

2020-10-01 – 2021-03-01

Contact

Making Data Teams Actually Trust Their Data

ayj909961@gmail.com ↗

Location

Seattle, WA

Elsewhere

LinkedIn ↗GitHub ↗Twitter ↗Website ↗