Portfolio
  • About
  • Skills
  • Projects
  • Experience
  • Education
  • Contact
P
Portfolio

Navigation

  • 01About
  • 02Skills
  • 03Projects
  • 04Experience
  • 05Education
  • 06Contact
Portfolio
Available for work

J Ay

—

Other

Seattle, WA·0 yrs exp

Data Engineer with 4 years building reliable data pipelines, warehouses, and analytics infrastructure at scale. I bridge the gap between raw data and business decisions — working closely with analytics, ML, and product teams.

GitHub ↗LinkedIn ↗Twitter ↗

About

02

“I write pipelines like I write code — modular, tested, and documented. If an analyst can trust the data, the pipeline matters. Observability and data quality checks are first-class citizens, not afterthoughts.”

Data Engineer with 4 years building and maintaining data infrastructure at scale using Python, Airflow, dbt, and Snowflake.

I started in analytics at a mid-size e-commerce company and kept getting pulled into the infrastructure side — fixing broken pipelines, rebuilding unreliable ingestion jobs, and wondering why the data was always wrong. Eventually I made it official. I've worked across retail, fintech, and SaaS, and I've learned that good data engineering is mostly about trust — making sure downstream teams can rely on what you ship.

NowCurrently building PipelineKit and exploring real-time streaming with Flink and Kafka.

Skills

04
01Pythonlanguages
02SQLlanguages
03Scalalanguages
04Apache Airflowdata engineering
05Apache Sparkdata engineering
06Apache Kafkadata engineering
07dbt (data build tool)data engineering
08Snowflakedata warehousing

Projects

05
01↗

PipelineKit — Airflow DAG Scaffolding CLI

CLI tool for scaffolding production-ready Apache Airflow DAGs from YAML specs, with built-in retry logic, SLA alerting, and data quality checks via Great Expectations.

PythonApache AirflowGreat ExpectationsClickDocker
02↗

StreamLedger — Real-Time Financial Event Tracker

Reference architecture for real-time financial event processing using Kafka and Flink, with a live React dashboard.

PythonApache KafkaApache FlinkPostgreSQLReactTypeScript
03↗

dbt-audit-macros

A collection of dbt macros for automated audit logging, freshness checks, and row-count reconciliation across Snowflake and BigQuery models.

dbtSQLSnowflakeBigQueryJinja2

Experience

03
Now
2023-01-01

2023-01-01 — Now

Meridian Analytics — Seattle, WA

Seattle, WA

Senior Data Engineer

Senior data engineer on a 6-person data platform team at a Series B fintech company. Own core ingestion pipelines, the Snowflake data warehouse, and data quality infrastructure used by 30+ analysts and 3 ML engineers.

  • —Redesigned the company's core ELT architecture from ad-hoc Python scripts to a fully orchestrated Airflow + dbt stack, reducing data freshness SLA breaches by 90%.
  • —Built a data contract framework with Great Expectations that runs on every pipeline run — catching schema drift and volume anomalies before they reach downstream dashboards.
  • —Led a cross-functional initiative with analytics and product to define and document 80+ core business metrics in dbt, creating a single source of truth for company-wide reporting.
2022-12-31
2020-08-01

2020-08-01 — 2022-12-31

Novu Commerce — Portland, OR

Portland, OR

Data Engineer

Data engineer on a two-person data team at a mid-size e-commerce company. Built and maintained pipelines for marketing, finance, and operations analytics.

  • —Migrated 14 legacy cron-based ETL jobs to Apache Airflow DAGs with proper retry logic, SLA monitoring, and Slack alerting — reducing weekly pipeline failures from 8-10 incidents to near-zero.
  • —Implemented a Snowflake data warehouse from scratch, consolidating data from Shopify, Stripe, Google Ads, and an internal PostgreSQL database into a unified analytics layer.
  • —Built a near-real-time inventory sync pipeline using AWS Lambda and DynamoDB Streams that reduced inventory discrepancy reports by 70%.

Education

06
#InstitutionDegreePeriod
01

University of Washington — Seattle

B.S. Information Systems

2016-09-01 – 2020-06-01

02

Udacity

Data Engineering Nanodegree

2020-10-01 – 2021-03-01

Contact

07

Making Data Teams Actually Trust Their Data

I'm currently open to senior data engineering roles at data-driven companies. If you're looking for someone who can build reliable pipelines, improve data quality, and work closely with analytics and ML teams — feel free to reach out.

Email

ayj909961@gmail.com ↗

Location

Seattle, WA

Elsewhere

LinkedIn ↗GitHub ↗Twitter ↗Website ↗
Made withSerisLab