NYC TLC taxi trip records

1.5 billion yellow, green, and FHV trips since 2009, distributed as monthly Parquet files. Stories use aggregates precomputed with DuckDB at build time. The Playground tab runs DuckDB WASM in your browser: queries execute locally against the remote Parquet files, with no server involved.

About this dataset

Scale

1.5 billion rows, 50+ GB of Parquet. Monthly files going back to 2009.

Architecture

Stories use frozen aggregates built with DuckDB at deploy time. The Playground tab uses DuckDB WASM — queries run in your browser, no server involved.
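
To make the "frozen aggregate" idea concrete, here is a minimal sketch of what such a build step could look like in Python. It is not the site's actual pipeline. The column name `PULocationID` is assumed from the TLC yellow-cab data dictionary, the function names and the output path are hypothetical, and `freeze_aggregate` needs the third-party `duckdb` package plus network access. Depending on the DuckDB version, the `httpfs` extension may have to be installed and loaded before remote files can be read.

```python
import json
from pathlib import Path

# Assumed column name from the TLC yellow-cab data dictionary; not stated on this page.
PICKUP_ZONE_COLUMN = "PULocationID"


def aggregate_sql(parquet_url: str) -> str:
    """Return a DuckDB query that counts trips per pickup zone for one monthly file."""
    return (
        f"SELECT {PICKUP_ZONE_COLUMN} AS zone_id, count(*) AS trips "
        f"FROM read_parquet('{parquet_url}') "
        f"GROUP BY {PICKUP_ZONE_COLUMN} "
        "ORDER BY trips DESC"
    )


def freeze_aggregate(parquet_url: str, out_path: str) -> int:
    """Run the aggregate once and write the result to a static JSON file.

    Requires the third-party `duckdb` package and network access.
    Returns the number of zones written.
    """
    import duckdb  # imported lazily so aggregate_sql works without the package

    con = duckdb.connect()  # in-memory database
    rows = con.execute(aggregate_sql(parquet_url)).fetchall()
    payload = [{"zone_id": zone_id, "trips": trips} for zone_id, trips in rows]
    Path(out_path).write_text(json.dumps(payload))
    return len(payload)
```

A build script would call something like `freeze_aggregate(url, "pickups_by_zone.json")` once per deploy, so a story page only ever loads the small JSON file rather than the Parquet source.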

Coverage

Yellow and green cabs, FHV (for-hire vehicles), and FHVHV (high-volume for-hire vehicles: Uber, Lyft, Via). The schema includes fare, tip, distance, and pickup and drop-off zones.

Stories using this data

article

After the parade

The 2024 NYC Pride march concluded on June 30 around 6 PM. In the four hours that followed, hundreds of thousands of attendees dispersed to bars, dinners, after-parties, and homes across the five boroughs. The taxi drop-off pattern shows you exactly where the post-parade economy lives.

article

The taxi data found the cellular dead zones

Every yellow cab logs its trip to the TLC's central server in real time. When the cellular signal drops, the meter buffers the trip locally and uploads it later. The TLC published the flag that marks these buffered trips. They probably did not realize they were also publishing a map of NYC's cellular dead spots.

article

What the congestion toll did to yellow cabs

On January 5, 2025, the Congestion Relief Zone toll went live. Battery Park lost 40% of its yellow cab pickups. World Trade Center dropped 22%. The TLC trip records show which zones the toll hit hardest — and which barely moved.

article

The tip tells you where you are

Yellow cab tip percentages by pickup zone don't track the income map as neatly as you'd expect. Airport runs, tourist corridors, and short hops have their own tipping logic — all of it baked into every credit card receipt since 2008.

article

NYC at 3 am

Every yellow cab drop-off between midnight and 5 am in 2023, aggregated by zone. The East Village handles more late-night arrivals than most of the outer boroughs combined. The nocturnal city has a geography — and it's not where you think.

article

The black car takeover

In 2017 there were more yellow cab trips than Uber and Lyft combined. By 2023 it wasn't close. Seven years of TLC data tells the story of the largest disruption in urban transportation since the car replaced the horse.

article

The taxi data is coming

1.5 billion rows of NYC taxi trips. The largest mobility dataset any U.S. city publishes — and the first to include the new Manhattan congestion-toll field. Why it doesn't fit our live-Socrata pattern, and what the planned pipeline looks like.

Available Parquet files

Monthly files from the TLC's public distribution. Use these URLs in the Playground tab with read_parquet('url').

yellow 2025-01 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2025-01.parquet
yellow 2025-02 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2025-02.parquet
yellow 2025-03 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2025-03.parquet
yellow 2024-01 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-01.parquet
yellow 2024-02 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-02.parquet
yellow 2024-03 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-03.parquet
yellow 2024-04 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-04.parquet
yellow 2024-05 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-05.parquet
yellow 2024-06 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-06.parquet
yellow 2024-07 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-07.parquet
yellow 2024-08 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-08.parquet
yellow 2024-09 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-09.parquet
yellow 2024-10 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-10.parquet
yellow 2024-11 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-11.parquet
yellow 2024-12 https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-12.parquet
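
The URLs above follow a regular naming pattern, so they can be generated rather than copied. The sketch below builds a URL for a given month and wraps it in a summary query that could be pasted into the Playground. The helper names are hypothetical, the column names `trip_distance` and `tip_amount` are assumed from the TLC yellow-cab data dictionary, and the pattern is only confirmed here for the yellow files listed above.

```python
BASE_URL = "https://d37ci6vzurychx.cloudfront.net/trip-data"


def tlc_url(kind: str, year: int, month: int) -> str:
    """Build a monthly file URL following the naming pattern of the listed files."""
    if not 1 <= month <= 12:
        raise ValueError(f"month must be 1-12, got {month}")
    return f"{BASE_URL}/{kind}_tripdata_{year:04d}-{month:02d}.parquet"


def playground_query(url: str) -> str:
    """Return a one-row summary query for a single monthly file."""
    # trip_distance and tip_amount are assumed column names (TLC data dictionary).
    return (
        "SELECT count(*) AS trips,\n"
        "       avg(trip_distance) AS avg_distance,\n"
        "       avg(tip_amount) AS avg_tip\n"
        f"FROM read_parquet('{url}');"
    )


# Print a query for the January 2025 yellow-cab file.
print(playground_query(tlc_url("yellow", 2025, 1)))
```

Querying one month at a time keeps the download small, since DuckDB only needs to fetch the columns the query touches from the remote file.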

Open the Playground to query these files directly.