rebrowser/carscom-dataset

Cars.com Vehicle Listings Dataset

Daily sample of Cars.com vehicle listings with make, model, trim, mileage, body style, drivetrain, and dealer location across new and used inventory.

This repository contains a preview sample of the Cars.com dataset published by Rebrowser. If you're doing academic research, you may be eligible for free access to a much larger slice — see Free Datasets for Research.

This dataset contains one entity, Car Listings, stored in its own folder (car-listings). See below for its full field breakdown, sample counts, and data distributions.

Found this useful? ⭐ Star this repo to help us keep publishing fresh data. Found an error? Let us know.


Car Listings

Sample of Cars.com vehicle listings with year, make, model, trim, mileage, body style, drivetrain, fuel type, and dealer location.

The full dataset has 6,025,981 records collected from 2025-11-16 to 2026-04-12. This sample holds up to 30,000 rows (0.50% of the full dataset), exported as one file per day with up to 1,000 rows each; the last 30 days are retained.
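
As a quick check of the arithmetic, the sample size and its share follow from the export policy. This sketch uses only the figures quoted in this section; the variable names are illustrative.

```python
# Figures quoted in this README
total_records = 6_025_981
rows_per_file = 1_000   # upper bound per daily file
days_retained = 30

# Upper bound on sample rows, and its share of the full dataset
max_sample_rows = rows_per_file * days_retained
sample_share = max_sample_rows / total_records

print(max_sample_rows)        # 30000
print(f"{sample_share:.2%}")  # 0.50%
```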

Data Growth

| Field | Type | Fill Rate | Description |
| --- | --- | --- | --- |
| _primaryKey | string | 100% | Unique identifier for this record |
| _firstSeenAt | datetime | 100% | First time this record was seen |
| _lastSeenAt | datetime | 100% | Last time this record was updated |
| listingId | string | 100% | Unique Cars.com listing UUID (e.g., 62a9e175-556a-49da-b18e-e2ac95ef83ba) |
| vin 🔒 | string | 100% | Vehicle Identification Number (17-character unique code) |
| stockType | string | 100% | Listing type (New, Used, Certified) |
| year | float | 100% | Vehicle model year |
| make | string | 100% | Vehicle manufacturer (e.g., Mazda, Audi, Toyota) |
| model | string | 100% | Vehicle model name (e.g., CX-5, Q5, Camry) |
| trim | string | 99% | Vehicle trim level (e.g., 2.5 S Preferred Package, Premium Plus 45 TFSI) |
| price 🔒 | float | 100% | Listed price in USD |
| msrp 🔒 | float | 46% | Manufacturer suggested retail price in USD |
| mileage | float | 99% | Odometer reading in miles |
| bodyStyle | string | 100% | Body style (Sedan, SUV, Coupe, Hatchback, Truck, etc.) |
| exteriorColor | string | 99% | Exterior color (e.g., Black, White, Silver) |
| interiorColor | string | 97% | Interior color (e.g., Black Leather, Beige) |
| drivetrain | string | 98% | Drivetrain type (e.g., All-wheel Drive, Front-wheel Drive, Four-wheel Drive, Rear-wheel Drive, FWD, AWD) |
| transmission | string | 2% | Transmission type (e.g., Automatic, Manual) |
| engine | string | 2% | Engine description (e.g., SKYACTIV-G 2.5L I-4) |
| fuelType | string | 98% | Fuel type (e.g., Gasoline, Hybrid, E85 Flex Fuel, Diesel) |
| mpg | string | 2% | EPA mileage rating range (e.g., 26-30) |
| stockNumber | string | 2% | Dealer stock number |
| sellerType | string | 100% | Seller type (e.g., dealership) |
| sellerName 🔒 | string | 99% | Seller/dealer name (e.g., Liberty Mazda, Audi Richmond) |
| sellerCity | string | 99% | Seller city location |
| sellerState | string | 99% | Seller state abbreviation (e.g., CT, VA) |
| images 🔒 | array | 2% | Array of all listing photo URLs |
| imagesCount | float | 2% | Number of listing images |
| options | array | 2% | Array of vehicle options (e.g., Adaptive Cruise Control, Heated Seats, Bluetooth) |
| optionsCount | float | 2% | Number of vehicle options |
| description | string | 2% | Seller description/notes about the vehicle |
| listingUrl 🔒 | string | 100% | Full URL to the Cars.com vehicle listing page |
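
A rough way to check fill rates on whatever slice you have loaded is sketched below. It assumes "fill rate" means the share of non-null values per column, which this README does not define, and it uses a few hypothetical rows in place of real data.

```python
import pandas as pd

# Hypothetical toy rows standing in for a loaded listings DataFrame
listings = pd.DataFrame({
    "make": ["Mazda", "Audi", "Toyota", "Ford"],
    "trim": ["2.5 S Preferred Package", None, "LE", "XLT"],
    "transmission": [None, None, None, "Automatic"],
})

# Assumed definition: percentage of non-null values per column
fill_rate = listings.notna().mean().mul(100).round(0)
print(fill_rate.to_string())
```

Sparse fields such as transmission or engine (about 2% in the table) are worth checking this way before building an analysis on them.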

🔒 Premium fields are included in the data files but their values are replaced with [PREMIUM]. To access real values, use our website.
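
Because masked fields carry a placeholder rather than a missing value, it can help to convert them before computing statistics. The sketch below assumes the placeholder is stored as the plain string [PREMIUM] in scalar columns; array-typed columns such as images may need separate handling. The rows are hypothetical.

```python
import pandas as pd

PREMIUM = "[PREMIUM]"  # placeholder string described in this README

# Hypothetical toy rows; premium columns hold the placeholder
listings = pd.DataFrame({
    "make": ["Mazda", "Audi"],
    "price": [PREMIUM, PREMIUM],
    "vin": [PREMIUM, PREMIUM],
})

# Replace the placeholder with a proper missing value
masked = listings.mask(listings == PREMIUM)

# Drop columns that contain nothing but the placeholder
premium_cols = [c for c in masked.columns if masked[c].isna().all()]
usable = masked.drop(columns=premium_cols)
print(premium_cols)  # ['price', 'vin']
```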

Field Distributions

Shares in each table are relative to the total of the values listed in that table, not to all 6,025,981 records.

Stock Type (New/Used/Certified) (stockType)

| Value | Count | Share |
| --- | --- | --- |
| Used | 3,407,589 | ███████████░░░░░░░░░ 56.5% |
| New | 2,618,392 | █████████░░░░░░░░░░░ 43.5% |

Body Style Distribution (bodyStyle)

| Value | Count | Share |
| --- | --- | --- |
| SUV | 3,340,874 | ███████████░░░░░░░░░ 55.6% |
| Truck | 1,157,641 | ████░░░░░░░░░░░░░░░░ 19.3% |
| Sedan | 944,403 | ███░░░░░░░░░░░░░░░░░ 15.7% |
| Hatchback | 164,660 | █░░░░░░░░░░░░░░░░░░░ 2.7% |
| Coupe | 133,215 | ░░░░░░░░░░░░░░░░░░░░ 2.2% |
| Passenger Van | 91,556 | ░░░░░░░░░░░░░░░░░░░░ 1.5% |
| Convertible | 67,924 | ░░░░░░░░░░░░░░░░░░░░ 1.1% |
| Cargo Van | 67,793 | ░░░░░░░░░░░░░░░░░░░░ 1.1% |
| Minivan | 22,990 | ░░░░░░░░░░░░░░░░░░░░ 0.4% |
| Wagon | 21,020 | ░░░░░░░░░░░░░░░░░░░░ 0.3% |

Top Vehicle Makes (make)

| Value | Count | Share |
| --- | --- | --- |
| Ford | 777,815 | ████░░░░░░░░░░░░░░░░ 18.7% |
| Chevrolet | 618,285 | ███░░░░░░░░░░░░░░░░░ 14.9% |
| Toyota | 581,190 | ███░░░░░░░░░░░░░░░░░ 14.0% |
| Honda | 419,977 | ██░░░░░░░░░░░░░░░░░░ 10.1% |
| Nissan | 337,963 | ██░░░░░░░░░░░░░░░░░░ 8.1% |
| Hyundai | 322,100 | ██░░░░░░░░░░░░░░░░░░ 7.7% |
| Jeep | 321,860 | ██░░░░░░░░░░░░░░░░░░ 7.7% |
| Kia | 299,310 | █░░░░░░░░░░░░░░░░░░░ 7.2% |
| GMC | 260,027 | █░░░░░░░░░░░░░░░░░░░ 6.2% |
| BMW | 223,566 | █░░░░░░░░░░░░░░░░░░░ 5.4% |

Fuel Type Distribution (fuelType)

| Value | Count | Share |
| --- | --- | --- |
| Gasoline | 5,053,982 | █████████████████░░░ 85.4% |
| Hybrid | 350,923 | █░░░░░░░░░░░░░░░░░░░ 5.9% |
| Diesel | 222,064 | █░░░░░░░░░░░░░░░░░░░ 3.8% |
| Electric | 187,217 | █░░░░░░░░░░░░░░░░░░░ 3.2% |
| E85 Flex Fuel | 64,974 | ░░░░░░░░░░░░░░░░░░░░ 1.1% |
| Gas | 19,808 | ░░░░░░░░░░░░░░░░░░░░ 0.3% |
| Plug-In Hybrid | 6,300 | ░░░░░░░░░░░░░░░░░░░░ 0.1% |
| Regular unleaded | 5,503 | ░░░░░░░░░░░░░░░░░░░░ 0.1% |
| Flexible Fuel | 4,015 | ░░░░░░░░░░░░░░░░░░░░ 0.1% |
| Regular Unleaded | 2,478 | ░░░░░░░░░░░░░░░░░░░░ 0.0% |

Listings by State (sellerState)

| Value | Count | Share |
| --- | --- | --- |
| TX | 611,161 | ████░░░░░░░░░░░░░░░░ 18.4% |
| FL | 609,811 | ████░░░░░░░░░░░░░░░░ 18.4% |
| CA | 557,489 | ███░░░░░░░░░░░░░░░░░ 16.8% |
| OH | 283,754 | ██░░░░░░░░░░░░░░░░░░ 8.5% |
| IL | 278,838 | ██░░░░░░░░░░░░░░░░░░ 8.4% |
| NY | 216,127 | █░░░░░░░░░░░░░░░░░░░ 6.5% |
| GA | 204,426 | █░░░░░░░░░░░░░░░░░░░ 6.2% |
| NC | 188,987 | █░░░░░░░░░░░░░░░░░░░ 5.7% |
| NJ | 187,311 | █░░░░░░░░░░░░░░░░░░░ 5.6% |
| PA | 182,879 | █░░░░░░░░░░░░░░░░░░░ 5.5% |
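
The fuel type distribution above lists several labels that look like spelling variants of one category (Gasoline, Gas, Regular unleaded, Regular Unleaded; E85 Flex Fuel, Flexible Fuel). If you want them merged before counting, one possible approach is sketched below. The alias mapping and the `normalize_fuel` helper are assumptions made for illustration, not part of the dataset.

```python
import pandas as pd

# Assumed mapping (not part of the dataset): merge apparent spelling variants
FUEL_ALIASES = {
    "gas": "Gasoline",
    "regular unleaded": "Gasoline",
    "flexible fuel": "E85 Flex Fuel",
}

def normalize_fuel(value):
    """Map known variants to one label; leave other values unchanged."""
    if pd.isna(value):
        return value
    return FUEL_ALIASES.get(str(value).strip().lower(), value)

# Hypothetical toy rows using labels that appear in the fuel type distribution
fuel = pd.Series(["Gasoline", "Gas", "Regular unleaded", "Regular Unleaded",
                  "Flexible Fuel", "Hybrid", None])
normalized = fuel.map(normalize_fuel)
print(normalized.value_counts().to_string())
```

The same pattern could be applied to drivetrain, where abbreviated and spelled-out labels (FWD, Front-wheel Drive) both appear.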

Pre-built Views on Rebrowser

The Rebrowser web viewer lets you filter, sort, and export any slice of this dataset interactively. These pre-built views are ready to open:

Car Listings

Vehicle Listings with Pricing — 5,888,737 records

[{"field":"price","op":"gt","value":0},{"sort":"price ASC"}]

New Vehicle Listings — 2,573,973 records

[{"field":"stockType","op":"is","value":"New"},{"sort":"price ASC"}]

Used Vehicle Listings — 3,314,764 records

[{"field":"stockType","op":"is","value":"Used"},{"sort":"price ASC"}]

Listings with Multiple Photos — 91,181 records

[{"field":"imagesCount","op":"gt","value":5},{"sort":"imagesCount DESC"}]

SUV Listings — 3,266,873 records

[{"field":"bodyStyle","op":"is","value":"SUV"},{"sort":"price ASC"}]

See all 34 views →
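
The view definitions above read as a list of filter clauses followed by a sort clause. A minimal sketch of applying such a definition locally with pandas follows. It assumes "gt" means greater-than and "is" means equality; Rebrowser's actual semantics are not documented here, and `apply_view` is a hypothetical helper.

```python
import json
import operator
import pandas as pd

# Assumed meaning of the "op" values seen in the view definitions
OPS = {"gt": operator.gt, "is": operator.eq}

def apply_view(df, spec_json):
    """Apply filter clauses, then any sort clause (hypothetical helper)."""
    out = df
    for clause in json.loads(spec_json):
        if "sort" in clause:
            # Sort clauses look like "<column> ASC" or "<column> DESC"
            column, _, direction = clause["sort"].partition(" ")
            out = out.sort_values(column, ascending=direction.upper() != "DESC")
        else:
            keep = OPS[clause["op"]](out[clause["field"]], clause["value"])
            out = out[keep]
    return out

# Hypothetical toy rows
listings = pd.DataFrame({
    "bodyStyle": ["SUV", "Sedan", "SUV"],
    "imagesCount": [12.0, 3.0, 8.0],
})
spec = '[{"field":"imagesCount","op":"gt","value":5},{"sort":"imagesCount DESC"}]'
print(apply_view(listings, spec))
```

Views that filter or sort on price will not work on the preview files, since price is a premium field and its values are replaced with [PREMIUM].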


Code Examples

import pandas as pd
from pathlib import Path

# ── Car Listings ─────────────────────────────────────────────────────────────
# Load the 7 most recent daily files (assumes file names sort chronologically)
data_dir = Path('rebrowser/carscom-dataset/car-listings/data')
files = sorted(data_dir.glob('*.parquet'))[-7:]
if not files:
    raise FileNotFoundError(f'No .parquet files found in {data_dir}')
listings = pd.concat([pd.read_parquet(f) for f in files], ignore_index=True)

# Top 15 makes by listing count
print(listings['make'].value_counts().head(15).to_string())

# Body style breakdown for used vehicles
used = listings[listings['stockType'] == 'Used']
print(used['bodyStyle'].value_counts().to_string())

# Average mileage by body style for used cars
print(used.groupby('bodyStyle')['mileage'].mean().sort_values().to_string())

# States with the most listings
print(listings['sellerState'].value_counts().head(10).to_string())

# Fuel type distribution across all listings
print(listings['fuelType'].value_counts().to_string())

# Listings per model year (most recent years)
print(listings['year'].value_counts().sort_index(ascending=False).head(10).to_string())

Use Cases

Inventory Composition Analysis

Group listings by body style, drivetrain, and fuel type to understand how dealer inventory is distributed across vehicle segments and regions.
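
A minimal sketch of this grouping, using hypothetical rows in place of the loaded sample (column names are taken from the field table):

```python
import pandas as pd

# Hypothetical toy rows
listings = pd.DataFrame({
    "sellerState": ["TX", "TX", "TX", "FL"],
    "bodyStyle": ["Truck", "Truck", "SUV", "SUV"],
    "drivetrain": ["Four-wheel Drive", "Four-wheel Drive",
                   "All-wheel Drive", "Front-wheel Drive"],
    "fuelType": ["Gasoline", "Diesel", "Gasoline", "Hybrid"],
})

# Listing counts per state and vehicle segment
composition = (
    listings.groupby(["sellerState", "bodyStyle", "drivetrain", "fuelType"])
    .size()
    .rename("listings")
    .reset_index()
)

# Share of each segment within its state
composition["stateShare"] = (
    composition["listings"]
    / composition.groupby("sellerState")["listings"].transform("sum")
)
print(composition.to_string(index=False))
```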

Regional Market Comparison

Filter by seller state to compare which makes and models dominate specific geographic markets. Identify regional preferences like truck-heavy states vs sedan markets.
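
One way this comparison might look in pandas is sketched below, again with hypothetical rows. It finds the most common make per state and the share of truck listings per state.

```python
import pandas as pd

# Hypothetical toy rows
listings = pd.DataFrame({
    "sellerState": ["TX", "TX", "TX", "CA", "CA", "CA"],
    "make": ["Ford", "Ford", "Toyota", "Toyota", "Toyota", "Honda"],
    "bodyStyle": ["Truck", "Truck", "SUV", "Sedan", "SUV", "Sedan"],
})

# Most common make in each state
top_make = listings.groupby("sellerState")["make"].agg(
    lambda s: s.value_counts().idxmax()
)

# Share of truck listings in each state
truck_share = (
    listings["bodyStyle"].eq("Truck").groupby(listings["sellerState"]).mean()
)

print(top_make.to_string())
print(truck_share.to_string())
```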

Vehicle Specification Trends

Analyze how trim levels, drivetrain types, and fuel options shift across model years. Track the growth of hybrid and electric listings relative to gasoline.
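
As an illustration, the share of hybrid and electric listings per model year could be computed as follows. Which fuelType labels count as electrified is an assumption here, and the rows are hypothetical.

```python
import pandas as pd

# Hypothetical toy rows
listings = pd.DataFrame({
    "year": [2022.0, 2022.0, 2024.0, 2024.0, 2024.0, 2024.0],
    "fuelType": ["Gasoline", "Gasoline", "Hybrid", "Electric",
                 "Gasoline", "Gasoline"],
})

# Assumed grouping of labels; adjust to your own definition
ELECTRIFIED = {"Hybrid", "Plug-In Hybrid", "Electric"}

# Fraction of listings per model year whose fuel type is electrified
by_year = (
    listings.assign(electrified=listings["fuelType"].isin(ELECTRIFIED))
    .groupby("year")["electrified"]
    .mean()
    .sort_index()
)
print(by_year.to_string())
```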

Listing Quality Benchmarking

Compare image counts and option detail completeness across listings to benchmark what constitutes a high-quality vehicle listing on Cars.com.
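
A possible starting point is sketched below with hypothetical rows. Note that imagesCount and optionsCount have a fill rate of about 2% in this dataset, so the sketch drops rows where they are missing; using the median as the benchmark is an arbitrary choice.

```python
import pandas as pd

# Hypothetical toy rows; the last row mimics a listing without detail fields
listings = pd.DataFrame({
    "imagesCount": [30.0, 12.0, 4.0, None],
    "optionsCount": [25.0, 10.0, 2.0, None],
})

# Keep only rows where both detail fields are populated
detailed = listings.dropna(subset=["imagesCount", "optionsCount"])

# Median and 90th percentile as simple benchmarks
benchmarks = detailed[["imagesCount", "optionsCount"]].quantile([0.5, 0.9])

# Flag listings at or above the median on both measures
median = benchmarks.loc[0.5]
detailed = detailed.assign(
    aboveMedian=(detailed["imagesCount"] >= median["imagesCount"])
    & (detailed["optionsCount"] >= median["optionsCount"])
)
print(detailed.to_string(index=False))
```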


Full Dataset on Rebrowser

This repo is a preview sample: up to 1,000 rows per daily file, with the last 30 days retained. The full dataset is at rebrowser.net/products/datasets/carscom

Doing academic research? You may qualify for free access to a larger slice. See Free Datasets for Research.

On Rebrowser you can:

  • Filter before you buy — use the web UI to apply filters on any field and sort by any column. Preview results before purchasing. You only pay for records that match your criteria.
  • Export in your format — CSV, JSON, JSONL, or Parquet depending on your plan.
  • Access via API — integrate dataset queries into your pipelines and workflows.
  • Choose your freshness — plans range from a 14-day lag to real-time data with no delay.
  • Select only the fields you need — keep exports lean. Premium fields with richer data are available on higher plans.

Pricing starts at $2 per 1,000 rows with volume discounts.


License & Terms

Free for research and non-commercial use with attribution. See license terms and how to cite.

@misc{rebrowser_carscom,
  author       = {Rebrowser},
  title        = {Cars.com Vehicle Listings Dataset},
  year         = {2026},
  howpublished = {\url{https://rebrowser.net/products/datasets/carscom}},
  note         = {Accessed: YYYY-MM-DD}
}

Commercial use requires a paid license — see pricing. Use of this data is governed by the Rebrowser Terms of Use, which may be updated at any time independently of this repository.


Disclaimer

Rebrowser is an independent data provider and is not affiliated with, endorsed by, or sponsored by Cars.com. Any trademarks are the property of their respective owners. This dataset is compiled from publicly available information; we do not request or collect Cars.com user credentials. By using this dataset, you agree to comply with Cars.com's Terms of Service and all applicable laws and regulations. Images, logos, descriptions, and other materials included in this dataset remain the intellectual property of their respective owners and are provided solely for informational purposes. Rebrowser makes no warranties regarding the accuracy, completeness, or legality of the data and assumes no liability for how the data is used. You are solely responsible for ensuring that your use of this dataset does not infringe on the rights of any third party.

You can also find this data on Kaggle, HuggingFace, and Zenodo.
