openroles

A daily-refreshed, privacy-respecting job board across 32 applicant tracking systems. No accounts. No ads. No tracking. Static HTML and gzipped JSON, served from GitHub Pages — filtered in your browser, cached for instant revisits.

Dark mode & mobile

Note

Live at https://openroles.today/. Refreshed every night.

What it is

openroles scrapes the public APIs of 32 hiring platforms each night, normalises the postings into a shared schema, and ships them to your browser as a set of content-hashed JSON.gz chunks. Filter, sort, search, and pagination all run client-side over the in-memory dataset — once the chunks have loaded, every interaction is instant and works offline.

Apply links go directly to the source ATS. Saved roles, applied roles, ignored roles, saved searches, and your theme preference live in localStorage on your device — nothing about you ever leaves the browser. The masthead, hero, and first 50 rows are pre-rendered HTML so first paint never waits on JavaScript.

Try it

Every filter is in the URL — bookmark a query, share it, embed it in a Notion page. No login required for any of these.

Query	URL
Senior + staff engineers on Greenhouse or Lever, remote-only, last 7 days	`?ats=greenhouse,lever&level=senior,staff&wt=remote&since=7d`
"Staff engineer" anywhere in the title, Germany only	`?q=title:"staff engineer"&country=DE`
Stripe, all roles	`?q=company:stripe`
Hide recruiter posts + hide stale carry-forwards	`?recruiter=0&hide_stale=1`

The URL DSL is documented in specs/filter-ui.md; the parser is property-tested in site/src/lib/search-dsl.test.ts.

How it works

Common Crawl  ──►  weekly-harvest.yml  ──►  data/tenants/{ats}.json
                                                    │
                                                    ▼
ATS public APIs ──►  scrape.yml (nightly)  ──►  data/scrape-outputs/*
                                                    │
                                                    ▼
                              build-deploy.yml  ──►  jobs.{sha}.sqlite (build-time)
                                                    │
                                                    ▼
                              slim-index emitter  ──►  data/slim/*.json.gz
                                                    │
                                                    ▼
                              Astro static build  ──►  GitHub Pages
                                                    │
                                                    ▼
                                                 Browser
                                            (Web Worker decodes
                                             slim-index chunks,
                                             FilterTable runs
                                             everything in memory)

The build-time SQLite is scaffolding only — it isn't deployed. What ships to the browser is a few dozen content-hashed JSON.gz chunks (~50 MB total today, growing with the corpus) plus a manifest.json. A Service Worker caches the gzipped bytes; the merged dataset is further cached in IndexedDB keyed by the build sha, so warm reloads restore in ~100 ms and skip the chunk pipeline entirely.

See ARCHITECTURE.md for the full system shape and docs/adr/ for the locked architectural decisions.

Coverage

Thirty-two ATSes. The multi-tenant set, weighted by tenant volume in the public Common Crawl index:

Greenhouse · Lever · Ashby · BambooHR · Workday · iCIMS · Recruitee
Breezy · Personio · Workable · Teamtailor · SmartRecruiters · CSOD
Taleo · UltiPro · Jobvite · Zoho Recruit · Talentlyft · Pinpoint HQ
ApplicantPro · ApplicantStack · Homerun · Factorial · Eightfold
SuccessFactors · BrassRing

Plus two vendor-agnostic harvesters and four per-company custom adapters:

JSON-LD harvester — walks a per-tenant sitemap and extracts schema.org/JobPosting structured data (e.g. Lockheed Martin, Spectrum).
Google-for-Jobs RSS harvester (gjobsfeed) — reads a brand's public Google-for-Jobs feed. Recovers brands whose primary API is robots.txt-blocked (SAP, ExxonMobil, Halliburton, Cintas, …).
Four per-company custom adapters for employers running their own careers API: Amazon · Apple · TikTok · Meta.

Tenant slugs are discovered from public Common Crawl snapshots and liveness-probed weekly; hard-dead slugs are dropped, transient failures are retained for retry.

Quick start

bun install
bun run dev      # http://localhost:4321/
bun run test     # full suite, 95% line / 95% function / 90% branch
bun run e2e      # Playwright + axe-core a11y + Lighthouse
bun run build    # static site to site/dist/

To build the SQLite + slim-index from cached scrape outputs:

bun run build-db -- --input data/scrape-outputs \
                    --tenants data/tenants-merged.json \
                    --output-dir data --short-sha 0000000

Project layout

scraper/    Bun + TypeScript scraper, build-db, harvest CLI, drift detector
site/       Astro 6 site, Svelte 5 filter island, slim-index runtime
shared/     Cross-workspace zod schemas + shared types
specs/      Per-feature behavior contracts
docs/adr/   Locked architectural decisions
.github/    scrape · weekly-harvest · build-deploy · pr CI workflows

Documentation

Doc	What it's for
ARCHITECTURE.md	High-level system shape, data flow, key commitments
CONTRIBUTING.md	TDD discipline, conventional commits, pre-commit hooks
SECURITY.md	Vulnerability disclosure
docs/adr/	Locked architectural decisions (Madr 4.0)
specs/	Per-feature behavior contracts
CHANGELOG.md	Release log, regenerated from Conventional Commits

Licence

Code — MIT. Fork it, ship it, sell it; the only ask is to keep the copyright line.
Listings dataset — CC BY-SA 4.0. Reuse is fine; attribution + share-alike are required so derivative aggregators stay open.

Acknowledgements

Inspiration and design influence for this project came from Feashliaa/job-board-aggregator. Independent implementation, but credit where credit is due.

Help spread the word

If openroles saves you time — or you appreciate that it's ad-free, account-free, and tracker-free — please ⭐ the repo and pass it to a friend who's job-hunting. Every star helps people find an aggregator that's actually on their side.

Name		Name	Last commit message	Last commit date
Latest commit History 280 Commits
.github		.github
data		data
design-wireframes		design-wireframes
docs		docs
scraper		scraper
shared		shared
site		site
specs		specs
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.tool-versions		.tool-versions
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
LICENSE-DATA		LICENSE-DATA
README.md		README.md
SECURITY.md		SECURITY.md
biome.json		biome.json
bun.lock		bun.lock
bunfig.toml		bunfig.toml
cliff.toml		cliff.toml
commitlint.config.cjs		commitlint.config.cjs
lefthook.yml		lefthook.yml
nginx.conf		nginx.conf
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

openroles

What it is

Try it

How it works

Coverage

Quick start

Project layout

Documentation

Licence

Acknowledgements

Help spread the word

About

Licenses found

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

openroles

What it is

Try it

How it works

Coverage

Quick start

Project layout

Documentation

Licence

Acknowledgements

Help spread the word

About

Topics

Resources

License

Licenses found

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages