03 · Sources
Trusted, defensible data
No scraping of Levels.fyi, Glassdoor, or LinkedIn. Crucible runs on government filings, legally compliant pay-transparency postings, and a feedback loop from realized offers.
v1 sources · 4
v1.1 sources · +3
Source · 01 · Government
3 years of LCA disclosures and PERM filings. Verified employer, worksite, SOC code, and prevailing wage rate. The defensible spine of every band.
Source · 02 · Government
Occupational Employment & Wage Statistics. Per SOC × MSA percentiles (10/25/50/75/90). Used as the geographic baseline and sanity check.
Source · 03 · Compliance
CA, CO, WA, NY, IL, MD, HI, DC, NJ. Legally-required disclosure tied to actual JD text. Highest-confidence source; powers the semantic retriever.
Source · 04 · Filings
DEF 14A proxy statements. Named executive officer comp. Anchors the top of band for public companies and reveals comp philosophy.
Source · 05 · Ontology
SOC code normalization, skill ontology, occupation tasks. Not a comp source — the scaffolding that lets every other source speak the same language.
Source · 06 · Internal
Every recruiter validation, every realized offer, every "your band was off" feedback. Closes the loop. Drives calibration and drift detection.