rafhq · omen

OMEN

Simulate the electorate, not the candidate.

Census-backed populations, one campaign at a time. Events move them. Manual live orders are reserved for campaigns that clear provenance, backtest, and market-mapping gates.

scroll · 01 · what

01What

A population simulator for predictable events.

Elections, awards, referendums, regulatory decisions — outcomes are not fate. They are the aggregate of millions of individual decisions under predictable behavioral biases. OMEN simulates that aggregation. Where our population disagrees with the market, we only act once the campaign is explicitly marked ready for manual live trading.

02Pipeline

The loop.

Populate

Census ACS 5-year PUMS microdata — real individual records for the race's state or district — get weighted-sampled down to the simulation size (default 2,000; cap 25,000). Same seed, same voters, every time.

Event-feed

The race's timeline — polls, debates, endorsements, scandals, ad spend — is encoded with a valence and magnitude against our rubric. Events flow through the population chronologically.

iii

Simulate

Each voter updates their intent under a stateful belief-updating rule. Partisan base locks in. Swing moves on debates. Low-info chases endorsements. Cross-pressured swings late. The population drifts.

Aggregate & gate

Per-event snapshots sum to a campaign-level prediction. OMEN only surfaces a live trade when provenance, backtests, geometry coverage, and market mapping are all green.

03Population

Real people. Real demographics. Real districts.

Populations come from US Census ACS 5-year PUMS microdata — every simulated voter is a real individual record, preserving the joint distribution of age × sex × education × race × ethnicity × income × citizenship × PUMA as reported to Census. We filter to voting-age citizens, weighted-sample down to 25,000 rows per state for repo size, and rescale so the total weight still equals the state's full citizen-voting-age population.

State + district caches

108.6M

Citizen-voting-age covered

≤2pp

Max sample delta vs true at n=2k

ACS 2022

5-year PUMS vintage

Cached populationCitizen voting-age

FL · Florida

15.7M

TX · Texas

19.2M

PA · Pennsylvania

9.9M

OH · Ohio

9.0M

NC · North Carolina

7.7M

GA · Georgia

7.7M

MI · Michigan

7.6M

VA · Virginia

6.3M

AZ · Arizona

5.1M

WI · Wisconsin

4.5M

CO · Colorado

4.3M

MN · Minnesota

4.2M

IA · Iowa

2.4M

NV · Nevada

2.1M

NM · New Mexico

1.5M

NH · New Hampshire

1.1M

NY · NY-16 (district)

0.3M

source · data/acs/*.json · fetched via Census ACS 5-year PUMS API · filters: AGEP≥18, CIT∈{1-4} · weighted-sample-without-replacement cap 25,000 rows · weight rescaled so totalWeight = full citizen-voting-age population

How the population updates

Each simulated voter holds an ideological coordinate, a turnout propensity, and a behavioral-segment label (partisan base, swing, low-info, demographic bloc, cross-pressured). Events propagate through the population under segment-specific update rules — base locks in, swing moves on debates, low-info reacts to endorsements + attack ads. The segment mix is drawn from Pew + ANES typology studies as an informed starting prior, then calibrated per-race when polling crosstabs are available. The forecast does not rely on these mix proportions in isolation — they're a rule for how recent events perturb the outcome on top of the Census-demographic + polling + structural signal, all weighted via the four-engine ensemble.

full derivation + thresholds · /omen/methodology

04Backtest

Held-out evaluation. No grading your own work.

The honest number below is the hybrid forecast with information frozen 30 days before election day — the forecast OMEN would have produced at T−30, without knowing the late polls or last-minute events. It's the only number that counts. Deterministic seeds mean any result is exactly reproducible from source.

7 / 9

Honest T−30 winners correct

2.24 pp

Honest T−30 mean share err

Full-information reference (T−0)

For comparison: the same hybrid ensemble scored with full information (every poll + event up to election day visible). This is the ceiling we grade ourselves against — the gap between it and the honest T−30 number is what the late-cycle information was worth.

7 / 9

Full-info winners correct

1.78 pp

Full-info mean share err

Error breakdown · by race type

Where the system is strong and weak. Sorted by hybrid share err, best-calibrated first. Primaries are genuinely the hardest class because within-party ideology is a noisier signal than D/R contrast.

Race typeNHonest T−30Full infoWinners H / F

general71.56pp1.23pp5/7 · 5/7

runoff12.10pp1.91pp1/1 · 1/1

primary17.18pp5.53pp1/1 · 1/1

Architecture

Four engines combine via confidence-weighted ensemble with 15% shrinkage toward uniform: fundamentals (pregame + incumbency), polling aggregate (recency × sample × population weighted), synthetic electorate (real ACS Census populations + CPS-calibrated individual turnout propensity + ideology-proximity voter preferences), and an event/sentiment engine. Populations for 7 of 9 fixtures are drawn from US Census ACS microdata — real people from the actual state or district, every dimension (age × education × race × income) sampling within ~2pp of the true weighted distribution at n=2,000. The two national-popular-vote fixtures use the segment-mix generator instead (national ACS caches exceed repo size budget).

05Fixtures

Every backtest. Every number.

Eight historical races across primaries, gubernatorial, senate, and presidential contests from 2016 to 2024. Ranked by share error. Two documented upsets where pregame priors had the wrong winner.

RaceHonest T−30Full infovs pregame

NY-16 Democratic Primary2024

Latimer · Pregame had Bowman.

7.18pp

5.53pp

+13.50pp

US Presidential2020

Biden

0.90pp

+0.00pp

Virginia Governor2021

Youngkin (upset) · Pregame had Mcauliffe.

2.48pp

missed winner

1.02pp

+1.94pp

Pennsylvania Senate General2022

Fetterman

1.15pp

+0.00pp

US Presidential2024

Trump · Pregame had Harris.

0.95pp

+0.43pp

MI Presidential2016

Trump (upset) · Pregame had Clinton.

2.60pp

missed winner

2.60pp

+0.50pp

Arizona Senate2022

Kelly

1.05pp

-3.17pp

Texas Senate2018

Cruz

1.76pp

0.95pp

-2.98pp

Georgia Senate Runoff2022

Warnock

2.10pp

1.91pp

-5.13pp

06Principles

Deterministic. Traceable. Falsifiable.

Deterministic

No LLMs in the hot path. Same seed + same events = byte-identical prediction. Reproducibility is load-bearing.

Traceable

Every aggregate prediction decomposes to per-segment and per-voter contributions. If you can't explain the call, the call is wrong.

Falsifiable

Backtests score on a held-out slice the model never saw. Live predictions will be pre-registered before election date. Known failures documented in the repo, not buried.

Grounded in Levitt 2004, Croxson & Reade 2014, Argyle et al. 2023, Park et al. 2023–2024, Gao et al. 2024.

07Next

Live predictions begin when the first market opens.

OMEN is a research system today. Backtests are clean; live trading is the next phase. The plan: pre-register a forecast for a 2026 primary market on Kalshi at least 72 hours before election day, with full per-segment breakdowns and the exact population seed. Pre-registered means the record is public.

How it works · /omen/methodology

Every algorithm, every data source →

Population sampling, turnout model, ideology routing, four engines, ensemble weights, uncertainty, event scoring rubric. If a number is on this page, the source is there.

The public record → /omen/predictions

Every forecast, pre-registered.

Every OMEN forecast is timestamped and committed to git before election day — 72-hour minimum lead. 8 historical races replayed through the full hybrid forecaster as demo backtests. First live pre-registration lands with the 2026 primary cycle.

Meanwhile · run the process

Run the simulator in your browser →

Pick any historical race, adjust the population seed and size, watch the prediction evolve event-by-event. Same deterministic engine that produces the backtest table.

/console