Overview & Methodology
DATA SOURCES VERIFIED
What this report shows: Every voter in this dataset has been matched against the New York State death index —
a state-maintained record of deaths 1957–2017 — and has a recorded vote in 2020 or later.
The matching algorithm uses name, birth year, county of residence, gender, and middle name as independent
corroborating factors. All cases presented here cleared the HIGH confidence threshold, which requires
a score of 7 or higher under the Scoring Methodology.
Key Statistics
| Finding | Count | Notes |
|---|---|---|
| Statewide HIGH confidence matches (born 1940+) | 31,075 | Exact name + birth year; voted 2020+ |
| Statewide HIGH confidence — all cohorts (pre-1920, 1920–1939, 1940+) | 33,906 | Complete coverage across all birth years on active voter rolls |
| Perfect matches: post-1972 death + county corroborated + score ≥ 10 | 790 | Fraud-allegation-ready: top scoring tier, with county of death corroborated |
| Voted 10+ years after recorded death date (1940+ cohort) | 26,583 | 85.5% of HIGH matches — rules out most data-entry error explanations |
| Voted 20+ years after recorded death date (1940+ cohort) | 15,301 | 49.2% — systematic long-term roll maintenance failure at minimum |
| NY-19 HIGH confidence matches | 2,305 | Congressional District 19 only |
| NY-19 addresses with 2+ deceased voters registered (clusters) | 17 | Potential ballot harvesting or institutional registration locations |
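The percentages in the Notes column can be re-derived from the counts in the table. The following is a minimal sketch of that arithmetic only; the variable names are illustrative and are not taken from the report's own tooling.

```python
# Counts copied from the Key Statistics table.
high_1940_plus = 31_075        # statewide HIGH confidence matches, born 1940+
high_all_cohorts = 33_906      # statewide HIGH confidence matches, all cohorts
voted_10_plus_years = 26_583   # voted 10+ years after recorded death (1940+ cohort)
voted_20_plus_years = 15_301   # voted 20+ years after recorded death (1940+ cohort)


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


pct_10 = share(voted_10_plus_years, high_1940_plus)
pct_20 = share(voted_20_plus_years, high_1940_plus)

# HIGH matches that fall in the pre-1920 and 1920-1939 cohorts combined.
pre_1940_matches = high_all_cohorts - high_1940_plus

print(pct_10, pct_20, pre_1940_matches)
```

Both shares use the 1940+ cohort total (31,075) as the denominator, matching the row labels.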
Data Source 1
NYS Active Voter Registration
New York State Board of Elections — full statewide active voter file, all 62 counties.
Data Source 2
NY State Death Index 1957–2017
NYS Dept. of Health — annual death records with name, estimated birth year, county of death, sex, and state file number.
County Coverage Boundary
The death index carries a county_code field only for deaths from 1973 onward (100% coverage). Deaths before 1973 carry no county — geographic corroboration is only possible for the post-1972 subset.
Vote Filter
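All matched voters have LASTVOTERDATE ≥ 2020-01-01. Because the death index covers deaths through 2017, every match in this dataset has a recorded vote at least 2 years after the recorded death (a death on 2017-12-31 followed by a vote on 2020-01-01 is the minimum case), or at least 3 calendar years later when measured by year alone.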
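The filter and the smallest possible gap between death and vote can be sketched as follows. This assumes the index's last covered day is 2017-12-31 and that LASTVOTERDATE is available as a calendar date; the function and constant names are hypothetical.

```python
from datetime import date

VOTE_CUTOFF = date(2020, 1, 1)            # LASTVOTERDATE filter stated in the report
LATEST_INDEX_DEATH = date(2017, 12, 31)   # assumption: last day covered by the 1957-2017 index


def passes_vote_filter(last_voter_date: date) -> bool:
    """True when the voter's most recent recorded vote is on or after the cutoff."""
    return last_voter_date >= VOTE_CUTOFF


# Smallest possible gap: latest covered death versus earliest qualifying vote.
min_gap_days = (VOTE_CUTOFF - LATEST_INDEX_DEATH).days
min_gap_calendar_years = VOTE_CUTOFF.year - LATEST_INDEX_DEATH.year

print(min_gap_days, min_gap_calendar_years)
```

Measured in elapsed days the floor is just over two years; measured by calendar year alone it is three.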
Scoring Methodology
Each death-voter match receives a confidence score equal to the weighted sum of the corroborating
factors listed in the table. A score of 7 or higher is classified HIGH confidence. The system deliberately
penalizes county mismatches (−1 point) to suppress false positives from common names.
| Factor | Points | Explanation |
|---|---|---|
| Unique name statewide | +4 | Last + first name appears exactly once on the entire voter roll — eliminates name-collision ambiguity |
| County of residence match | +3 | Voter's county code equals death record county code — independent geographic corroboration (post-1972 deaths only) |
| Middle name / initial match | +3 | First initial of voter middle name matches first initial of death record middle name |
| Exact birth year | +2 | Voter DOB year matches death record estimated birth year exactly |
| Birth year off by 1 | +1 | ±1 tolerance applied for age-at-death integer rounding in older records |
| Gender match | +1 | Voter gender matches death record sex field |
| County mismatch | −1 | Voter county differs from death county — penalty reduces score to suppress false positives |
| HIGH confidence threshold | ≥ 7 | Minimum score for inclusion in HIGH tier |
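| Perfect match threshold | ≥ 10 | Reachable, for example, with unique name (4) + county match (3) + birth year (2) + gender (1); other combinations, such as unique name (4) + county match (3) + middle name (3), also total 10 |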
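The point table can be read as a simple additive scoring function. The sketch below is one possible reading, not the report's actual implementation: the `MatchEvidence` class and its field names are hypothetical, and it assumes a death record with no county (pre-1973) contributes neither the +3 bonus nor the −1 penalty.

```python
from dataclasses import dataclass
from typing import Optional

HIGH_THRESHOLD = 7
PERFECT_THRESHOLD = 10


@dataclass
class MatchEvidence:
    """Hypothetical container for one voter/death-record comparison."""
    unique_name_statewide: bool
    middle_initial_match: bool
    birth_year_diff: int           # voter birth year minus death-record estimated birth year
    gender_match: bool
    county_match: Optional[bool]   # None when the death record carries no county (pre-1973)


def confidence_score(ev: MatchEvidence) -> int:
    """Sum the points from the scoring table for one candidate match."""
    score = 0
    if ev.unique_name_statewide:
        score += 4
    if ev.middle_initial_match:
        score += 3
    if ev.birth_year_diff == 0:
        score += 2                 # exact birth year
    elif abs(ev.birth_year_diff) == 1:
        score += 1                 # off by one year
    if ev.gender_match:
        score += 1
    if ev.county_match is True:
        score += 3
    elif ev.county_match is False:
        score -= 1                 # mismatch penalty
    return score


def tier(ev: MatchEvidence) -> str:
    """Classify a match; PERFECT additionally requires a corroborated county."""
    score = confidence_score(ev)
    if score >= PERFECT_THRESHOLD and ev.county_match is True:
        return "PERFECT"
    if score >= HIGH_THRESHOLD:
        return "HIGH"
    return "BELOW_HIGH"


# Unique name + county match + exact birth year + gender = 4 + 3 + 2 + 1.
example = MatchEvidence(True, False, 0, True, True)
print(confidence_score(example), tier(example))
```

Under this reading, a pre-1973 death can reach a score of 10 (unique name, middle initial, exact birth year, gender) but stays in the HIGH tier, because the perfect-match row in Key Statistics also requires county corroboration.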