Launching OpenMedicare: AI-Powered Medicare Fraud Detection Across $854B in Payments

Today we're launching OpenMedicare — a free, searchable platform covering ten years of Medicare physician payment data from the Centers for Medicare & Medicaid Services. The dataset spans 2014–2023, encompasses 96 million rows of provider-level payment records, and represents $854.8 billion in total Medicare spending. For the first time, anyone can explore where Medicare dollars go, compare providers, and see which ones our fraud model flags as high-risk.

The Scale of Medicare Spending

Medicare is the federal health insurance program covering 65 million Americans — primarily seniors and people with disabilities. The physician payment data we've indexed reveals striking patterns in how that money flows.

Across the decade, 1.72 million unique providers billed Medicare. The average markup ratio — what Medicare paid versus what providers charged — stands at 3.77x, meaning providers routinely charge nearly four times what they actually receive. While this is standard practice in medical billing, the variation across providers and specialties tells a more nuanced story.

Our analysis also identified 4,636 "impossible" providers — practitioners whose billing patterns defy normal medical practice. These include providers billing for an implausible number of patients per day, billing across incompatible specialties simultaneously, or showing geographic patterns inconsistent with a single practice location.

$854.8 billion in payments analyzed

96 million records across 1.72 million providers over ten years — every dollar searchable and explorable.

Machine Learning Fraud Detection

The centerpiece of OpenMedicare is our ML fraud detection model. We trained a classifier using the HHS Office of Inspector General's List of Excluded Individuals and Entities (LEIE) — a database of providers who have been officially sanctioned for fraud, abuse, or other misconduct.

By matching 8,300+ confirmed fraudsters from the LEIE against their historical Medicare billing patterns, we built a model that learns what fraudulent billing looks like: unusually high volumes, suspicious procedure mixes, billing spikes, and geographic anomalies.

The model achieves an AUC of 0.83 — meaning it correctly distinguishes between fraudulent and legitimate providers 83% of the time. Applied to the full Medicare dataset, it has flagged approximately 500 providers as high-risk for potential fraud, waste, or abuse. Each flagged provider's profile on OpenMedicare includes the risk score and the contributing factors.

To be clear: a fraud flag is not an accusation. It indicates statistical patterns consistent with known fraud cases and warrants further investigation. Our goal is to surface leads, not render verdicts.

Key Findings

Beyond the fraud model, our analysis of ten years of Medicare data has surfaced several significant patterns:

  • $2.8 billion in COVID-era anomalies: During 2020–2021, certain providers and specialties showed dramatic billing spikes inconsistent with their historical patterns — some legitimate (telehealth expansion), others suspicious (billing for services unlikely to have been delivered during lockdowns).
  • $5.5 billion wound care cluster: A concentrated group of wound care providers collectively billed $5.5 billion, with billing patterns that diverge sharply from the specialty's norms. Wound care has been flagged by the OIG as a high-fraud specialty, and our data confirms the concern at scale.
  • 3.77x average markup ratio: While standard in medical billing, the variance is enormous — some providers charge 10x or more what Medicare pays, raising questions about list pricing practices.
  • 4,636 impossible providers: Billing patterns that defy the constraints of time, geography, or medical plausibility — strong candidates for further investigation.

500 providers flagged by ML model

Trained on 8,300+ LEIE-confirmed fraudsters with an AUC of 0.83 — surfacing high-risk billing patterns for further investigation.

Part of TheDataProject Portfolio

OpenMedicare joins our growing suite of government transparency platforms:

  • OpenMedicaid — $1.09 trillion in Medicaid provider payments across all 50 states
  • OpenFeds — 2.07 million federal employees, DOGE tracking, brain drain analysis
  • OpenSpending — $11.2 trillion in federal contracts, grants, and agency expenditures

Together, these platforms represent the most comprehensive open-access window into federal healthcare and government spending available anywhere. Medicare and Medicaid alone account for over $1.9 trillion annually — nearly a third of all federal spending. Making this data searchable and analyzable isn't just a technical project; it's an accountability project.

Explore OpenMedicare

Visit openmedicare.us to search any Medicare provider, view their billing history and peer comparisons, check fraud risk scores, and explore ten years of payment data. Everything is free, open, and requires no account.

OpenMedicare is built by TheDataProject.AI — making public data usable, searchable, and accessible to everyone.

Share this article