TPAS logo: a sword piercing a stack of papers.

Claims Denials in U.S. Health Insurance

Claims denials in U.S. health insurance pose a serious problem for patients navigating their care. In this report we present and analyze data that sheds light on the occurrence and prevalence of claims denials in commercial health insurance, propose policies to improve patient outcomes, and highlight existing avenues for consumer recourse. Our analyses show that commercial claims denials correspond to a highly valuable population of claims, and that even modest improvements to appeal access, utilization and outcomes have the potential to return billions of dollars to consumers.

Table of Contents

Introduction

Claims are a fundamental and atomic unit of U.S. health insurance: every time an insured consumer attempts to use their health insurance to cover a portion of a bill, a claim is submitted to their insurer on their behalf. The claim records details about the care received and its cost necessary for an insurer to make a payment determination. The primary role of health insurers is to receive and adjudicate these claims based on the contracts they sell, and ultimately pay for some portion of them, on behalf of those who purchased the contracts (typically employers or individuals).

When insurers adjudicate a claim to determine whether they will pay for it, they review the contracts that govern the coverage which has been purchased on behalf of the insured. The permissible content in these contracts is often restricted by laws and regulations, and the contracts may also refer to external sources (e.g. "clinical policy bulletins", or "scientific literature") which will be used to inform coverage determinations. We will say a coverage decision is contractually inappropriate if the decision is inconsistent with the contract governing a policy, or the laws and regulations that apply to that policy.

There are many coverage decisions that one might deem morally or medically inappropriate that would not be considered contractually inappropriate. For example, it would be morally troubling for corporations to sell policies that exclude the coverage of expensive treatments for those facing dire medical situations in such a way that the relevant contract clauses conflate actual scientific consensus with consensus according to a for-profit corporation’s own non-expert reading of their hand picked literature selection. However, such behavior is not necessarily contractually inappropriate.

Contractually inappropriate claims denials, and the lack of adequate recourse for consumers even in these cases, have been serious problems for a long time. There are organizations that regularly publish reports analyzing the limited publicly available claims denial data (Pollitz & Rae, 2020), and the subject has been discussed in policy research, legal research (Fox, 2022) and popular journalism (Konrad, 2010) for a long time.

Recently, there have been numerous investigations and articles that have brought to light for the general public just how damaging some of the inappropriate practices taking place really are (Armstrong et al., 2023b, 2023a), and illuminated the scarcity of publicly available data (Fields, 2023).

Finally, there are those most seriously affected; many patients with dire, serious, or chronic medical needs (and their friends and families) have long known about the lack of recourse they enjoy, and the frequency with which contractually inappropriate denials occur just in seeking their own care -- they have been on the front lines for decades, suffering the consequences of an inadequately regulated industry.

In this work, we set out to illuminate the extent and prevalence of claims denials, and the crucial monetary role they play in our systems. To do so, we analyze publicly available claims denial and appeal data. We seek to do this analysis in a transparent, and reproducible way, to ensure that our work can be extended and used by others seeking to improve patient outcomes, justice, and accountability. We discuss this more explicitly in our transparency statement below. It is our hope that the analyses in this report can be used to catalyze change, such as in supporting those seeking to convince decision makers to enact relevant policy.

Transparency Statement

We aim to publish reports in such a way that all of our analyses, plots, and numerical claims drawn from data can be vetted, reproduced, and built upon as desired. We believe the ability to reproduce results end to end is crucially relevant in discussions of this topic, as existing data and transparency are scarce and poorly understood, and there are many conflicting voices and viewpoints in the space.

To this end, we provide transparent, open source access to all analyses and data used to generate this document. The interested reader may probe any aspect of our end to end analyses of raw data provided by government institutions by investigating the following resources:

  • All non-interactive figures appearing in this report, as well as the data populating interactive figures, were generated via the code housed here.

  • Data we obtained via public records is hosted here.

  • A static copy of this entire article is housed here.

If you believe there are errors in this report, analytical or otherwise, we would appreciate your correspondence. Our goal is to achieve rigorous results that can be audited by anyone, and that constitute trustworthy evidence that improves in quality over time. We hope to build trust with you by providing such open access to our end to end workflow.

To correspond about our analyses, please open an issue in our public repository, or send us a message at info@persius.org.

Acknowledgments

It is a pleasure to thank the following people for their support in editing and improving this work:

  • Alan Bush, Ph.D.

    For editorial and organizational support.

To be clear, despite their contributions to this article, the parties above do not necessarily share views expressed, nor does their editorial support constitute endorsement of opinion.

A Primer on Claims Denials

A claim denial occurs when a claim is submitted to an insurer, and the insurer decides they will not pay for it. In practice, such decisions are made for many reasons; some valid1, and others inappropriate or indefensible.

When a claim is denied, a patient is typically left with an out of pocket expense larger than what they would otherwise face.

Denials

One of the most basic metrics that one can use to investigate how claims handling varies is an overall claims denial rate. This measures the fraction all claims submitted that are denied:

Denial Rate=Claims DeniedClaims Received\text{Denial Rate} = \dfrac{\text{Claims Denied}}{\text{Claims Received}}

This is not a uniquely defined notion, because the numerator and denominator can be defined in various ways. For example, one could compute this ratio for all claims submitted to a particular insurer over the course of a year, or for all of the claims submitted for a particular plan over a month, or even for all claims submitted from just one individual to a given plan over the course of a year2. In what follows, we will calculate denial rates that fit this form in a few different ways.

It is worth noting that:

  • Among the claims that are appropriate3, a low denial rate is always a good thing for patients.
  • High denial rates can be caused by numerous things, and do not necessarily indicate inappropriate insurer practices.

Appeals

When insurers deny claims, patients are often entitled to an appeals process. For example, those with federal marketplace plans are afforded such recourse, as are all others with so-called non-grandfathered commercial qualified health plans (see (Pollitz, 2021) for more background); Medicare allows for its own distinct processes for appealing denied claims; and Medicaid programs support their own distinct processes (see e.g. this guide).

The details of these processes vary based on details of one's insurance plan and relevant regulations, but typically they allow patients to initially file an internal appeal of the denial, which is adjudicated by the insurer itself. This is the case, for example, in commercial insurance plans, which are the primary focus of this article given the data we analyze.

If the internal appeal process (which in some plans requires an insured to make two attempts at an internal appeal) results in the insurer upholding their denial, patients are then sometimes able to file an external appeal. This means they can submit an appeal of the decision to a independent third party that is responsible for judging the merits of the determination; the final determinations of these third parties are typically binding. The third parties, and processes by which external appeals are submitted to them and reviewed by them, vary according to the details of the type of plan involved.

In understanding the landscape of claims denials, it is useful to understand the frequency of occurrence, and distribution of outcomes, of these internal appeal processes.

Internal Appeal Rates

We define the initial internal appeal rate to be the fraction of denied claims that are appealed at the first (internal) level available to consumers.

Internal Appeal Rate=Claims Appealed (@ first level)Claims Denied\text{Internal Appeal Rate} = \dfrac{\text{Claims Appealed (@ first level)}}{\text{Claims Denied}}

Again, this is a loose definition that we will specify more precisely in each particular calculation below.

Of those denials that are internally appealed, we denote the fraction that are overturned by the insurer as the internal appeal overturn rate.

Internal Appeal Overturn Rate=Claims Overturned (@ first level)Claims Appealed (@ first level)\text{Internal Appeal Overturn Rate} = \dfrac{\text{Claims Overturned (@ first level)}}{\text{Claims Appealed (@ first level)}}

External Appeal Rates

We define the external appeal rate to be the fraction of denials eligible for external appeal (this consists mostly of internally appealed and subsequently upheld denials) which are actually externally appealed by consumers.

External Appeal Rate=Claims Appealed ExternallyClaims Eligible for External Appeal\text{External Appeal Rate} = \dfrac{\text{Claims Appealed Externally}}{\text{Claims Eligible for External Appeal}}

Of those denials that are externally appealed, we denote the fraction that are overturned by a third party as the external appeal overturn rate.

External Appeal Overturn Rate=Claims Overturned ExternallyClaims Appealed Externally\text{External Appeal Overturn Rate} = \dfrac{\text{Claims Overturned Externally}}{\text{Claims Appealed Externally}}

Public Claims Denial Data

Overview

To understand the landscape of claims denials, contractually inappropriate claims denials, appeals, and effective consumer recourse, we need data. Unfortunately, requirements for commercial insurers to publicly disclose claims denial data are few and far between, and the extent to which requirements that do exist are enforced and regulated is unclear. However, some publicly available data does exist.

Datasets

We make use of the following publicly available data:

  • Federal Marketplace Public Use Files (PUFs) from the Centers for Medicare and Medicaid Services (CMS)

    Dataset Details

    Each year CMS releases a collection of transparency data related to federal marketplace insurers. Here we are interested in the transparency in coverage4 (TIC) public use files. These are files compiled from self-reported insurer data specifying aggregate counts of claims submitted, claims denied, and claims appealed, among other things. This data is limited in scope, but it is the closest thing we have to a denial transparency survey among commercial plans with broad (geographical) scope. It is compiled and reported to the public in accordance with the Department of Health and Human Services (HHS) rule 45 of the Code of Federal Regulations (CFR), part 155, subpart K, (cf CFR 156 subpart C).

    In this article we focus on the 2023 public use file, which details claims denial data from the 2021 plan year.

    This 2021 data corresponds to:

    • Plan year 2021.
    • 33 states.
    • 230 unique insurers.
    • 6,764 plans.
    • 8,251,703 consumers.
  • New York Health Care Claim Reports

    Dataset Details

    Thanks to legislation from 2020, New York publishes health care claim reports for insurance companies that contain aggregate counts of claims submitted, claims denied, and claims appealed for each commercial NY insurer, among other things.

    This data, released quarterly as of 2022 in the form of spreadsheets for each New York insurer, details aspects of health insurance coverage for insurance plans in New York. Various details about how the data is reported are included in the frequently asked questions for the data on the DFS homepage.

    Residents of New York state can not buy individual Affordable Care Act (ACA) compliant Qualified Health Plans (QHPs) on the federal marketplace. Instead, NY maintains its own marketplace where such plans can be purchased, known as New York State of Health. The New York Health Care Claims Reports includes statistics relating to NY marketplace plans, in addition to plans that are not sold via the marketplace.

    In this article, we analyze the health care claims report data from 2022. The 2022 data includes:

    • Plan year 2022.
    • 29 unique insurers.
    • 145,446,802 claims adjudicated, worth at least $66,810,881,848.
    • 25,996,601 claims denied, worth at least $2,265,844,512.
    • 177,917 claims denied for medical necessity, worth at least $35,747,594.
    • 145,129 internal appeals adjudicated, worth at least $303,942,639.
    • 42,375 overturned internal appeals, worth at least $124,245,612.

    All denials recorded in the data are post-service claims denials, so pre-authorization denials are not included. Appeals in the data include both appeals submitted directly by consumers, and appeals submitted by providers on behalf of consumers.

  • New York External Appeal Outcome Data

    Dataset Details

    External appeals of denials from New York marketplace plans, fully insured group plans, and Medicaid managed care plans are adjudicated by Independent Review Organizations. The New York Department of Financial Services maintains a public database of results from such external appeals.

    The New York External Appeal Database corresponds to:

    • Plan years 2019 to 2023.
    • 24,728 external appeals.
    • Individual and group health plans.
    • Fully insured employer plans.
    • IMRs associated with plans regulated by the New York State Department of Financial Services.

    In this article we discard the subset of the dataset corresponding to claims from 2023; such claims correspond to an incomplete plan year, and they have the potential to introduce confounding inconsistencies into our analyses. We initially included them in our analyses, but found they led to easily misinterpreted results.

  • Washington External Appeal Outcome Data

    Dataset Details

    In Washington state, claims denial data is not generally made public. However, as in New York, the final adjudication status of claims which are both ultimately externally appealed and fall under the authority of the Office of the Insurance Commissioner (OIC) are recorded in a publicly accessible dataset.

    External appeals of denials from Washington marketplace plans are adjudicated by Independent Review Organizations. According to Washington Administrative Code 284-43-3030 and the Revised Code of Washington 48.43.530, insurers must support both internal and external appeal processes, and report resulting data to the insurance commissioner. The code above provides for external appeals to be sought as soon as an internal process is exhausted, or 30 days after internal appeal initiation has occurred, whichever is sooner. Details of the rules pertaining to external appeals are described in the Revised Code of Washington 48.43.535.

    The OIC in Washington state maintains the aforementioned public database of results from these external appeals. This data corresponds to consumers enrolled in Washington health plans that are regulated by the Office of the Insurance Commissioner (OIC). This includes, but is not limited to, individual plans sold on the Washington marketplace (known as Washington Health Plan Finder).

    The OIC external appeal database corresponds to:

    • Plan years 2016 through 2023.
    • 8,774 external appeals
    • Individual and group health plans.
    • Self insured and fully insured employer plans.
    • IMRs associated with plans regulated by the OIC.

    In this article we discard the subset of the dataset corresponding to claims from 2023; such claims correspond to an incomplete plan year, and they have the potential to introduce confounding inconsistencies into our analyses. We initially included them in our analyses, but found they led to easily misinterpreted results.

  • California External Appeal Data from the California Department of Insurance (CDI) and the Department of Managed Health Care (DMHC)

    Dataset Details

    California releases data related to internal and external appeal outcomes in at least four places, all of which we consider:

    External appeals in California, also known as Independent Medical Reviews (IMRs), follow processes dictated by two pieces of the California Insurance Code (CIC).

    • Sections 10169.1 - 10169.5, which initially became effective January 1, 2001, describe rules that apply to IMR sought for health coverage decisions purportedly related to medical necessity. There have been numerous modifications to the law since then. The current law expressly provides for the inclusion of those on Medicare and Medicare Advantage plans, however it also allows for processes to be subsumed by existing state Medicare resolution outlets. It allows IMRs to be sought as soon as an internal process is exhausted, or 30 days after internal initiation has occurred, whichever is sooner. It also allows the CDI to contract with the DMHC to administer the IMR as desired (which has come into play for managed care plans, such as HMOs, as described in the last dataset description above.)
    • Section 10145.3, also effective as of January 1, 2001, describes particular rules that apply to IMR sought for insurer decisions stemming from deeming treatment or services as experimental or investigational in nature.

    The CDI IMR database corresponds to:

    • Plan years 2011 through 2023.
    • 4,719 Independent Medical Reviews.
    • Individual and group health plans.
    • Self insured and fully insured employer plans.
    • IMRs associated with plans regulated by the California Department of Insurance.

    The DMHC IMR database corresponds to:

    • Plan years 2001 through 2023.
    • 34,615 Independent Medical Reviews.
    • Individual and group health plans.
    • Self insured and fully insured employer plans.
    • IMRs associated with plans regulated by the California Department of Managed Healthcare.

    In this article we discard the subset of both datasets corresponding to claims from 2023; such claims correspond to an incomplete plan year, and they have the potential to introduce confounding inconsistencies into our analyses. We initially included them in our analyses, but found they led to easily misinterpreted results.

  • Connecticut Denial Data From Consumer Report Cards

    Dataset Details

    Each year Connecticut's Department of Insurance releases a report, the so called Consumer Report Card on Health Insurance Carriers. The report includes various aggregate statistics about health insurance plans, including utilization for certain services, member satisfaction, and aggregate denial statistics.

    The yearly data is released as a published PDF report, and corresponds to all insurance plans sold by Connecticut issuers except those that are government sponsored; in other words, all CT health insurance enrollees other than those with government sponsored plans (e.g. Medicaid and Medicare plans) are represented. As such, the data pertains to both individual and group health insurance plans, including self insured and fully insured employer plans.

    In this article we focus on the 2022 report, which details claims from the 2021 plan year.

    The data corresponds to coverage for:

    • Plan year 2021.
    • 11 unique insurers.
    • 1,991,903 consumers.
    • HMO and indemnity plans.
    • Individual, small group and large group plans.
    • Self insured and fully insured plans.
  • Pennsylvania Department of Insurance claims denial data (New Data Release)5

    Dataset Details

    Pennsylvania utilized the federal marketplace for qualified health plans until the end of 2020, when it transitioned to using its own newly formed marketplace, known as Pennie, in accordance with the passage of PA House Bill 3. Beginning in 2021, consumers could buy qualified health plans on the Pennsylvania marketplace.

    We submitted a public records request under the Pennsylvania Right to Know Law to the Pennsylvania Insurance Department (PA DOI) asking for a comprehensive analysis of various aggregate statistics pertaining to claims adjudication for which we believed they were likely to have records. This dataset is the result of that request. While the data is not as comprehensive or extensive as we had hoped might be possible, it is a step forward. We are releasing the raw data we obtained, as well as parsed and preprocessed adaptations of it, to the public along with the release of this article.

    The dataset contains aggregate counts for claims, denials, and internal and external appeals broken down by issuer for claims corresponding to marketplace plans, plus an undisclosed additional population of plans, from plan years 2020 and 2021. In addition, the data includes claims, denials and denial rationales broken down by plan for the subset of claims corresponding specifically to marketplace plans (this subset constitutes a large majority of the dataset to begin with).

    The data obtained from the PA Department of Insurance corresponds to:

    • Plan years 2020 and 2021.
    • 13 unique insurers.
    • 599 unique plans.
    • At least 337,722 consumers.
    • 21,646,696 claims adjudicated
    • 2,964,421 claims denials.
    • 2,751 internal appeals.
    • 103 external appeals.
    • Individual marketplace plans.
    • Possibly additional group plans regulated by the PA Department of Insurance.

An overview of the data we analyze in this report is detailed below.

Data Source (plan years considered)Market Segment RepresentedNumber of DenialsDenial Levels RepresentedConsumer Population Represented
CMS TIC PUF (2021)Federal marketplace48,302,001All8,251,703
NY Health Care Claims Reports (2022)All commercial, Medicaid managed care, Essential Plan, and Child Health Plus plans25,996,601AllUnknown, but > 5,000,000 6
NY External Appeals Database (2019-2022)NY marketplace, fully insured group plans, and Medicaid managed care plans24,728Externally Appealed (IMR)Unknown, but > 5,000,000 6
WA External Appeals Database (2016-2022)WA marketplace8,774Externally Appealed (IMR)222,731
CA DOI External Appeals Database (2011-2022)CA marketplace, and fully insured group plans4,719Externally Appealed (IMR)852,506 7
CA DMHC External Appeals Database (2001-2022)Large group and self funded plans regulated by DMHC34,615Externally Appealed (IMR)23,474,332 8
Connecticut Consumer Report Card (2021)All CT issuers (excluding government sponsored)2,708,724All1,991,9039
Pennsylvania Department of Insurance Data (2020, 2021)PA marketplace plans, plus some subset of Department of Insurance regulated group plans2,964,421AllUnknown, but > 337,722 10

To our knowledge, this is the first report that compiles aggregate results across a wide array of disparate sources containing claims denial data from different market segments, and the first public report to analyze claims denial data from the Pennsylvania marketplace. The varied nature of the data makes it difficult to make direct comparisons across datasets, but we believe it is still informative to witness the extent to which certain phenomenology occurs consistently across these datasets, despite the varied contexts and regulatory settings in which the data is produced.

Although we collect and analyze data from many sources, the data is far from complete, even among any fixed market segment such as federal and state ACA exchanges.

We note that we submitted many public records requests seeking data that would allow us to perform a more comprehensive account of all marketplace claims denial data. However, with the exception of Pennsylvania, the state insurance departments to which we submitted requests were unable or unwilling to grant our requests.

The state departments of insurance which did not grant our requests either cited lack of access to such data, or claimed that our (admittedly non-expert) reading of the Department of Health and Human Services (HHS) rule 45 of the Code of Federal Regulations (CFR), part 155, subpart K, (cf CFR 156 subpart C), which suggests to us both that

  1. ACA exchange claims denial data must be made available to the public for all ACA exchanges.
  2. Each state's department of insurance should have access to that data.

is incorrect.

Our interpretation, which of course may be flawed, comes from the following chain of logic:

  • CFR part 155, subpart K, § 155.1040 (a) suggests "The Exchange" must collect transparency data in accordance with CFR part 156, subpart C (specifically § 156.220).
  • CFR part 156, subpart C (specifically § 156.220) suggests that such claims denial data must be furnished by QHP issuers to both exchanges and state insurance commissioners, and that such data must include the number of claims that are denied (by § 156.220 (b) and § 156.220 (a).(5) respectively).
  • CFR part 156, subpart C (specifically § 156.220 (b)) suggests that some entity must make that data available to the public.
  • "The Exchange" in CFR part 155, subpart K seems to be referring to any exchange (whether federal or state run) based on CFR part 155, subpart A specifically, so the points above should apply to all state exchanges, in addition to healthcare.gov.

In at least one case, a response from a state department of insurance to our records request claimed that the CFR sections cited above do not require the State to collect denial data. We do not yet understand a reading of the CFR text that allows for this interpretation.

In addition to failing to gain access to claims denial data for some state marketplaces, we also simply neglected to consider some data that is already public; we did so only for expediency, not lack of interest or relevance. For example, we have not yet incorporated claims denial data made available by Covered California, but plan to incorporate such data into a subsequent iterate of this article in the near future.

Finally, we note that the total number of unique consumers represented across all of the data sets we consider is unknown (see the Consumer Population Represented column in the data summary table above). However, what is clear is that this population is a small fraction of the total population of insured individuals in the U.S. This fact is not surprising, given that many of the datasets we consider correspond to data from highly limited market segments (e.g. individual plans sold on state marketplaces) which comprise a small fraction of all insurance plans. Having claims denial data for the entire population of the insured U.S. population would be invaluable for ensuring accountability and the ability to effectively audit the justness of adjudication processes at large; we make the case for the need for such data in our concluding policy recommendations section.

Submitted Claims and Denials

Four of the datasets we consider here provide information sufficient to compute overall initial denial rates in particular market segments, as well as denial rates broken down across subcategories of those market segments.

These datasets are:

  • CMS TIC PUFs.
  • New York Health Care Claims Reports.
  • Connecticut Consumer Report Cards.
  • Pennsylvania Department of Insurance Data.

In what follows, we attempt to provide common views of the disparate datasets to facilitate considerations about comparison when possible, but we warn the reader that direct comparisons should not assumed to be valid, as these datasets are reported according to inconsistent methodologies. In fact, we don't even have temporal overlap of the plan years reported for all of the datasets. For example, the NY Health Care Claims Reports have only been reported since plan year 2022, while the CMS TIC PUFs have been regularly published for many years, but have yet to be made public for plan year 2022.

Despite the inconsistencies in reporting, all of the data suggests that denials are generally prevalent, regardless of how one slices the data.

We will now walk through calculations of overall claims denial rates in each dataset that allows for such a calculation. We perform these broad, general calculations to serve as a background for all subsequent calculations, and provide context for the scale of the issues at play. After computing the overall denial rates, we will consider how denial rates break down in various ways: by denial associated denial rationale, by insurer, and by state or region, to name a few.

Overall Denial Rates

First we consider overall denial rates for each of the datasets.

CMS Federal Marketplace PUFs

On the federal marketplace, 16% of all claims submitted in 202111 were initially denied. This data is displayed in Figure 1.

Figure 1. Aggregate denial rate across federal marketplace insurers in 2021. Approximately 16% of all federal marketplace claims were initially denied in 2021, according to CMS TIC PUF data.

Source: CMS TIC PUF data.

New York Health Care Claims Reports

Among New York insurers, 18% of all claims submitted in 2022 were initially denied. This data is displayed in Figure 2.

Figure 2. Overall denial rate (18%) among all NY issuers who submitted health care claims reports to the Department of Financial Services in 2022.

Source: NY Health Care Claims Reports.

Connecticut Consumer Report Cards

Among Connecticut insurers, 25% of all claims submitted in 2021 were initially denied. This calculation involved throwing out seemingly spurious data reported for the indemnity plans for one insurer, Connecticare (the report indicates Connecticare had more denials of claims than actual claims received). This data is displayed in Figure 3.

Figure 3. Overall claims denial rate in CT consumer report card data.

Source: 2022 CT Consumer Report Card

Pennsylvania Department of Insurance Data

Among Pennsylvania insurers represented in the PA DOI data, approximately 15% of all claims submitted in 2020 were initially denied, while approximately 13% of the claims submitted in 2021 were initially denied. This data is displayed in aggregate in Figure 4.

Figure 4. Overall claims denial rate in the Pennsylvania Department of Insurance data in in 2020 and 2021.

Source: Pennsylvania Department of Insurance Data

Denials By Rationale

When insurers deny coverage of a claim, they must report their rationale for the denial. The rationales, and the details of how they must be reported to both the insured and regulatory agencies, vary by plan, but typically include a small set of mutually exclusive labels. In each of the datasets that provide denial counts, some metadata regarding the denial rationales is included, though the details vary by dataset.

CMS Federal Marketplace PUFs

Federal marketplace plans included in the CMS TIC PUF data sometimes report rationales for claim denials. Figure 5 shows the categories allowed in the CMS PUF reporting methodology, and their frequency of occurrence in the subset of data which includes rationales.

Most denials logged in the CMS PUF data are associated with a rationale of "Other". This provides little information about the underlying nature of the denials.

Figure 5. Distribution of claims denial rationales in a subset of the TIC PUF data. Denial rationales are reported at the plan level, and we only included here those plans which reported denial rationales which either summed to the total number of denials reported, or which summed to the total number of denials reported excluding out of network denials. For some plans, neither validation method held, and we could not resolve what standard was being used to report the rationale data; that rationale data is not included. Source: CMS TIC PUF data.

Effectively, this reporting scheme allows insurers to withhold reporting the actual reason for a denial to the public if they so choose, since it is not completely clear what is allowed to be reported in the "Other" bucket, and there is no strict validation of the reported data. Whether or not insurers actually abuse the system in this way cannot be gleaned from this data alone.

New York Health Care Claims Reports

The New York Health Care Claims Reports detail the rationales for all of the denials documented in the data, but allow for a different set of available options for denial rationales. These rationales, and their relative frequency of occurrence within the data, are shown in Figure 6.

Figure 6. Distribution of denial rationales for all denials recorded in the health care claims reports submitted to the Department of Financial Services in 2022.

Source: NY Health Care Claims Reports.

Again, we see that an ambiguous "Other" category is utilized heavily in the reporting. Furthermore, while many of the rationale categories in this dataset are well-defined terms that have consistent, standard meaning across the industry (see e.g. a typical definition of step therapy), others are ambiguous.

For example, the "Not Covered or Limitation" label here is our abbreviation of a label specified in the raw data as "Not a covered benefit/Exceeds benefit limits (e.g., visit limits)", which is a broad category that appears to allow for many interpretations, similar to the "Other" category. It would be useful to understand more explicitly exactly which subset of denials are being recorded in the "Not Covered or Limitation" category, as opposed to the "Other" category, or the "NMN or Experimental" category, since there is ambiguity and conceptual overlap in each of these concepts. For example, claims deemed to be not medically necessary or experimental are typically a subset of services that are explicitly considered 'not covered', according to language in insurance contracts.

Connecticut Consumer Report Cards

The Connecticut consumer report cards detail rationales for every denial recorded in the data, but again use a different set of categories, and make liberal use of an "Other" category. The data is displayed in Figure 7.

Figure 7. Distribution of denial rationales for all denials in the CT consumer report card data. Source: 2022 CT Consumer Report Card.

Pennsylvania Department of Insurance Data

The Pennsylvania Department of Insurance data details rationales only for denials recorded at the plan level in the data (which is the subset corresponding to marketplace claims), but the story remains the same in that the set of permissible rationales in the data is distinct from each other set we've considered. The data is displayed in Figure 8.

Figure 8. Distribution of denial rationales for all denials in the PA DOI data.

Source: Pennsylvania Department of Insurance Data.

'Not Medically Necessary' Denial Rates

Each of the datasets considered provides a category of denial rationale corresponding to purported lack of medical necessity or experimental nature of a service (what we will collectively call NMN denials).

While NMN denials occur relatively infrequently, as Figures 5, 6, 7 and 8 show, they still correspond to a large population of claims in an absolute sense; just for the tiny fraction of all insurance plans covered by the data shown above, for a single plan year, there are more than 700k such claims. Furthermore, they are an important subset of denials in the sense that:

  • Their contractual merit is typically subtle, because it typically involves contractual definitions that themselves reference things like clinical policy bulletins, scientific literature, medical standards, or other external data.
  • The consequences for such denials can be perilous for patients relying on coverage of treatments for debilitating issues that they would not otherwise be able to afford. This can be true for other rationale categories as well, depending on the categories considered, but is not the case for all categories (e.g. duplicate claims).

For these reasons, we draw attention to aspects of the data pertaining specifically to NMN denials, when possible. Unfortunately, the data is often insufficient to allow much to be ascertained definitively about this subpopulation of denials.

One thing we can understand definitively about the subpopulation of NMN denials in these datasets is the rate at which they occur among all claims; these rates can be deduced by reconsidering Figures 5, 6, 7 and 8, but we make the rates explicit in the table below.

Data Source (plan years considered)NMN DenialsClaims Received With RationaleNMN Denial Rate
CMS TIC PUF (2021)520,14013,157,832.0395
NY Health Care Claims Reports (2022)177,88134,168,504.0052
Connecticut Consumer Report Card (2021)27,5172,006,049.0137
Pennsylvania DOI Data (2020)104,39610,451,280.0099
Pennsylvania DOI Data (2021)66,0319,494,221.0069

Unsurprisingly we see that NMN denial rates are orders of magnitude smaller than overall denial rates; however, it is interesting to note that there is significant variation in the overall NMN denial rate across these datasets. One contributing factor is surely the lack of standardization in reporting methodologies, such as the variation in the set of denial rationales permissible in the reported data. It is unclear to what extent the actual denial rates relating to medical necessity would vary across these datasets if one were to correct for reporting inconsistencies.

Denials By Insurer

Each of the datasets reporting denials includes breakdowns of the claims received and claims denied by insurer. It is informative to understand how the aggregate denial rates and NMN denial rates vary by insurer within each dataset.

We note at the outset that while it is informative to understand how various distributions vary by insurer, there are many subtleties lurking behind such breakdowns, just as there were for denial rationales. For example, different insurers may have different distributions of plan network types (e.g. relative fraction of consumers on HMO, EPO, PPO, and POS plans), and as a result insurer denial differences might partially be a reflection of the differences in those plan type distributions. Or, alternatively, two insurers might have identical plan type distributions and administration practices, but have a different list of drugs for which they require prior-authorization, with one list containing some much more commonly utilized drugs. Given such subtleties which the data does not allow us to address, it is difficult to definitively understand the cause of the phenomenology we observe in differences across insurers.

In the same vein, the distribution across insurers of denials whose rationale specifically involves the experimental nature of a treatment, rather than another medical necessity rationale, could be influenced by many things, in addition to the raw counts of claims. In particular, one would expect that wealthy and renowned hospital systems conducting cutting edge research would be more likely than others to attempt truly experimental or novel services for their patients in need, and that such properties of a hospital are correlated with other properties, such as proximity to a major city and research center. As a result, health insurance companies with networks that overlap certain geographic areas, or that include certain hospitals, might receive a distribution of claims with a relatively high share that could be interpreted as involving experimental treatments, when compared to distributions received by other insurance companies. In such a context, it would be inappropriate to make an inference about how experimental claims denial processes vary by insurer from experimental claims denial rates alone.

As with all of the data we examine in this article, there are many subtleties that make it difficult to draw conclusions without more information. We make suggestions to address this troubling predicament in the concluding recommendations section.

CMS Federal Marketplace PUFs

Figures 9, 10 and 11 show the overall claims counts, denial rates, and NMN denial rates for the 10 issuers in the CMS TIC PUF Federal Marketplace data with the highest claims volume.

Figure 9. Claims received for the 10 issuers with the largest number of claims adjudicated in the 2021 CMS TIC PUF data.

Source: CMS TIC PUF data.

Figure 10. Overall claims denial rates for the 10 issuers with the largest number of claims adjudicated in the 2021 CMS TIC PUF data.

Source: CMS TIC PUF data.

Figure 11. NMN (including experimental) denial rates for the 10 issuers with the largest number of claims adjudicated in the 2021 CMS TIC PUF data.

Source: CMS TIC PUF data.

Among the largest federal marketplace issuers of 2021, as measured by claims received, Blue Cross Blue Shield of South Carolina has an NMN denial rate roughly two to three orders of magnitude larger than the others. It is unclear what drives this phenomenon.

Figure 12 shows a heatmap displaying the distribution of denial rationales among these same highest volume insurers.

Figure 12. Distribution of denial rationales for the 10 issuers with the largest number of claims adjudicated in the 2021 CMS TIC PUF data. The color indicates the fraction of all denials for a given insurer that were coded with a particular rationale (i.e. rows sum to 1). Source: CMS TIC PUF data.

New York Health Care Claims Reports

Figures 13, 14 and 15 show the overall claims counts, denial rates, and NMN denial rates for the 10 issuers in the New York Health Care Claims Reports data with the highest claims volume.

Figure 13. Claims received for the 10 issuers with the largest number of claims adjudicated in the 2022 Department of Financial Services claims report data.

Source: NY Health Care Claims Reports.

Figure 14. Overall claims denial rates for the 10 issuers with the largest number of claims adjudicated in the 2022 Department of Financial Services claims report data.

Source: NY Health Care Claims Reports.

Figure 15. NMN (including experimental) denial rates for the 10 issuers with the largest number of claims adjudicated in the 2022 Department of Financial Services claims report data.

Source: NY Health Care Claims Reports.

Figure 16 shows a heatmap displaying the distribution of denial rationales among these same highest volume insurers.

Figure 16. Distribution of denial rationales for the 10 issuers with the largest number of claims adjudicated in the 2022 Department of Financial Services claims report data. The color indicates the fraction of all denials for a given insurer that were coded with a particular rationale (i.e. rows sum to 1).

Source: NY Health Care Claims Reports.

Connecticut Consumer Report Cards

Figures 17, 18 and 19 show the overall claims counts, denial rates, and NMN denial rates for the 10 issuers in the Connecticut Consumer Report Card data with the highest claims volume.

Figure 17. Claims received for all issuers in the CT consumer report card data.

Source: 2022 CT Consumer Report Card

Figure 18. Overall claims denial rates for all issuers in the CT consumer report card data.

Source: 2022 CT Consumer Report Card

Figure 19. NMN (including experimental) denial rates for all issuers in the CT consumer report card data.

Source: 2022 CT Consumer Report Card

Figure 18 shows odd and potentially spurious data that indicates one set of issuers, 'ConnectiCare Benefits' and 'ConnectiCare', have extremely high claims denial rates above 85%. We suspect this is a reporting error in the raw data, but we show it here because we cannot be sure of that fact. We were unable to find further information about this anomaly.

Figure 20 shows a heatmap displaying the distribution of denial rationales among these same highest volume insurers.

Figure 20. Distribution of denial rationales for all issuers in the CT consumer report card data. The color indicates the fraction of all denials for a given insurer that were coded with a particular rationale (i.e. rows sum to 1).

Source: 2022 CT Consumer Report Card

Pennsylvania Department of Insurance Data

Figures 21, 22 and 23 show the overall claims counts, denial rates, and NMN denial rates for the 10 issuers in the Pennsylvania Department of Insurance Data data with the highest claims volume.

Figure 21. Claims received for all issuers in the PA DOI data.

Source: Pennsylvania Department of Insurance Data.

Figure 22. Overall claims denial rates for all issuers in the PA DOI data.

Source: Pennsylvania Department of Insurance Data.

Figure 23. NMN (including experimental) denial rates for all issuers in the PA DOI data.

Source: Pennsylvania Department of Insurance Data.

Figure 24 shows a heatmap displaying the distribution of denial rationales among these same highest volume insurers.

Figure 24. Distribution of denial rationales for all issuers in the PA DOI data who report denial rationales. The color indicates the fraction of all denials for a given insurer that were coded with a particular rationale (i.e. rows sum to 1).

Source: Pennsylvania Department of Insurance Data.

Dataset Specific Considerations

Since the datasets do not report claims and denial information in a standard or consistent way, there are many interesting considerations that can be made for only some of the datasets. Here we present dataset specific considerations relating to claims and denial distributions.

CMS Federal Marketplace PUFs

The CMS TIC PUFs provide many pieces of metadata associated with individual plans, and their corresponding aggregate claims data, that allow denials to be broken down across additional categories.

At a high level, we can calculate the overall denial rate for each issuer (an insurer's state specific entity), by calculating the ratio of all claims denied by that issuer, to the claims received by that issuer (across all their plans). Figure 25 shows the distribution of such issuer denial rates across issuers represented in the 2021 CMS TIC PUF data. The distribution is bimodal, with a large majority of issuers comprising a contingency with aggregate denial rates between 0% to 30%, and a small contingency of issuers maintaining aggregate denial rates between 35% to 50%. While we could compute such a distribution for the other datasets as well, the number of issuers involved is too small to infer much from the resulting distribution. Here, however, the distribution sheds some light on how denial rates vary by issuer.

Figure 25. Distribution of claims denial rates across federal marketplace issuers in 2021. Across issuers, denial rates vary from approximately 0% to 50%.

Source: CMS TIC PUF data.

Each plan reported in the data also has metadata associated with it. Figure 26 contains two interactive plots that show the distribution of individual plan denial rates, as a function of two properties of plans:

  • Plan administration and network type

    Health insurance plans utilize different network and administration policy types, which dictate things like which providers are considered part of your plans preferred network, and how you must seek certain types of care and services (e.g. by obtaining referrals for specialist visits). Some common policy types include Exclusive Provider Organizations (EPOs), Health Maintenance Organizations (HMOs), Point of Service (POS), and Preferred Provider Organizations (PPOs).

  • Metal level

    Federal marketplace plans are partitioned according to so-called plan categories, which are mostly named by metals. The categories are Catastrophic, Bronze, Bronze Expanded, Silver, Gold and Platinum. Generally, plans that fall into a more precious metal category have higher premiums, but cover a larger fraction of the cost of care when coverage kicks in. Catastrophic plans, as the name suggests, typically only provide coverage for catastrophic situations.

The data show that

  • A larger fraction of EMO and HMO plans have denial rates between 30% to 50% than do PPO and POS plans.
  • Only platinum level plans completely lack a contingent of 30% to 50% denial rate constituents.

Denial Rates by Plan Type, Federal Marketplace (2021)



Denial Rates by Metal Level, Federal Marketplace (2021)


Figure 26 (interactive). Distribution of claims denial rates broken down by plan types, and by plan metal levels. Click on a plan type or plan metal level to see the corresponding distribution of denial rates.

Source: CMS TIC PUF data.

Figure 27 shows the average issuer level claims denial rates for each state or region represented in the CMS TIC PUF data, as well as various statistics of the data within each region. For example, we display the number of issuers represented in each state, the total number of claims represented for each state, and the issuers with the highest claims denial rates in each state.

Denial Rates By Region, Federal Marketplace (2021)

Figure 27 (interactive). Claims denial rates and other data attributes broken down by region. Hover over regions for more detailed information.

Source: CMS TIC PUF data.

We also include a table summarizing the data included in this interactive plot:

Table View
State or RegionOverall Denial RateClaims ReceivedClaims DeniedNumber of Consumers ReportedNumber of Insurers Reported
Alaska0.235611313143836167072
Alabama0.224765706117169931687172
Arkansas0.185151809582808497596295
Arizona0.19236121186949761154096
Florida0.131755884079868686186756810
Georgia0.2891082514931279633874416
Hawaii0.244486179118578163432
Iowa0.1492828360421458406993
Illinois0.1761075553118939052483637
Indiana0.2364233263997114916253
Kansas0.1352333613314062437165
Louisiana0.1515040194761581782364
Michigan0.205778368715990481383988
Missouri0.191617394411821811130348
Mississippi0.3811411877537222507692
Montana0.0961379424132490412333
North Carolina0.1252644664032957782842695
North Dakota0.042187345077936218033
Nebraska0.1023305776338692486192
New Hampshire0.1841436531264608343603
Ohio0.2175795555125580013834910
Oklahoma0.163898642614619981386954
Oregon0.0719999671396051113436
South Carolina0.181050215218856411793173
South Dakota0.05777681844532254192
Tennessee0.181119645120111821406726
Texas0.20435256067720116559107410
Utah0.1381008629113910501519056
Virginia0.164922411215086962246059
Wisconsin0.117595208169410512886313
West Virginia0.16288337714310070692
Wyoming0.1281157244148487267772

Among this federal marketplace data, Mississippi stands out for having a uniquely high aggregate denial rate, with Texas, Alabama, Alaska, Georgia, Ohio, Indiana and Michigan forming a contingent with the next highest rates.

In many regions, a relatively small collection of issuers contribute a majority of the claims denied.

New York Health Care Claims Reports

The New York Health Care Claims Reports provide a different set of metadata associated with individual insurers, and their corresponding aggregate claims data, that allow denials to be considered from different lenses. The claims reports also include information relating to the monetary value of claims, which we consider in more detail below.

The reports include information that details how claims, denials, and appeals are broken down across different provider categories. The categories specified in the data are:

  • Hospital Inpatient
  • Hospital Outpatient
  • Other Facilities Inpatient
  • Physicians
  • Other Health Care Providers
  • Pharmacy
  • Other

Of note, the "Hospital Outpatient" category includes all claims submitted for emergency room services, and the "Other Facilities Inpatient" category includes all claims submitted for inpatient rehabilitation or skilled nursing facility services.

Figure 28 shows the overall claims counts occurring in the dataset for each category, while Figure 29 shows the overall claims denial rates in each category, aggregated over all insurers in the dataset.

Figure 28. Claims counts broken down by provider category, aggregated across all insurers who submitted health care claims reports to the Department of Financial Services in 2022.

Source: NY Health Care Claims Reports.

Figure 29. Denial counts broken down by provider category, aggregated across all insurers who submitted health care claims reports to the Department of Financial Services in 2022.

Source: NY Health Care Claims Reports.

The relative frequency of denials across these categories is somewhat consistent with the distribution of claims received across the categories, but denial rates vary from 10% to 30% across the categories.

We can also observe how NMN denials specifically are broken down by provider category. This is shown in Figure 30.

Figure 30. Total claims denied for medical necessity or experimental nature of a treatment, for each provider category, aggregated across all insurers who submitted health care claims reports to the Department of Financial Services in 2022.

Source: NY Health Care Claims Reports.

The New York Health Care Claims Reports are unique in that they also provide detailed breakdowns of the monetary value associated with many of the aspects of the data they report. Generally, the data reports the value of claims in various subcategories via two mechanisms:

  • Allowed Amounts

    An allowed amount for a service under a given plan is the maximum monetary value that the insurance plan will pay for a covered service.

  • Billed Charges

    A billed charge is the actual monetary value that a service provider charges a patient or insurer; this amount need not agree with the allowed amount for a plan.

Figure 31 shows the collective value of NMN denials in the dataset, as measured by the total allowed amounts. Allowed amounts for NY NMN denials accounted for over 35M USD in 2022.

Figure 31. Total allowed amounts for claims denied for medical necessity or experimental nature of a treatment, for each provider category, aggregated across all insurers who submitted health care claims reports to the Department of Financial Services in 2022.

Source: NY Health Care Claims Reports.

The distributions for both NMN denial counts and corresponding allowed amounts across provider categories look largely similar to those for all denials, with two notable exceptions.

  • There are very few Pharmacy denials for medical necessity in this data. One contributing factor for this is that the NMN category in this data does not include denials requiring the use of step therapy. While this is a separate denial category in this data, the categories are similar in that they involve a determination about purportedly medically appropriate treatments. Another contributing factor to the relative scarcity of pharmacy NMN denials is the reporting methodology: pre-authorization denials are not included in this data, which is one context in which one would expect NMN pharmacy denials to be occurring.
  • Among NMN denials, Hospital Outpatient allowed amounts far outweigh those of Hospital Inpatient, unlike in the distribution among denials of all types.

While there are likely many factors driving the latter result, it is worth noting that generally data suggests that hospital revenue share from outpatient procedures has been consistently increasing, with a large share of orthopedic and minor surgeries occurring in outpatient settings. Collectively, these trends to perform certain types of services and treatments more frequently in outpatient vs inpatient settings, coupled with the distribution of purported medical necessity of services within these disciplines, as dictated by insurers, might explain why NMN denials for outpatient services correspond to a high total allowed amount, relative to the inpatient NMN denials.

We can also consider the proportion of denials in each provider category that correspond to each denial rationale. Figure 32 shows a heatmap displaying these distributions.

Figure 32. Distribution of denial rationales for each provider category in the 2022 Department of Financial Services claims report data. The color indicates the fraction of all denials in a given provider category that were coded with a particular rationale (i.e. rows sum to 1).

Source: NY Health Care Claims Reports.

While it is difficult to extract detail about the distributions from Figure 32 given the prevalence of "Other" rationale reporting, some trends are noticeable. For example, among the Pharmacy denials in the dataset that are not recorded under the "Other" category, a large proportion are coded as denials related to non-covered or limited services.

Connecticut Consumer Report Cards

The Connecticut Consumer Report Cards provide a few types of information that are not present in the other datasets. Most notably for our purposes, the reports include detailed breakdowns of how claims counts and enrollment are split across different market segments, such as plan type, plan funding type, and insurer. We display a few statistics relating to enrollment in what follows.

All claims data in the consumer report corresponds to a certain population of consumers. Understanding that population is highly relevant, as it serves as an important denominator for considerations about the average impact of claims denials on each person. For example, one might be interested in understanding the average number of medical necessity claims denials received by an individual member of an insurance plan over the course of a year. This sort of calculation requires an understanding of not just raw claims and denial numbers, but also enrollment, associated with a plan.

As we noted previously, the entire dataset corresponds to 1,991,903 insured people. Figure 33 shows how that total enrollment is broken down across the insurers represented in the data.

Figure 33. Enrollment in CT consumer report card broken down by insurer.

Source: 2022 CT Consumer Report Card

Figures 34 and 35 respectively display how the total enrollment is split across:

  • Individual, small group, and large group plans

    Individual plans are plans purchased on an individual basis, for example through the federal or a state based Affordable Care Act (ACA) marketplace. In the case at hand, this means plans bought through Connecticut's marketplace, Access Health Connecticut. Group plans are plans administered for a group of employees or other beneficiaries; the term small group has technical meaning that varies by state, but typically refers to employer group health plans for employers with 50 or fewer employees.

  • Self insured and fully insured plans

    Self insured plans are group plans in which premiums are collected by, and covered services for the insured are paid for by, an employer itself. Such plans need not take on the administrative burden of insurance coverage, and often outsource that aspect of insuring to large, well-known insurance companies (often referred to as third party administrators, or TPMs, when they are employed in this role). Fully insured plans are plans purchased by an employer.

Figure 34. Enrollment in CT consumer report card broken down by group type. The data are partitioned according to individual, small group, and large group plans.

Source: 2022 CT Consumer Report Card

Figure 35. Enrollment in CT consumer report card broken down by payer type. The data are partitioned according to fully insured plans and self funded plans.

Source: 2022 CT Consumer Report Card

The data from Connecticut is unique in that it includes a large population of large group health plans, which together with the other data provides for a relatively comprehensive view of the total insured population in the state. It would be invaluable to have such comprehensive data for all states.

Summary of Findings

In studying claims and denials in each of the datasets considered, we've observed a few key points that are consistent across market segments and marketplaces.

  • Denials are extremely prevalent, with claims denial rates consistently hovering between 10% and 30%.
  • Reporting standards are insufficient to understand the landscape of denials; most denial rationales are recorded as "Other", which is inadequate for understanding how and why denial rates vary across insurers, provider categories, and other partitions of claims.
  • Denials pertaining to medical necessity or the experimental nature of treatments comprise a small fraction of all denials, but still account for a large absolute pool of claims denials, and money.

Informed by these findings, we discuss some proposals for improving patient outcomes in the recommendations section below.

Internal Appeals

Next we consider the occurrence of internal appeals in the data. The same datasets that provide information sufficient to compute raw claims denial rates provide information about internal appeals. We again consider these datasets:

  • CMS TIC PUFs.
  • New York Health Care Claims Reports.
  • Connecticut Consumer Report Cards.
  • Pennsylvania Department of Insurance Data.

Despite the inconsistencies in reporting across the datasets, the data is striking in that it consistently makes clear that while appeals of all forms are extremely rare among consumers, internal appeal overturn rates are high. We note that this disparity between the fraction of denials being internally appealed and the fraction of those appeals being internally overturned could be rooted in various phenomena12.

Overall Internal Appeal Rates

CMS Federal Marketplace PUFs

The CMS PUFs report the number of internal appeals submitted for denied claims only at the issuer level (so there is not enough information to determine the internal appeal rate for a particular plan).

Analyzing this data is more subtle than it ought to be, because not all issuers in the dataset that report their denial counts report their internal appeals counts, and not all issuers that report their internal appeal counts report their external appeal counts. As a result, simply calculating the ratio of the number of internal appeals reported in the data to the number of denials recorded in the data would only give a lower bound on the internal appeal rate. To address this, we report internal appeal rates by restricting attention to those plans that report both denials and internal appeals.

Among 2021 federal marketplace plans, the average initial appeal rate is about .2%, while the internal appeal overturn rate among this same population of internally appealed claims is about 41%. This data is shown in Figure 36.

Figure 36. Internal appeal rates and appeal overturn rates among federal marketplace issuers in 2021. Only 1/5 of a percent of the approximately 48 million initially denied claims are appealed, but of those that are, 41 percent get overturned on a first level internal review.

Source: CMS TIC PUF data.

New York Health Care Claims Reports

The New York Health Care Claims Reports document the number of internal appeals submitted for each insurer in the data. Figure 37 shows the fraction of all initially denied claims that are internally appealed by consumers, and the fraction of those that are overturned on initial appeal. The initial appeal rate is about .6%, while the internal appeal overturn rate among these claims is about 29%.

Figure 37. Internal appeal rates and appeal overturn rates among insurers who submitted health care claims reports to the Department of Financial Services in 2022. Only 1/2 of a percent of the approximately 25 million initially denied claims are appealed, but of those that are, 29 percent get overturned on a first level internal review.

Source: NY Health Care Claims Reports.

Connecticut Consumer Report Cards

The Connecticut Consumer Report Cards document the number of internal appeals submitted for each insurer in the data. Figure 38 shows the fraction of all initially denied claims that are internally appealed by consumers, and the fraction of those that are overturned on initial appeal. The initial appeal rate is about .4%, while the internal appeal overturn rate among these claims is about 60%.

Figure 38. Overall internal appeal rates in the CT consumer report card data. Only .4 percent of the approximately 2 million initially denied claims are appealed, but of those that are, 60 percent get overturned on a first level internal review.

Source: 2022 CT Consumer Report Card

Pennsylvania Department of Insurance Data

Figure 39 shows the fraction of all initially denied claims that are internally appealed by consumers, and the fraction of those that are overturned on initial appeal in the 2020 and 2021 PA DOI data. The initial appeal rate is about .1%, while the internal appeal overturn rate among these claims is about 59%.

Figure 39. Overall internal appeal rates in the PA DOI data. Only .1 percent of the approximately 3 million initially denied claims are appealed, but of those that are, 59 percent get overturned on a first level internal review.

Source: Pennsylvania Department of Insurance Data.

Internal Appeals by Insurer

As we noted above, each of the datasets provides enough information to compute appeal rates and appeal overturn rates on an individual insurer basis. In what follows we share some such calculations.

CMS Federal Marketplace PUFs

Figures 40 and 41 show the internal appeal rates and internal appeal overturn rates for the 10 insurers with the largest volume of claims adjudicated in the 2021 CMS PUF data.

Figure 40. Internal appeal rates for the 10 insurers with the largest number of claims adjudicated in the 2021 federal marketplace data.

Source: CMS TIC PUF data.

Figure 41. Internal appeal overturn rates for the 10 issuers with the largest number of claims adjudicated in the 2021 federal marketplace data.

Source: CMS TIC PUF data.

New York Health Care Claims Reports

Figures 42 and 43 show the internal appeal rates and internal appeal overturn rates for the 10 insurers with the largest volume of claims adjudicated in the NY claims report data.

Figure 42. Internal appeal rates for the 10 insurers with the largest number of claims adjudicated in the 2022 Department of Financial Services claims report data.

Source: NY Health Care Claims Reports.

Figure 43. Internal appeal overturn rates for the 10 insurers with the largest number of claims adjudicated in the 2022 Department of Financial Services claims report data.

Source: NY Health Care Claims Reports.

Connecticut Consumer Report Cards

Figures 44 and 45 show the internal appeal rates and internal appeal overturn rates for all insurers in the Connecticut consumer report card data.

Figure 44. Internal appeal rates for insurers in the 2022 Connecticut Consumer Report Card Data.

Source: 2022 CT Consumer Report Card

Figure 45. Internal appeal overturn rates for issuers in the 2022 Connecticut Consumer Report Card Data.

Source: 2022 CT Consumer Report Card

Pennsylvania Department of Insurance Data

Figures 46 and 47 show the internal appeal rates and internal appeal overturn rates for all insurers in the Pennsylvania Department of Insurance data.

Figure 46. Internal appeal rates for 10 insurers with the most claims in the PA DOI data.

Source: Pennsylvania Department of Insurance Data.

Figure 47. Internal appeal overturn rates for 10 insurers with the most claims in the PA DOI data.

Source: Pennsylvania Department of Insurance Data.

Dataset Specific Considerations

The considerations above exhaust most of the analysis on internal appeals that can be done in a common way across the datasets reporting internal appeal counts. In what follows, we consider a few additional aspects of the data reported in only some of the datasets.

New York Health Care Claims Reports

The data recorded in the New York Health Care Claims Reports allow us to investigate how internal appeal rates and internal appeal overturn rates vary by provider category. Figure 48 shows the overall claims denial rates, internal appeal rates, and internal appeal overturn rates for each of these categories, aggregated over all insurers in the dataset.

Figure 48. Denial, internal appeal, and internal appeal overturn rates across provider categories in the 2022 New York Health Care Claims Reports.

Source: NY Health Care Claims Reports.

While all provider categories have relatively high internal appeal overturn rates, initial denial rates and internal appeal rates vary widely across the categories. It is noteworthy that while claims for inpatient hospital visits occupy a small fraction of the total claims covered in the dataset (compare Figure 28), denial rates for those claims are high at 24%, and internal appeal rates are two to six times larger than those occurring among the other categories. Despite this increased appeal rate, appeal overturn rates remain high at 27%.

This empirical phenomenon is informed by considering the underlying costs associated with the claims in each of these provider categories. The cost data, displayed in Figure 49, shows that the sum of the allowed amounts for internally appealed 'Inpatient Hospital' claims is greater than the sum for any other category. Moreover, consumers collectively recouped roughly 60% of the entire monetary value of these appealed 'Inpatient Hospital' claims, despite having an appeal overturn rate of only 27%.

Figure 49. Reported total allowed amounts for internally appealed claims, and appeal overturns, broken down by provider category in the 2022 New York Health Care Claims Reports.

Source: NY Health Care Claims Reports.

To summarize, consumer behavior in successfully appealing Hospital Inpatient denials led to the overturn of a very lucrative portion of the internally appealed denials in the dataset; this is a fact that is not visible when solely considering denial or appeal rates, but it is consistent with the intuition that the most expensive contractually inappropriate denials are the ones most likely to be appealed by consumers or their providers.

Another contributing factor that may be driving this phenomenon is that hospitals often have significant resources provisioned to address contractually inappropriate denials. Hospitals routinely have departments dedicated to insurance claims resolution, and pay full time staff to navigate claims submission and appeals pertaining to a wide variety of issues. It is natural to expect that professionals with ample experience navigating appeals, working for organizations whose revenue and patients depend on successful claims resolution, would both appeal more frequently than would individual consumers, and fare better in their appeal overturn rates.

Connecticut Consumer Report Cards

The data recorded in the Connecticut Consumer Report Cards allows one to consider internal appeal rates and internal appeal overturn rates broken down by the rationale provided for the initial denial. This is an incredibly useful lens through which to view the data, and one which is unfortunately unsupported by most of the datasets.

Figures 50 and 51 show the internal appeal rates and internal appeal overturn rates for each denial rationale in the dataset.

Figure 50. Internal appeal rate among the population of claims corresponding to each denial rationale in the 2022 Connecticut Consumer Report Card.

Source: 2022 CT Consumer Report Card

Figure 51. Internal appeal overturn rate among the population of claims corresponding to each denial rationale in the 2022 Connecticut Consumer Report Card.

Source: 2022 CT Consumer Report Card

In this dataset, the internal appeal rate for NMN denials is between 10 to 50 times larger than the appeal rates in all other categories. On the other hand, internal appeal overturn rates for internally appealed NMN denials are among the lowest of all categories.

We note that the high rate of internal appeals for denials whose rationales involve medical necessity, or the purported experimental nature of a treatment or service, has an intuitive potential cause. Medical necessity of particular services or care is not typically made explicit in insurance contracts; instead, broad definitions, which themselves reference literature and other documents, dictate a purported standard by which medical necessity will be determined. Given these definitions are often ambiguous and ill-defined in nature, it makes good sense that cases involving them would be more subtle, and be more likely to be appealed, than clear cut cases.

For example, if a contract explicitly states something is not covered, or a claim is inappropriately submitted and then denied for a member whose coverage had lapsed at the date of service, there is not much ground on which to make an appeal. This same logic explains why the internal appeal overturn rate is 100% for internally appealed claims which were denied purportedly because they were duplicates; whether or not a claim is a duplicate is a clear cut matter of accounting, so any substantiated appeal of a claim denied for being purportedly duplicate is likely to be sound.

Summary of Findings

In studying internal appeals in each of the datasets considered, we've observed a few key points that are consistent across market segments and marketplaces.

  • While claims denials occur frequently, internal appeal rates are vanishingly low; this fact remains true across all categories of denial, so far as can be told from the data which reports sufficient detail to make such a determination.
  • Among internally appealed claims denials overturn rates are high.
  • The monetary value historically recouped in the observed data through internal appeals overturns is large, despite the fact that internal appeals are vanishingly rare.

Informed by these findings, we discuss some proposals for improving patient outcomes in the recommendations section below.

External Appeals

We now turn to external appeals. The datasets that provide information relating to external appeals are:

  • CMS TIC PUFs.
  • California CDI external appeal database.
  • California DMHC external appeal database.
  • New York external appeal database.
  • Washington external appeal database.
  • Pennsylvania Department of Insurance data.

Each of these datasets reports metadata relating to external or independent review of denied claims in a different way. Despite the variations in how the data is reported, the independent sources display at least one consistent result: the population of claims denials which are upheld on internal appeal and then externally appealed are overturned by external review boards at high rates. This is a key fact to take away from these datasets.

This result suggests that internal review processes may not be sufficient for fair or equitable adjudication. If the populations of internal and external reviewers were all perfectly impartial, and shared consistent, repeatable processes for making their determinations, one would expect near perfect agreement between determinations made by the two bodies for the same pool of denied claims.

Instead, we see staggering disagreement in each dataset about which of the externally appealed claims merit overturn. The disagreement cannot conclusively be attributed to faulty or biased internal appeal processes, but it suggests that either internal appeal adjudication is faulty, external appeal adjudication is faulty, or appeal adjudication is too complex to expect a high degree of consistency between any two reviewers, at least for the population of externally appealed claims. The latter explanation appears unlikely given the relatively consistent external overturn rates across all of the different data sources we exhibit, and across many different independent, third party review organizations.

It is possible that if one were to submit all internally appealed claims to third party external review organizations, one would find that among the population of internally appealed claims there would be a high degree of agreement between internal and external reviewers on appropriate determination outcomes. It is of course impossible to surmise what this hypothetical level of agreement would be if the audit were to occur.

However, note that even if this hypothetical test were to result in overwhelming agreement between internal and external reviewers when considering the population of internally appealed claims, such a result would not negate the problematic nature of the actual level of disagreement between internal and external reviewers when considering externally appealed claims denials alone. There is no reason to believe that the population of internally and externally appealed claims are drawn from similar distributions, and as the data clearly shows, the population of internally appealed claims is orders of magnitude larger.

It appears that internal processes may effectively be hoops through which consumers must jump before they are afforded unbiased consideration of their appeals. Historically, most consumers that have navigated their way through an internal appeal process and lost do not (or cannot) proceed with an independent external review, as we will see in the data below.

Overall External Appeal Rates

We begin be examining the overall external appeal overturn rate in each of the datasets.

The CMS PUF TIC files and PA Department of Insurance data are unique among the external appeals data in that they also allow us to calculate the overall external appeal rates themselves. We calculate this as the total number of external appeals in the dataset divided by the corresponding13 total number of upheld internal appeals in the dataset.

All of the other datasets with external appeals data provide information related to submitted external appeals, but do not indicate the corresponding population of upheld internal appeals to which the external appeals correspond. For this reason, we cannot provide an accurate overall external appeal rate in these cases, and instead restrict attention to the overall external appeal overturn rate.

CMS Federal Marketplace PUFs

In Figure 52, we display the overall external appeal rate and external appeal overturn rate from the CMS TIC PUF data.

Figure 52. Overall external appeal rate and overall appeal overturn rate among federal marketplace issuers that reported the data in 2021. Not all issuers and plans reported external appeal data in the CMS PUF. Among the issuers reporting external appeal data, only 5 percent of the approximately 34 thousand denials upheld on internal appeal are externally appealed, but of those, nearly 49 percent get overturned when reviewed by an independent external agency.

Source: CMS TIC PUF data.

Of the 1,760 claims represented in this data that were denied, internally appealed and upheld as denials, and then finally externally appealed, a staggering 49% were deemed by external reviewers to merit overturn. However, 95% of the consumers who had their internal appeals of claims denials upheld on internal review did not proceed with the external review process.

California CDI External Appeal Database

Figure 53 shows the overall external appeal overturn rate over the time period covered by the CDI external appeal dataset, excluding data from 2023.

Figure 53. External appeal overturn rates among California insurers from 2011 to 2022. Roughly 50 percent of denials that are externally appealed get completely overturned when reviewed by an independent external agency.

Source: CDI External Appeal Database.

California DMHC External Appeal Database

Figure 54 shows the overall external appeal overturn rate over the time period covered by the DMHC external appeal dataset, excluding data from 2023.

Figure 54. External appeal overturn rates among California issuers from 2001 to 2022. Roughly 49 percent of denials that are externally appealed get completely overturned when reviewed by an independent external agency.

Source: DMHC IMR database.

We again see roughly 50% disagreement between external reviewers and prior internal review determinations among the population of internally upheld and externally appealed claims denials.

New York External Appeal Database

Figure 55 shows the overall external appeal overturn rate over the time span covered by the New York external appeal dataset, excluding data from 2023.

Figure 55. External appeal overturn rates among New York issuers from 2019 to 2022. Roughly 43 percent of denials that are externally appealed get completely overturned when reviewed by an independent external agency.

Source: NYS External Appeal Database.

In this case, 43% of internally upheld and externally appealed claims denials are deemed to merit overturn by external reviewers.

We note that while one could attempt to create an estimate for the external appeal rate in NY by combining the upheld internal appeal counts from the health care claims reports, or other sources, with the external appeal counts in this dataset, there is insufficient publicly available information to ensure the underlying population of claims in the two datasets is the same. As a result, even the rough accuracy of such an estimate would be unclear.

Washington External Appeal Database

Figure 56 shows the overall external appeal overturn rate over the time period covered by the Washington external appeal dataset, excluding data from 2023.

Figure 56. External appeal overturn rates among OIC regulated Washington health plans from 2016 to 2022. Roughly 25 percent of denials that are externally appealed get completely overturned when reviewed by an independent external agency.

Source: Washington External Appeal Data.

At 25%, the Washington data shows a substantially lower fraction of internally upheld and externally appealed claims denials that are deemed to merit overturn by external reviewers, as compared to the other datasets. This could be influenced by numerous factors, but one possible explanation is that internal appeal review processes are more consistent with external appeal review processes than they are in other states.

Pennsylvania Department of Insurance Data

In Figure 57, we display the overall external appeal rate and external appeal overturn rate from the Pennsylvania Department of Insurance data. At 25%, the external appeal overturn rate is consistent with that in Washington, and substantially lower than the rate in the other datasets. However, note that the sample size is extremely small in the PA data; only 102 external appeals are recorded in the data.

Figure 57. Overall external appeal rate and overall appeal overturn rate among issuers represented in the 2020 and 2021 PA DOI data. In this data only 10 percent of approximately one thousand denials upheld on internal appeal were externally appealed, but of those, 25 percent get overturned when reviewed by an independent external agency.

Source: Pennsylvania Department of Insurance Data.

In the Pennsylvania DOI data, approximately 90% of the consumers who had their internal appeals of claims denials upheld on internal review did not proceed with the external review process.

External Appeals By Denial Rationale

Under federal regulations, claims denials that are upheld after an internal review are only eligible for external appeal by consumers in certain contexts (see (Pollitz, 2021) for more information), such as denials whose rationale involves a purported lack of medical necessity. As a result, the distribution of denial rationales for claims that make it to the external appeal process varies drastically from the distribution of rationales among all denials, or even among internally appealed denials.

Despite this, the datasets with external appeals that we consider that allow for the denial rationale distribution to be calculated show distinct distributions of denial rationales. In this section, we display these variations. Just as we saw that the allowable claims denial rationales varied by dataset, we see here that even the allowable denial rationales eligible for external appeals are not standardized across these datasets.

California CDI External Appeal Database

In the California CDI external appeal database, 66% of external appeals correspond to denials whose rationale was medical necessity. Figure 58 shows the distribution of rationales provided in this data.

Figure 58. Distribution of rationales for denials in the CDI IMR database.

Source: CDI External Appeal Database.

California DMHC External Appeal Database

In the California DMHC external appeal database, 71% of external appeals correspond to denials whose rationale was medical necessity. Figure 59 shows the distribution of rationales provided in this data.

Figure 59. Distribution of rationales for denials in the DMHC IMR database.

Source: DMHC IMR database.

New York External Appeal Database

In the New York external appeal database, 92% of external appeals correspond to denials whose rationale was medical necessity. Figure 60 shows the full distribution of rationales provided in this data.

Figure 60. Distribution of rationales for denials in the NYS external appeal database.

Source: NYS External Appeal Database.

Washington External Appeal Database

In the Washington external appeal database, 43% of external appeals correspond to denials whose rationale was medical necessity. Figure 61 shows the distribution of rationales provided in this data.

Figure 61. Distribution of rationales for initial denials among Washington external appeals.

Source: Washington External Appeal Data.

External Appeal Overturn Rates Over Time

All of the datasets we've considered here include external appeals that have been tracked over many years. The graphs below show how the external appeal overturn rates, and frequency with which external appeals have been submitted, have varied over the years. Because we've focused our attention on the 2023 CMS TIC PUF in this article, rather than an aggregate of all CMS TIC PUFs, we omit such a graph for that dataset.

All of the datasets show an increase in external appeal overturn rates over the last four years.

California CDI External Appeal Database

In aggregate across the determinations made between 2011 and 2022, the complete overturn rate for external appeals logged in the CA CDI database was roughly 50%; Figure 62 shows that this rate varied considerably across these years.

Figure 62. Overturn rates and raw counts of external appeals among California CDI regulated issuers from 2011 to 2022, broken down by year.

Source: CDI External Appeal Database.

California DMHC External Appeal Database

In aggregate across the determinations made between 2001 and 2022, the complete overturn rate for external appeals logged in the CA DMHC database was roughly 49%; however Figure 63 shows that this rate also varied considerably across these years.

Figure 63. Overturn rates and raw counts of external appeals among California DMHC regulated issuers from 2001 to 2022, broken down by year.

Source: DMHC IMR database.

New York External Appeal Database

In aggregate across the determinations made between 2019 and 2022, the complete overturn rate for external appeals logged in the NY database was roughly 43%; Figure 64 shows how this rate varied across those years.

Figure 64. Overturn rates and raw counts of external appeals among New York issuers from 2019 to 2022, broken down by year.

Source: NYS External Appeal Database.

Washington External Appeal Database

In aggregate across the determinations made between 2016 and 2022, the complete overturn rate for external appeals logged in the Washington external appeal database was roughly 25%; Figure 65 shows how this rate varied across those years.

Figure 65. Overturn rates and raw counts of external appeals among Washington health plans from 2016 to 2022, broken down by year.

Source: Washington External Appeal Data.

In the Washington external appeal data, we can further segment the yearly external appeal overturn rates according to the rationales provided for the underlying denials. The result, displayed in Figure 66, shows that the overturn rate for the 'Contractual Coverage Dispute' category is much lower than the others. This is consistent with the hypothesis we formulated earlier regarding the subtlety inherent in medical necessity and experimental denial determinations that is not present in many other denial rationale categories.

Figure 66. External appeal overturn rates by year, broken down by purported rationale for the denial provided by the insurer. Overturn rates for contractually disputed services are much lower than those for denials corresponding to purported lack of medical necessity or purportedly experimental or investigational services.

Source: Washington External Appeal Data.

External Appeal Overturn Rates By Insurer

Three of the datasets with external appeals data provide enough information to compute external appeal overturn rates broken down by insurer. In what follows we share some of these breakdowns.

In each dataset, the overturn rate for external appeals varied considerably across different insurers, with overturn rates among the 10 insurers that received the most external appeals in each dataset varying by at least 14%. This suggests that either the average validity of externally appealed denials is varying by insurer, or that review of external appeals is being performed in an inconsistent way across insurers. Note, however, that the average validity of externally appealed denials could vary across insurers for numerous reasons (e.g. the average validity of initial denials could be varying by insurer, or the internal appeal adjudication processes could be varying by insurer, etc).

CMS Federal Marketplace PUFs

Figure 67 shows the external appeal counts for the 10 insurers that received the most external appeals in the 2021 CMS federal marketplace data. Two insurers accounted for a significant share of the external appeals received.

Figure 67. External appeals among federal marketplace insurance plans from 2021, broken down by insurer. Only the 10 insurers with the most external appeals received in the dataset are shown here.

Source: CMS TIC PUF data.

In figure 68, we show the external appeal overturn rates for these same insurers. A majority of these top 10 insurers have external appeal overturn rates between 30% and 40%, but there are three outliers with overturn rates above 50%.

Figure 68. External appeal overturn rates among federal marketplace insurance plans, broken down by insurer. Only the 10 insurers with the most external appeals received in the dataset are shown here.

Source: CMS TIC PUF data.

New York External Appeal Database

Figure 69 shows the external appeal counts for the 10 insurers that received the most external appeals in the New York external appeal database.

Figure 69. External appeals among New York insurance plans from 2019 to 2022, broken down by insurer. Only the 10 insurers with the most external appeals received in the dataset are shown here.

Source: NYS External Appeal Database.

Figure 70 shows the external appeal overturn rates for these same insurers. A majority of these top 10 insurers again have external appeal overturn rates between 30% and 40%, but there there are numerous outliers with higher overturn rates.

Figure 70. External appeal overturn rates among New York insurance plans from 2019 to 2022, broken down by insurer. Only the 10 insurers with the most external appeals received in the dataset are shown here.

Source: NYS External Appeal Database.

Washington External Appeal Database

Figure 71 shows the external appeal counts for the 10 insurers that received the most external appeals in the Washington external appeal database.

Figure 71. External appeals among Washington insurers from 2016 to 2022, broken down by insurer. Only the 10 insurers with the most external appeals received in the dataset are shown here.

Source: Washington External Appeal Data.

Figure 72 shows the external appeal overturn rates for these same insurers. External appeal overturn rates in the Washington data are markedly lower than those in the other datasets. All of the external appeal overturn rates among these top 10 insurers fall between 15% and 30%.

Figure 72. External appeal overturn rates among Washington insurers from 2016 to 2022, broken down by insurer. Only the 10 insurers with the most external appeals received in the dataset are shown here.

Source: Washington External Appeal Data.

External Appeal Overturn Rates By Independent Review Organization

In this section we examine how external appeal submissions and external appeal overturn rates vary based on the review organizations assigned to review the appeals. Among the datasets we consider, only the New York external appeal database and the Washington external appeal database detail the review organizations associated with each review.

New York External Appeal Database

In the New York external appeal database, a total of 4 independent review organizations were involved in adjudication.

Those organizations are:

Figure 73 shows the number of external appeals in the NY DFS external appeal database that were adjudicated by each of these organizations.

Figure 73. Counts of the external appeals adjudicated by each of the NY state review organizations.

Source: NYS External Appeal Database.

Figure 74 shows the average external appeal overturn rate for these organizations over the time period covered by the data.

Figure 74. Average external appeal overturn rates for each of the NY external review organizations.

Source: NYS External Appeal Database.

It is informative in this case to understand how the complete overturn rates for each of the review organizations appearing in this data has evolved over the years; this evolution is shown in figure 75. It is interesting to note that although all organizations received a similar distribution of denial rationales (this can be verified in the data, but is not shown here), the complete overturn rates vary by organization in a way that has been relatively consistent over the years (e.g. IPRO has consistently maintained the highest complete overturn rate).

Figure 75. Complete overturn rates for external appeals received by each of the independent review organizations in the New York external appeal database.

Source: NYS External Appeal Database.

It would be useful to understand why the average overturn rate varies consistently between these organizations14, and what (if any) protocols exist to ensure that reviewers across organizations are following consistent standards.

We were also unable to find information detailing the process by which external review agencies are granted contract renewals, or monitored and evaluated by the state of New York to ensure they are evaluating external appeals consistently and appropriately, but this seems like information which would be useful to disclose prominently to gain the trust of consumers.

It would also be helpful if external appeal data (including metadata reflecting the relevant review organizations) were included in the Health Care Claim Reports, which we analyzed previously, to allow consumers, researchers and policy makers to audit the nature of NY external appeal processes with more context, such as knowledge of the pool of internal appeals to which they correspond, and more fine grained detail.

Washington External Appeal Database

In the Washington external appeal database, a total of 19 independent review organizations were involved in adjudication.

Figure 76 shows the number of external appeals in the WA external appeal database that were adjudicated by the 10 organizations that administered the most appeal reviews.

Figure 76. Counts of the external appeals adjudicated by each of the WA external review organizations from 2016-2022, for those review organizations that received at least 500 adjudication requests.

Source: Washington External Appeal Data.

Figure 77 shows the average external appeal overturn rate for these organizations over the time period covered by the data.

Figure 77. Average external appeal overturn rates for each of the WA external review organizations from 2016-2022, for those review organizations that received at least 500 adjudication requests.

Source: Washington External Appeal Data.

The average external appeal overturn rate varies drastically across these organizations. It is unclear from the data we have what causes this discrepancy, though what is clear (when one compares the yearly breakdown of overturn rates for each review organization) is that this variance cannot be solely attributed to yearly variations in the overturn rates for each constituent review organization. Regardless of the explanation, such large variation in overturn rates for potentially crucial medical claims denials merits explanation, and monitoring and evaluation, to ensure accountability and prevention of inappropriate practices, and build consumer trust.

For example, it would be interesting and useful for the public to gain access, from an official and reputable source, to the implemented policy that determines exactly how external appeal requests are routed on a case by case basis to the review agencies in Washington. It is possible that review organizations do not receive a random subset of external appeals, but rather get routed appeals based on undisclosed factors that lead to this overturn variation (e.g. one external review agency specializes in adjudicating appeals pertaining to emergency services, etc.).

External Appeal Overturn Rates By Underlying Medical Characteristics

The final lens through which we examine the external appeals in these datasets involves how the external appeals, and the overturn rates, vary according to the underlying medical details of the claim. One way to study this variation is to consider codes or other descriptors associated with the underlying diagnoses, treatments or services related to a claims denial. All of the external appeal datasets we consider here except for the CMS TIC PUFs provide information of this sort in varying degrees.

Although such information is provided in numerous datasets, the way in which the information is provided is inconsistent, which makes attempts at direct comparisons tenuous. Nonetheless, we consider how the data breaks down along these dimensions in what follows.

California CDI External Appeal Database

The California CDI external appeal database provides breakdowns of external appeals by both diagnosis and service categories; in both cases, externally appealed claim records are labeled with a plain text description of the diagnosis or service or treatment, but no associated diagnosis or procedure codes are included in the data. In addition, many of the plain text descriptions are generic and ambiguous in nature, as we see in the figures below.

Figure 78 shows the most common diagnoses among the external appeals data, while Figure 79 shows the most common services, and Figure 80 shows the most common diagnosis-service pairs.

Figure 78. Counts of the external appeals corresponding to the 10 most common diagnoses appearing in the CDI IMR database.

Source: CDI External Appeal Database.

Figure 79. Counts of the external appeals corresponding to the 10 most common services appearing in the CDI IMR database.

Source: CDI External Appeal Database.

Figure 80. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the CDI IMR database.

Source: CDI External Appeal Database.

Figure 81 shows the most common diagnosis-service pairs among the denials corresponding to medical necessity in the dataset.

Figure 81. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the CDI IMR database among denials whose rationale involves medical necessity.

Source: CDI External Appeal Database.

California DMHC External Appeal Database

The California DMHC external appeal database provides breakdowns of external appeals by both diagnosis and service categories in a way that is similar to how such data is provided in the CA CDI external appeal dataset; no precise diagnosis or procedural codes are provided, rather plain text descriptions indicate the fields. The plain text descriptors appear to be distinct even from those appearing in the CDI dataset, despite both datasets corresponding to data from the same state.

Figure 82 shows the most common diagnoses among the external appeals data, while Figure 83 shows the most common services, and Figure 84 shows the most common diagnosis-service pairs.

Figure 82. Counts of the external appeals corresponding to the 10 most common diagnoses appearing in the DMHC IMR database.

Source: DMHC IMR database.

Figure 83. Counts of the external appeals corresponding to the 10 most common services appearing in the DMHC IMR database.

Source: DMHC IMR database.

Figure 84. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the DMHC IMR database.

Source: DMHC IMR database.

Figure 85 shows the most common diagnosis-service pairs among the denials corresponding to medical necessity in the dataset.

Figure 85. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the DMHC IMR database among denials whose rationale involves medical necessity.

Source: DMHC IMR database.

New York External Appeal Database

The New York external appeal database provides breakdowns of external appeals by both diagnosis and service categories in a way that is similar to how such data is provided in the CA CDI and DMHC external appeal datasets; no precise diagnosis or procedural codes are provided, rather plain text descriptions indicate the fields. The plain text descriptors appear to be distinct from those used in both the CA CDI and CA DMHC external appeal databases.

Figure 86 shows most common diagnoses among the external appeals data, while Figure 87 shows the most common services, and Figure 88 shows the most common diagnosis-service pairs.

Figure 86. Counts of the external appeals corresponding to the 10 most common diagnoses appearing in the NY external appeal database.

Source: NYS External Appeal Database.

Figure 87. Counts of the external appeals corresponding to the 10 most common services appearing in the NY external appeal database.

Source: NYS External Appeal Database.

Figure 88. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the NY external appeal database.

Source: NYS External Appeal Database.

Figure 89 shows the most common diagnosis-service pairs among the denials corresponding to medical necessity in the dataset.

Figure 89. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the NY external appeal database among denials whose rationale involves medical necessity.

Source: NYS External Appeal Database.

Washington External Appeal Database

Finally, we consider the breakdown of diagnosis and service descriptors in the Washington external appeal database. Among the datasets we consider, this database appears to provide the most fine grained and well-specified breakdown of diagnosis and service descriptions. This detail is provided for each external appeal recorded both via plain text descriptors, but also by corresponding codes determined by standardized diagnosis and procedure specifiers.

The use of common, standardized coding schemes means that direct comparisons of externally appealed denials broken down by these properties across datasets would be feasible, if other datasets moved toward reporting by the same standard. This line of reasoning is one that we consider in more detail in the policy recommendations section below.

Figure 90 shows the counts of the top ten diagnosis codes among the external appeals in this data, while Figure 91 shows the most common services, and Figure 92 shows the most common diagnosis-service pairs.

Figure 90. Counts of the external appeals corresponding to each of the top 10 diagnosis codes appearing in the WA external appeal database.

Source: Washington External Appeal Data.

Figure 91. Counts of the external appeals corresponding to the 10 most common services appearing in the WA external appeal database.

Source: Washington External Appeal Data.

Figure 92. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the WA external appeal database.

Source: Washington External Appeal Data.

Figure 93 shows the most common diagnosis-service pairs among the denials corresponding to medical necessity in the dataset.

Figure 93. Counts of the external appeals corresponding to the 10 most common diagnosis-service pairs appearing in the WA external appeal database among denials whose rationale involves medical necessity.

Source: Washington External Appeal Data.

Because the Washington external appeal data has the most specificity and least ambiguity in diagnosis and service breakdowns, it is useful to note how those labels are distributed in more detail.

For example, it is informative to consider the distribution of diagnosis codes that are occurring in each subtype of denial rationale, rather than doing so only for denials related to medical necessity. Unsurprisingly, diagnoses vary in their relative frequency across each class of denial rationale. For example, cases being deemed not medically necessary are most often being externally appealed when the underlying diagnosis involved is Psoriasis, while among experimental denials the most commonly externally appealed diagnosis is GERD, and among contractual disputes it is screening for malignant neoplasm.

Figures 94, 95, and 96 respectively show the same breakdown of the top 10 diagnoses, but when restricting attention to appeals corresponding to each of the three categories of denial rationales.

Figure 94. Counts of the external appeals corresponding to each of the top 10 diagnosis codes among external appeals of denials whose rationale involves medical necessity, as found in the Washington external appeal data.

Source: Washington External Appeal Data.

Figure 95. Counts of the external appeals corresponding to each of the top 10 diagnosis codes among external appeals of denials whose rationale involves the experimental or investigational nature of a treatment or service, as found in the Washington external appeal data.

Source: Washington External Appeal Data.

Figure 96. Counts of the external appeals corresponding to each of the top 10 diagnosis codes among external appeals of denials whose rationale involves a contractual coverage dispute, as found in the Washington external appeal data.

Source: Washington External Appeal Data.

Figures 97, 98, and 99 show this same data for the three denial rationale categories, but breaking the appeal frequency data down further by insurer.

Figure 97. External appeal frequencies broken down by diagnosis codes and insurer, for denials whose rationale involves medical necessity. Here we show only the 20 diagnoses most prevalent across the dataset. The color of a box in a column indicates the fraction of all externally appealed denials for a given insurer that were associated with a particular diagnosis code (i.e. columns would sum to 1, if all diagnoses in the dataset were included in the plot).

Source: Washington External Appeal Data.

Figure 98. External appeal frequencies broken down by diagnosis codes and insurer, for denials whose rationale involves the experimental or investigational nature of a service or treatment. Here we show only the 20 diagnoses most prevalent across the dataset. The color of a box in a column indicates the fraction of all externally appealed denials for a given insurer that were associated with a particular diagnosis code (i.e. columns would sum to 1, if all diagnoses in the dataset were included in the plot).

Source: Washington External Appeal Data.

Figure 99. External appeal frequencies broken down by diagnosis codes and insurer, for denials whose rationale involves a contractual coverage dispute. Here we show only the 20 diagnoses most prevalent across the dataset. The color of a box in a column indicates the fraction of all externally appealed denials for a given insurer that were associated with a particular diagnosis code (i.e. columns would sum to 1, if all diagnoses in the dataset were included in the plot).

Source: Washington External Appeal Data.

While it is difficult to draw conclusions about what is causing the phenomenology observed in Figures 94 through 99 based solely on the available data, the data raises many questions. The relative frequency of denial and appeal outcomes across insurers and medical diagnoses and services rendered is of critical importance to those with chronic, debilitating, or dire medical conditions. It is important that insurer behaviors of the sort reflected in the plots above are better understood, monitored and regulated so that fair and just adjudication practices can be ensured for consumers.

Summary of Findings

In studying external appeals in each of the datasets considered, we've observed a few key points that are consistent across market segments and marketplaces.

  • Among the small population of claims denials that are upheld on internal appeal and appealed externally, external overturn rates are high.
  • External appeal overturn rates are dependent both on the insurer from which claims denials come, as well as the independent review agents to which appeals are assigned.
  • External appeals corresponding to different denial rationales are not uniformly distributed among services and diagnoses; certain diagnoses and services are associated with denials that incur higher rates of external appeal.

Informed by these findings, we discuss some proposals for improving patient outcomes in the recommendations section below.

Conclusions

We now take a step back to synthesize our analyses, and consider some additional large scale implications.

Lack of Evaluation of Commercial Internal Appeal Processes

The data suggests that internal appeals processes in commercial health insurance are problematic. In each dataset considered here, we see that denied claims that are upheld after internal appeal reviews, and then subsequently reviewed in an external review by an independent third party, are overturned at high rates. For example, in the 2021 federal marketplace data nearly 50% of the 1,760 denied claims that were externally appealed were overturned by third party reviewers with no financial incentive in the determination. This same pattern of high rates of disagreement between internal and external reviewers for the population of externally appealed claims is evident across all data we've studied in this article. This phenomenon suggests to us that internal appeal reviews among commercial health plans need to be audited and evaluated with more scrutiny, as we now explain.

If the internal appeal process and external appeal processes were unbiased, appropriately administered, and employed consistent standards, we'd expect a large degree of consistency between insurer determinations and third party determinations among the population of claims reviewed by both parties. A measure of agreement of this sort is referred to as an inter-rater reliability (IRR) measure, or inter-annotator agreement, and is a useful and commonly employed means of determining consistency or quality of determinations or ratings provided by different individuals.

The degree of disagreement between internal and external reviewers in appeals data is relatively high in all of the data we've seen, given the significance of the consequences for patients when determinations are adjudicated incorrectly. What constitutes a relatively high level of disagreement is of course subjective; however, across all of the datasets we've seen external appeal overturn rates of approximately 15% to 60%, and furthermore, appeal overturn rates tend to remain in these ranges even when we consider individual insurers, or individual review agents.

In other critical matters of auditing quality, society at large would never settle for such a level of inter-rater agreement, particularly when one rater has a monetary incentive to rate a particular way. For example, we posit no would one would fly on a plane if airlines were allowed sole discretion to audit their own safety features "internally" upon receiving safety complaints, but every time they were audited by external agencies, parts that the airlines found to be operational were found by independent safety inspectors to be faulty 15-60% of the time, consistently, and across all airlines, no matter which external safety agency were performing the inspections. That is the level of disagreement we have witnessed throughout these analyses.

Regardless of the cause of the consistently high levels of disagreement between internal and external reviews of externally appealed claims denials, the high levels of disagreement should be compelling motivation for regulators and lawmakers to carefully and consistently monitor and evaluate the outcomes of internal and external reviews in a transparent way among commercial insurers. Even the task of trying to perform this evaluation comprehensively oneself is incredibly difficult, if not impossible, given the lack of publicly available data. In an ideal world consumers and patients could rely on evaluation and monitoring being performed on their behalf, and for their protection, by regulators, but short of this data must at least be accessible so that watchdogs and advocates can shine a light on potentially problematic practices. It is important that discrepancies such as those we see in external appeal review inconsistencies are better understood; regardless of the cause, understanding them will improve outcomes and efficiencies in our systems.

Monetary Value of Appeals

Consider again the disparity between the fraction of denials being appealed and the fraction of those appeals being internally overturned; we've seen such a disparity in every dataset containing internal appeals information that we've considered. One takeaway from this result is that it is in the best interest of consumers, as individuals, to appeal potentially inappropriate denials more frequently. However, if such a change were to occur on a large scale systematically, it would have implications for our healthcare systems at large in addition to improving health outcomes for individuals.

To illustrate, let's consider what would happen if more federal marketplace insurance plan denials were appealed, under various assumptions.

Suppose:

  • The fraction of denials appealed initially is XX% (X=.1899506X=.1899506% in the 2021 federal marketplace data).
  • The initial appeal overturn rate is YY% (Y=41.32Y=41.32% in the 2021 federal marketplace data).
  • The average value per denied claim is $ZZ (we do not have enough information to estimate Z for the distribution of claims data in the 2021 federal marketplace data).

Set XX, YY, and ZZ in the sliders below to see what would result.




If 0.1899506% of all denied claims were appealed, with an initial appeal success rate of 41.32% , internal appeals would result in

(0.001899506×47,325,980 denied claims)×(0.4132 success rate)=0 overturned claims(0.001899506 \times 47,325,980 \text{ denied claims})\times (0.4132 \text{ success rate}) = 0 \text{ overturned claims}

assuming the same number of denials that occurred among the 2021 federal marketplace plans.

(The actual historical parameters from 2021 led to 37,145 claims overturned via internal appeals on the federal marketplace.)

Assuming an average value of $100 per claim, this would correspond to a recoup of

(0 overturned claims)×($100 per claim)=$0(0 \text{ overturned claims}) \times (\$100 \text{ per claim}) = \$0

in the form of regained payments from insurers.

Note that even for modest estimates about the extent to which appeal rates might be scaled for inappropriate denials while maintaining historical overturn rates, scaling appeal rates has high value implications. For example, scaling currently observed internal appeal overturn rates (41.32%) to 1% of all denied claims on the federal marketplace with an average value per claim of just $100 (X=1X=1 , Y=41.32Y=41.32, Z=100Z=100), would result in approximately 200k extra overturned claims and 20 million USD of value returned to consumers.

This does not include all of the insurance denials that occur outside of the federal marketplaces, or those that were not reported in this incomplete data, nor does it take into account the fact that external appeals might be a less biased process that data suggests would lead to even more overturns.

The takeaway is that consumers should be appealing contractually inappropriate denials whenever possible, and we need organizations, regulators, and individuals to support consumers and aid them in scaling the rate at which they are following through with such appeals. The monetary value alone is significant to consumers, and there are many important additional implications for their health.

Need for Data Standardization, Validation and Enhancement

In this article we've considered various data sources that help shed light on the state of coverage denials and appeals outcomes across plans in the federal marketplace, some state marketplaces, and various subgroups of plans unavailable in these markets. While it is a useful start to have access to the data we've analyzed, the level of access we have to such crucial data in general across the insured population is poor.

Furthermore, although we have attempted to present the data in a standardized fashion to help understand broad, nonspecific trends occurring across states and market segments (e.g. the ubiquitous nature of high denial rates), the data is in fact not standardized and is inconsistent, with data details, reporting methodologies, and governing rules varying across datasets, resulting in difficulties and subtleties in addressing even basic questions which ought to be addressable. For example even the highly limited task of presenting a complete account of denial and appeal outcomes (as measured by e.g. denial, internal and external appeal rates) in each ACA exchange (federal and all state exchanges included) appears intractable given the varied reporting standards and limited publicly available data.

We summarize the issues we've encountered by breaking them down into three categories.

  • Standardization

    We have seen that the claims denial data reported by the federal marketplace and various state agencies is reported in different and inconsistent ways. For example, the federal marketplace data released in the CMS PUFs contains internal and external appeal data, while the NY health care claims reports do not include external appeal data. Similarly, the WA external appeal database includes a different amount of detail for each appeal case than does the NY external appeal database. The lack of consistency and standardization across datasets results in a landscape of data that is difficult to analyze, compare and monitor.

    While we have made an attempt to standardize aspects of our presentation of data from different sources, this involved significant effort due to the lack of standardization across sources, as can be seen in the accompanying raw open source analyses.

  • Validation

    In various analyses presented in this article, and many additional calculations that occurred in our open source analyses of the raw data, we encountered what appear to be spurious or erroneous values in the raw data. For example, we saw in our analysis of the 2022 CT consumer report card that one insurer purportedly denied more claims in a plan year than they even received. Similarly, we saw in numerous datasets that the sum of counts for the exhaustive list of denial rationales provided for the dataset did not sum up to the count of all denials, nor did it consistently sum to a value larger than the total number of denials (which one would expect if claims could be labeled with one or more denial rationale), or a value smaller than the total number of denials (which one would expect if not all denials needed to have a rationale).

    While there may well be some consistent, logical explanations for each of the seemingly spurious situations occurring in the data, such explanations were not obvious to us, and are critical and completely necessary for drawing conclusions from the data in an accurate way. Understanding such subtleties across the landscape of all existing data appears (to us, anyway) to be a monumental and difficult task.

    Validating the data before it is made public, according to clearly defined, programmatically automated, rules, would allow consumers, researchers, regulators and policy makers to build trust in the data and take it at face value.

  • Enhancement

    In addition to the need for standardization and validation of the data elements that are already common among the datasets we've presented here, the landscape of data also makes clear that there is a serious need for more data.

    The public data we've been able to access thus far includes relatively complete data from federal marketplace plans, as well as limited data from a handful of state marketplaces, and relatively complete data from one state. Ideally, standardized and validated denial data for all insurance plans in the U.S., including all state marketplace plans, all commercial plans including self-funded employer plans, and all Medicare, Medicare Advantage, and Medicaid plans, would be made available to the public to allow for meaningful understanding of the entire ecosystem, and systemic patterns.

    In addition to enhancing the breadth of insurance plan markets covered by public denial data, there is also a need to enhance the depth and detail included in data elements for each plan or insurer recorded in the data. For example, only some of the datasets we've considered included a breakdown of denials by rationale, and of those, many recorded the fraction of denials with each rationale type, but then neglected to retain information about the rationale breakdown when reporting internal and external appeals. This means, for example, that it is impossible with such datasets to calculate how often NMN denials specifically are being appealed, or how often NMN denials are being overturned via external appeal, etc. There are a huge number of important questions of this sort that simply cannot be addressed given the detail with which the data is reported across these sources. Because the data is not reported in enough detail to answer such questions, such questions cannot be addressed or monitored by members of the public or organizations that help advocate for consumer protection.

Need for Standardization and Streamlining of Appeals Processes

In every dataset considered here, we have seen that appeal rates are low, while appeal overturn rates are orders of magnitude larger. We need to empower consumers and their providers to successfully and accessibly appeal denials in all cases where the denials are contractually inappropriate. To do this efficiently at scale, across different insurance market segments, it would be immensely useful to move toward a common, standardized and streamlined appeal process, whose details are not as dependent on the specific plan, plan type, state, or market segment of the insured as current processes are.

Recommendations

In this section, we provide recommendations based on the conclusions just discussed. We consider both policy recommendations for future laws and regulations, as well as recommendations for individual consumer recourse given the current state of our laws, regulations, markets and government.

Policy

In this section we propose policy recommendations for lawmakers and regulators seeking to improve the situations that patients experience when facing contractually inappropriate coverage denials.

Require Qualification for Internal Appeal Adjudication

  • In many commercial contexts, consumers seeking recourse for what they view to be inappropriate denials are required to submit and wait for determinations in internal appeals before proceeding with external appeals. This can have dire medical consequences for those who are inappropriately forced to wait15. We contend that insurers have no place judging the legality and medical validity of their own behaviors, unless they can prove, with data, that they are in fact retaining impartiality despite the clear monetary incentive at play.

    Insurers should have to qualify, based on public and transparent evaluation of their historical appeal adjudication outcomes, to be involved in judging the merit of their customers' appeals. One possibility for implementing such a policy would be to utilize an IRR measure as a stopgap for inappropriate internal appeal practices. For example, insurers might qualify to administer their own internal appeal process if their historical adjudication determinations maintain an IRR consistency measure with independent review audits that is greater than a set threshold, and otherwise would be forced to allow their consumers to directly submit appeals via externally administered processes. Such an IRR evaluation could be accomplished by randomly auditing a sample of internal appeals each year and having them re-reviewed by external agencies.

    One clear argument against eliminating internal appeals for some insurers is that it might, in principal, raise costs to our system, and as a result be infeasible to implement: a more direct line to external appeals would likely lead to more external appeals, perhaps even an inappropriately large amount, if the system were abused by consumers, and someone will have to administer and pay for the external appeals to be adjudicated. We contend that this cost is already borne by consumers, in the sense that internal appeals can be abused in the same way, and cost money to administer that is presumably passed down to consumers in the form of increases to their premiums.

    Our proposal is to simply remove insurers who cannot pass more strict quality audits from being involved in these processes directly; if such insurers maintain that they perform internal appeal adjudication appropriately, via processes consistent with best-practices, then their internal processes are likely to be costly, and require ample time from trained medical experts, so an insurers removal from the process would save them money that they could pass down to consumers in the form of reduced premiums, or pass up to the external process administrators. In short, if appeals are currently adjudicated appropriately, the money to continue adjudicating appropriately is already being spent; it is a matter of who is getting paid to do that work, and by whom. If appeals are not currently being adjudicated appropriately, then in fact insurers might be adjudicating with less money than can reasonably be spent to accomplish adjudication appropriately. If that is the case, it will collectively cost more money, but the alternative is surely more problematic.

    One might suggest that insurers can administer these appeal processes much more efficiently than external review agencies, since they have privileged access to their own patient data, medical review networks, and efficient business structures, so they should be left to tend to the processes without interference or further regulation to save us all money; while that is possible, it it by no means clear that the ability to adjudicate more efficiently corresponds to an ability to adjudicate more justly or appropriately while maintaining that efficiency, which, one would hope, is the goal. If insurers were forced to qualify for the ability to review internal appeals, such qualification audits might shed light on whether the claim that insurers can adjudicate appeals with the highest efficiency, while maintaining appropriate adjudication outcomes, is in fact true. If it is, that would be beneficial for insurers and patients alike, and public access to the audit data would build trust among consumers both in accountability, and in well-performing insurers themselves. Such a policy would allow the best actors in the space to be rewarded by consumers for their commitment to fair practices.

    Unless insurers can furnish data that displays their ability to evaluate claim validity in the same way third parties with no profit incentive can, we believe it is inappropriate to have them preside over their disputes with their customers, and the medical experts representing their customers.

Levy Insurer Fines For External Appeal Overturns

  • Patients pay when their care is not covered, they sometimes pay retroactively for accessing an external appeal process that they subsequently lose (for example, New York allows commercial insurers to charge consumers for the right to externally appeal a decision, if the insured ultimately loses their external appeal), and if and when they jump through enough hoops to get an independent appeal review and win back coverage, they still pay -- even in the best case denial overturn scenario, they've paid their premium, potentially paid out of pocket or delayed their care while appealing, and certainly paid in the form of time spent rectifying the situation. While they may recoup their coverage, their best case scenario when faced with contractually inappropriate denials is paying this lesser of two costs. In some cases, patients who understand this predicament correctly conclude that it will cost them less to just pay out of pocket, or forego care, for a contractually inappropriate denial than it would to pay the monetary and temporal cost of attempting to win an appeal.

  • On the other hand, insurers are monetarily incentivized to inappropriately deny (whether or not they act based on this incentive); in their worst case, they pay what they owe according to their contracts plus the administrative burden of adjudicating appeals. The numbers we’ve studied show that the frequency and monetary value of appealed denials, compared to the frequency and monetary values of denials themselves, are such that these worst case appeals scenarios could sensibly comprise an accepted piece of an overall profitable process, if there are enough inappropriate denials that go uncontested.

    For plans to which ERISA applies, insurers don't even face the risk of being sued for damages; ERISA provides them relative safety to act in bad faith via denial practices and face a palatable worst case, if they so choose.

    Lawmakers need to make it actually cost insurers to deny inappropriately. The cost to consumers is their health, their money, and their time; their worst case scenario involves dire medical consequences, including death.

    Make the stakes for insurers administering contractually inappropriate denials appropriately serious, by levying heavy fines for external appeal overturns, and we might see insurers begin adjudicating their own claims with high IRR consistency with third parties. At the least, we will remove a troubling perverse incentive for inappropriate denials, which will strengthen consumer trust in industry practices.

    Regardless of intention, monetarily insurers are not making mistakes when they deny in contractually inappropriate ways; they are making the monetarily sound choice given the legal context in which they are being regulated. At the very least, in cases where consumers can be charged for submitting an external appeal which ends up being upheld, regulators ought to allow insurers to be charged a fee in addition to coverage, to the direct benefit of the consumer, for inappropriately denying an appeal as measured by that same external review board. Just as insurers may recoup a small portion of their administrative costs for administering an appeal when the insured was submitting a faulty appeal, consumers should be able to recoup part of their administrative costs (e.g. hours on hold, in the middle of their work day) for orchestrating an appeal for a faulty denial.

Mandate Transparent, Standardized Data Reporting For Denials and Appeals Across All Commercial Plans

  • Lawmakers and regulators should make denial data and appeals data for all commercial plans in the U.S. available, accessible, and transparent in a standardized format. They should do so in a way that is agnostic, to the extent possible, to details about where and how the plan is purchased. For example, whether a plan happens to be sponsored by an employer, be governed by ERISA, or be sold on the PA marketplace, it would not be insurmountable to report denial rates and appeal rates via standardized, general data templates that could accommodate the differences in governing regulations, and underlying data details. This is a tractable, if nontrivial, data schema problem.

    They should also do so in such a way that the data elements reported are standardized across marketplaces and plan types, and in such a way that those data elements are sufficient to understand intricacies and behaviors that inform whether consumers need more protection from inappropriate insurer practices. For example, claims counts, denial counts, internal appeals counts, and external appeals counts for a given plan ought to all be reported in a single place, because each element of data is relevant to and informs the others, and some are meaningless without appropriate context. As we've seen above, currently reported data is reported in fragmented and non-standard ways which renders it unnecessarily complicated and opaque.

  • The data elements required ought to be carefully formulated, and routinely evaluated for efficacy. For example, we saw in the Federal marketplace data and the New York Health Care Claims Reports data that insurers are asked to include the breakdown of denial rationales along with the denial counts, and that in each case a huge majority of the denial rationales were placed in the so-called "Other" category. This provides very little information about the true underlying rationale for a majority of the denials in the dataset.

    Given this result, regulators and lawmakers ought investigate the underlying cause for such pervasive use of the "Other" categories. Understanding the cause will help inform an appropriate solution. If the cause is truly that most denials are denied for reasons that can not be characterized with existing rationale options, the solution is simple: change the allowable categories with which insurers can report to be a larger, but still mutually exclusive and completely exhaustive collection of categories including "Other", such that true rationales for a majority of the data are reported, with "Other" serving as a backup catch-all necessary for only a small subset of the data.

    The prevalence of "Other" in the reporting indicates an issue with either the breakdown being presented to insurers, or enforcement of accurate reporting, and in either case issues like this need to be addressed to make this data actionable.

    As another example, in the NY health care claims reports we saw that the numbers of denials for medical necessity were reported for each insurer. This is an important subset of claims denials to understand well, so it is fortunate that such denials are specified. However, the same reports do not indicate which subset of appeals correspond to each subcategory of initial denial, so one cannot, for example, use the data to understand what the internal appeal rate is for NMN denials specifically. Transparency of such metrics is crucial for meaningful change and monitoring.

  • In requiring such data to be reported, lawmakers and regulators ought to require insurers to report the data to centralized agencies, and additionally require those centralized agencies to host publicly accessible copies of that data. Merely requiring insurers to report the data in a public way directly is insufficient, because it would make the task of compiling the data, enforcing standardization, and even enforcing compliance with posting, unwieldy and unnecessarily complicated. This type of issue has been made clear in the hospital price transparency and transparency in pricing regulation efforts.

  • Regulators and lawmakers also ought to validate the data insurers are providing, and impose penalties for data which is faulty or misrepresented. If there is no requirement or mechanism to enforce actual validity of the data, requiring the data to be reported would be much less meaningful. At a minimum, basic checks should be validated, such as total claims submitted being larger than claims denied, and other comparisons whose failure would indicate nonsensical data. In all of the data we've analyzed here, we've done such basic validation checks manually, and often had to throw out portions of the data which failed the basic checks.

Mandate Transparent, Standardized Processes and Forms for Appeals Across Commercial Plans

  • Commercial appeals processes are opaque and difficult to navigate for consumers, due largely to the fact that the processes vary by region, insurance market, and plan. While there are many resources that help guide patients through the process of submitting such appeals, the resources tend to be too generic to pertain precisely to the situations patients face. In reality, the details of appeals processes depend heavily on the details of ones plan, and even basic information such as the forms that must be filled out, the supporting documentation that must accompany an external appeal, and the existence or lack of a fee associated with an external appeal process is made opaque until one receives direct correspondence about the process from their insurer, or looks up bespoke rules pertaining to their situation. This leads to a landscape of processes that is difficult for consumers, regulators and lawmakers to navigate, monitor, evaluate and regulate.
  • Regulators and lawmakers ought to standardize appeal processes across all commercial insurance markets and plans, so that regardless of employment status or insurance market, a consistent set of standards for appeal rights and processes are guaranteed across the board. For example, one should not have to download a particular form specific to Insurer X when they externally appeal a decision; this allows Insurer X to impose implicit barriers (e.g. maybe they require the form to be faxed, rather than emailed, so now those few people without a fax machine need to: take time off work and pay money to go access one; or, in all likelihood, just abandon the appeal), in a way that is essentially undetectable to the public, and puts undue burden on consumers to access the form, submit it, and abide by its bespoke, and potentially unreasonable, rules and processes.

Patient Recourse

In this section, we provide suggestions for patients, consumers, physicians and others seeking recourse in contexts that appear to involve contractually inappropriate denials received from commercial health insurance plans.

Appeal

  • When you receive a denial for coverage, keep an eye out for signs that it may be contractually inappropriate. If you suspect a denial is contractually inappropriate, or are not sure, it may be worthwhile to appeal if you can afford to spend the time navigating the process. Appealing denials can be time consuming and difficult, but all of the data we've studied here indicates high likelihood of success for those who appeal.

Contact Regulators

  • Contact your state department of insurance, state attorney generals offices, employer benefits managers, employers, or whichever agencies or entities wield power in regulating or affecting the behavior of your particular insurance plan. Do not settle for contractually inappropriate determinations. Reach out to us at info@persius.org if you are not sure who to contact or who the appropriate regulatory agency is in your particular situation, and we can try to help guide you to the most relevant help.

Contact Our Team For Appeal Help

  • We are building tools to help people navigate issues with their insurance, including internal and external appeals processes. If you are facing what you suspect is a contractually inappropriate denial, please feel free to reach out to our team at info@persius.org. Our services are and will always be completely free for individuals.

    Our goal is to build scalable, tech based tools to help drive initial appeal rates higher for contractually inappropriate denials, and thereby change the applicable monetary calculus. Our goal is only to help with those denials which are contractually inappropriate, but we will work with you to determine if a denial appears contractually appropriate or not, and try to help in whatever ways we can. More on this effort in forthcoming work.

Share Your Stories

  • Collectively, we as consumers have access to denial statistics related to our care that can serve as ground truth for the data submitted by insurers that we've studied in this article. The insurer submitted data is sparse and not standardized, which makes it troublesome and difficult to act on. We would like to crowd source our collective understanding of certain denial behaviors, and we believe doing so in a community maintained, standardized way will empower consumers and regulators. Reach out to us at info@persius.org if you'd like to get involved in the effort or have a story to share.

Cite This Work

BibTex

@article{ClaimsDenials2023,
  author = {Mike Gartner},
  title = {Claims Denials in Marketplace Health Insurance (Interactive Version)},
  journal = {TPAS},
  year = {2023},
}

Plain Text

Gartner, Mike. (2023). “Claims Denials in Marketplace Health Insurance (Interactive Version).” TPAS.
https://blog.persius.org/investigations/claims_denials

References

Armstrong, D., Rucker, P., & Miller, M. (2023a). How Cigna Saves Millions by Having Its Doctors Reject Claims Without Reading Them. https://www.propublica.org/article/cigna-pxdx-medical-health-insurance-rejection-claims. https://www.propublica.org/article/cigna-pxdx-medical-health-insurance-rejection-claims
Armstrong, D., Rucker, P., & Miller, M. (2023b). UnitedHealthcare Tried to Deny Coverage to a Chronically Ill Patient. He Fought Back, Exposing the Insurer’s Inner Workings. https://www.propublica.org/article/unitedhealth-healthcare-insurance-denial-ulcerative-colitis. https://www.propublica.org/article/unitedhealth-healthcare-insurance-denial-ulcerative-colitis
Fields, R. (2023). How Often Do Health Insurers Say No to Patients? No One Knows. . https://www.propublica.org/article/how-often-do-health-insurers-deny-patients-claims. https://www.propublica.org/article/how-often-do-health-insurers-deny-patients-claims
Fox, J. (2022). The Lived experience Of Health insurance: An Analysis and Proposal for Reform.
Konrad, W. (2010). Fighting Denied Claims Requires Perseverance. In The New York Times. https://www.nytimes.com/2010/02/06/health/06patient.htmll. https://www.nytimes.com/2010/02/06/health/06patient.htmll
Pollitz, K. (2021). Consumer Appeal Rights in Private Health Coverage [Article]. https://www.kff.org/private-insurance/issue-brief/consumer-appeal-rights-in-private-health-coverage/
Pollitz, K., & Rae, M. (2020). Claims Denials and Appeals in ACA Marketplace Plans in 2020 [Article]. https://www.kff.org/private-insurance/issue-brief/claims-denials-and-appeals-in-aca-marketplace-plans/

Footnotes

  1. Valid here could mean medically, morally or contractually valid. We again note that these notions of validity are rarely the same. We will always explicitly qualify the words appropriate or valid if we are specifically referring to a particular notion.

  2. Trying to estimate this last number when choosing an insurer is particularly important for individuals with serious health needs who would be unable to afford the care they need if it were to get denied by their insurer.

  3. Of course not all claims are appropriate. For example, sometimes two duplicate claims are inadvertently submitted for the same care, and approving such claims would in no way help patients.

  4. The term transparency in coverage has been overloaded, and used in different contexts by CMS and other government agencies. Our use of the term here does not refer to the CMS rule published in 2019 requiring health plans to publish certain rate data, which is broadly being referenced as the transparency in coverage rule. Instead, it refers only to the public use files distributed by CMS documenting federal marketplace denials and appeals (among other things). Note that these are also referred to by CMS as transparency in coverage files, and in fact have been in existence, and referred to in this way, far longer than the TiC rule. Obviously the two efforts are not completely unrelated, and it would be most welcome if reporting requirements in the TiC rule were expanded to incorporate reporting about denial transparency, as these federal marketplace PUFs report. We discuss this proposal at greater length in the policy recommendations section.

  5. This is a new dataset, acquired by our team via a public records request to the PA Department of Insurance. To our knowledge, it has never before been seen or released to the public.

  6. It is unclear which exact pool of insurance plans, and therefore consumers, this data covers. It appears the DFS has jurisdiction to regulate all marketplace plans, fully insured group plans, and Medicaid managed care plans. We planned to estimate this population using the DFS consumer report for the 2021 plan year, but found it does not include enrollment. We couldn't even corroborate the denial numbers from the 2021 plan year from the aforementioned report using the data sources we consider here, and instead found a significant discrepancy between the total number of externally appealed denials listed in the report, and the number that exist in the DFS database. We believe this discrepancy comes from the fact that the report excludes summary statistics for numerous categories of smaller plans, such as HMOs with less than 500 members, and commercial and EPO/PPO plans with less than 50 million USD in premiums.

    However, we can say that the population of consumers corresponding to plans that the DFS regulates at least includes marketplace plans, and according to New York State of Health enrollment data, that population has hovered between 4.5-7 million over the years represented here. 2

  7. We use the total enrollment noted in the 2021 CDI report, since it is explicitly noted. The quoted number is therefore relevant for the subset of the database containing IMRs from 2021, but not other years, since the consumer population in this market segment changes yearly.

  8. We use the total enrollment noted in the 2021 DMHC secretary report, since it is explicitly noted. The quoted number is therefore relevant for the subset of the database containing IMRs from 2021, but not other years, since the consumer population in this market segment changes yearly.

  9. We use the total enrollment noted in the 2022 consumer report card, since it is explicitly noted.

  10. We use the total enrollment reported for the PA marketplace in the CMS 2021 Marketplace Open Enrollment Public Use File as a lower bound, since we know this data at least includes all marketplace plan claims adjudications.

  11. Note that the CMS TIC PUF data is reported and labeled two years later than the date to which it corresponds. Thus the 2021 data is labeled by CMS as the 2023 TIC PUF.

  12. For example, it could be the case that the distribution of cases being internally appealed are those that are particularly egregious, or obviously inappropriate, denials, and that the expected appeal success rate would drop drastically if one were to appeal every denial with the adjudication processes remaining fixed.

  13. Here the word corresponding plays an important role; in some of the datasets, such as the CMS PUFs, not all plans that include internal appeal statistics provide external appeal statistics. As as a result, simply dividing the total number of external appeals by the total number of upheld internal appeals in the dataset would not provide a true measure of an external appeal rate. The denominator would be inaccurately inflated.

  14. There are many possible explanations, and it is impossible from this data alone to determine the cause. For example, it could be that given the same large pool of external denials one review organization would consistently have a much lower overturn rate than the others (this explanation would be problematic, and worthy of investigation to protect consumers). Alternatively, it could be that the different organizations would agree on a large fraction of adjudication decisions if they were given the same pool of denials, and the cause of the discrepancy instead has to do with which population of cases are getting distributed to each organization (this would not necessarily be problematic, though might merit explanation).

  15. While some may claim that existing expedited processes for urgent internal (and simultaneous external) appeals solve this problem, this is a faulty argument. Even expedited internal processes add days to the total time taken to get an appeal resolved, and pose administrative burdens to patients seeking care, and their providers. The administrative burdens (i.e. the work it requires from a patient in coordination with their provider to prepare and submit an internal appeal) take time that is often left unconsidered when estimating the total expected delay that will be caused by review of the internal appeal itself; in other words, even if a patient is able to submit an internal and external appeal simultaneously in an urgent situation, they will need to spend extra time preparing and submitting the internal portion of the appeal than they would if such steps were not required, which is liable to cause delays in their care. Even a delay of a few days, which is the minimum expected delay in expedited cases, unrealistically assuming there is zero time contribution necessary to coordinate with a physician and prepare the appeal, can lead to serious outcomes for those in dire medical situations.