Despite these problems, the Innovation Center holds promise as a tool of policymaking to produce genuine improvements in spending and quality in federal health programs. But to match its lofty mission, the Innovation Center needs to reorient around the concept that models are demonstrations. Importantly, all of these demonstrations should be true and limited demonstrations, and mandatory demonstrations should be prioritized. If a model is shown to work, then Congress can make its feature permanent, or CMS can pursue broader rulemaking.
True Demonstrations
Statute allows the Innovation Center to waive existing laws and regulations governing Medicare and Medicaid in order to implement its models. The core purpose of the Innovation Center models is to experiment with policy changes with the goal of producing savings and improving quality. “True demonstrations” are experiments that seek to test if a policy saves taxpayer dollars and improves quality. Experimentation—and the ability to learn from the demonstration—should be at the heart of every model.
The Innovation Center has not always abided by this core principle. For example, the Biden administration attempted to use the Innovation Center to implement a Two Dollar Drug List Model, which the Trump administration ended before it could be implemented. The proposed model—flat $2 copays for select CMS-listed generic drugs across all benefit phases—was not an experiment: Roughly 20 percent of beneficiaries (mostly in Medicare Advantage) already have $2 generic copays. In 2024, nine of 14 national stand-alone prescription drug plans offered $0 copays for preferred generics, and the median copay for non-preferred generics was $5. This “model” was simply an attempt to standardize and potentially expand this benefit offering (even to non-generics) and make it mandatory. This would have pushed up premiums and, because of the IRA’s 6 percent cap on premium increases, federal subsidies. Real experiments test novel ideas against credible counterfactuals. This proposal did neither and risked misuse of Innovation Center authority.
The Innovation Center should make true experimentation a core standard of all models. Models, like all other scientific experiments, should be designed, tested, and evaluated. They need to demonstrate measurable results—not simply continue operating because they are politically popular or administratively easier than pursuing legislative change.
Time Limits
For similar reasons, demonstrations need to have clear time limitations, with extensions generally avoided. Given the broad waiver authority, models without time limits could be continued indefinitely and after the model has stopped demonstrating anything, particularly if a demonstration is politically popular. Once a demonstration reaches the end of its prescribed time period, the Innovation Center should conduct its standard end-of-demonstration evaluation and make a recommendation to Congress and CMS on whether to implement the model on a nationwide scale.
Limited in Size and Scope
There are important reasons to consider limits on the size and scope of models. The purpose of the Innovation Center is to test innovative models, not to implement sweeping policy under the guise of a model. As such, for example, demonstrations that include the entire (or nearly the entire) Medicare or Medicaid population allow for untestable policy to functionally become the law of the land without the appropriate congressional or regulatory actions. Potential limits on models could be to implement the policy change only in a distinct set of regions, states, or metropolitan statistical areas (MSAs) or a subset of beneficiaries across the nation, including through randomization. Under current statute, a model may be expanded beyond its initial scope if (1) the Secretary of Health and Human Services (HHS) determines that the model is likely to save money without reducing quality or improve quality without increasing spending, (2) the chief actuary of CMS determines that the expansion would reduce net program expenditures or not result in an increase, and (3) the Secretary of HHS determines that such expansion would not deny or limit coverage or the provision of benefits. Going forward, the Innovation Center should adopt a policy of doing expansions only through full notice-and-comment rulemaking. For demonstrations the Innovation Center would like to be made permanent or span the entire (or nearly the entire) Medicare or Medicaid population, the administration should instead request that Congress expand such models through statute. On top of the Innovation Center adopting these policies on its own, Congress should consider putting such guardrails into statute.
Furthermore, there is merit to limiting the number of models that are active at any given point in time. For example, in 2023, 39 percent of all Medicare fee-for-service beneficiaries were in Medicare accountable care organizations (groups of providers that aim to coordinate care and share in the cost or savings of episodes of care), including those in the Medicare Shared Savings Program (which is not an Innovation Center model). This large proliferation makes it difficult to test models without other programs interfering with the results. CBO has noted that the proliferation of multiple models within single health care systems can create conflicting incentives for providers. These overlapping models and programs may also increase administrative burden and complexity, which can create a disincentive for small providers to participate, as discussed previously. More transparency regarding the overlap of models and attribution of savings would help mitigate this particular concern. It would also provide public insight into the impact of models on certain subsets, such as smaller providers and beneficiaries who rely on them.
When smaller providers are more likely to eschew models due to complex administrative requirements, it means larger providers make up the bulk of participants in models, and thus the results and lessons reflect only the experience of those larger providers. One way to mitigate this is to create separate tracks for smaller providers, assuming that can be done without undermining the utility of any benchmark as discussed above. Without smaller providers participating one way or another, the model cannot show if there were any potentially negative effects on these smaller providers. As such, any program-wide changes made to Medicare payments may unintentionally bias changes in Medicare payments against smaller providers.
The effect of administrative complexity extends to the internal operations of CMS generally as well, which has a history of being hamstrung by technological and operating deficiencies that have resulted in the loss of millions of dollars in waived or improper payments. Fewer models would help mitigate this issue and allow more resources to be focused on ensuring effective model implementation.
Ensuring that models are limited in size, scope, and length of time and requiring transparency along the lines outlined above also mitigates concerns about a potential “medallion effect”—in which early model participants obtain advantages. But most importantly, such limitations collectively safeguard against future abuses that attempt to change policy through demonstrations on shaky bases, including with respect to lower or same cost or quality, which are effectively subject only to administrative self-restraint.
Prioritize Mandatory Demonstrations
The Innovation Center should prioritize mandatory demonstrations. Concerns about the potential for mandatory models to become de facto changes to law can be ameliorated by ensuring that all models are limited and fulfill the definition of a true demonstration, as described above. Mandatory demonstrations eliminate the favorable selection problem whereby participants choose only models they are likely to profit from. Ultimately, the true potential of a model to achieve program-wide savings is unclear if the participants are functionally limited to only those who are best equipped to achieve those savings.
Voluntary models can succeed if the benchmarks (and incentives) are crafted correctly. However, this has proven very difficult to accomplish, particularly when benchmarks are made prospectively. The downsides of voluntary models typically outweigh the greater difficulty in implementing mandatory models.
Mandatory models require the Innovation Center to go through the rulemaking process, which significantly delays a demonstration, and comments from stakeholders and entrenched interests can also impact the program’s design. When mandatory models are not practical for political or bureaucratic reasons, the alternative options presented to providers should be limited and not simply allow the target providers to maintain the status quo (with exceptions for any necessary control groups). The recent WISeR model released by the Innovation Center has narrowed the choices that providers have in cooperating with participating companies without subjecting them to mandates (see text box). This approach to voluntary models is a clever way to reduce the problem of favorable selection.