CMS Data Feeds Dataset

About the Dataset

The CMS Data Feeds Dataset is a comprehensive collection of all current and future CMS data feeds available on data.cms.gov. The dataset exposes each feed as a view, aligning the attributes of all files in that feed and casting them to appropriate data types. Views are updated automatically when new feed files arrive, and new views are added as CMS publishes new data feeds. As of December 2024, the dataset encompasses 190 feeds and over 1,900 feed files.

See the full list of feeds in the CMS Data Feeds Catalog.

Dataset Features

  • 38 billion records and growing

  • 190 feeds with automatic addition of new feeds

  • Over 8,000 report attributes

  • 14 years of historical data

  • 700+ GB of data, continuously expanding

  • Daily updates to ensure data freshness

  • Automatic quality checks and active monitoring

  • Designed for seamless ingestion

Data Quality and Maintenance

Dataplex Consulting & Data Products prioritizes data quality through:

  • Automated data quality checks in all pipelines

  • Daily monitoring of ingestion and ETL jobs

  • Timely updates whenever CMS publishes new data

  • Rigorous processes to ensure data integrity and reliability

Business Applications

Users can leverage this dataset for various purposes, including:

  • Enriching or augmenting existing datasets

  • Analyzing published feed metrics over time

  • Performing segmentation analysis

  • Training machine learning models

  • Conducting geospatial analysis

  • Tracking healthcare trends and performance metrics

Example Use Cases

  • Analyze telehealth adoption trends across different states

  • Evaluate the financial performance of home health agencies

  • Track Medicare spending patterns by drug or geography

  • Assess hospital performance metrics and patient satisfaction scores

  • Monitor healthcare-associated infections across facilities

Data Structure

The dataset consists of three main components:

  1. FEEDS: Contains metadata about each CMS data feed

  2. FEEDS_FILES: Stores information about individual files within each feed

  3. Feed-specific views: Separate views for each feed (e.g., PatientReported_Outcomes, Medicare_Spending)
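The three components link through keys that appear in the sample queries: FEEDS_FILES.feed_id points to FEEDS.id, and each feed-specific view carries a file_id that points to FEEDS_FILES.id. As a minimal sketch, assuming the dwv schema and the column names used in those sample queries, the following summarizes how many files each feed has and the period they cover:

```sql
-- Sketch only: schema and column names are taken from the sample queries
-- in this listing; adjust them if your share uses different names.
SELECT f.title,
       f.view_name,
       count(ff.id)           AS file_count,      -- files published for the feed
       min(ff.temporal_start) AS earliest_start,  -- start of the oldest file period
       max(ff.temporal_end)   AS latest_end       -- end of the newest file period
FROM dwv.feeds f
LEFT JOIN dwv.feeds_files ff ON ff.feed_id = f.id -- keep feeds with no files yet
GROUP BY f.title, f.view_name
ORDER BY file_count DESC;
```

The LEFT JOIN is a deliberate choice here so that a newly published feed with no loaded files still shows up with a file_count of 0.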

Entity Relationship Diagram

Sample Queries

Get CMS Feeds to Table Mapping

-- Map each CMS feed to its view and its CMS documentation page
SELECT title,
       description,
       references[0]::string AS cms_page_documentation,
       view_name AS table_name
FROM dwv.feeds;

-- List only the feeds tagged with the 'Medicare' keyword
SELECT title, description, view_name
FROM dwv.feeds
WHERE array_contains('Medicare'::variant, keywords);

Query All 2021 File Records for Medicare Inpatient Hospitals

SELECT ff.temporal_start,
       ff.temporal_end,
       v.*
FROM dwv.feeds f
JOIN dwv.feeds_files ff ON f.id = ff.feed_id
JOIN dwv.Medicare_Inpatient_Hospitals__by_Geography_and_Service v ON ff.id = v.file_id
WHERE f.title = 'Medicare Inpatient Hospitals - by Geography and Service'
  AND year(ff.temporal_start) = 2021;
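To analyze a feed over time rather than for a single year, the same join can be aggregated by file period. The query below is a hedged sketch that reuses the view and columns from the query above and assumes temporal_start is a date or timestamp column:

```sql
-- Sketch only: counts records per reporting year for one feed-specific view.
SELECT year(ff.temporal_start) AS data_year,
       count(*)                AS record_count
FROM dwv.feeds_files ff
JOIN dwv.Medicare_Inpatient_Hospitals__by_Geography_and_Service v
  ON ff.id = v.file_id          -- each view row belongs to one feed file
GROUP BY year(ff.temporal_start)
ORDER BY data_year;
```

A sudden drop or jump in record_count between years is a quick signal that a file is missing or that CMS changed the feed's granularity.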

Support and Contact

For questions or assistance with the CMS Data Feeds Dataset, please contact:

Email: [email protected]

The Dataplex Consulting & Data Products team monitors ingestion and ETL jobs daily to ensure quality and timely delivery.

About Dataplex

Dataplex Consulting & Data Products offers top-notch, turnkey data products, making data easily accessible for businesses of all sizes. With over 20 years of experience serving small businesses and Fortune 500 companies, our team has gained extensive practical expertise in enhancing data management, boosting revenue, and helping companies become more data-driven.
