user-doctor-hairCMS NPPES Provider Dataset

About the Dataset

The CMS NPPES Provider Dataset is a comprehensive collection of relational tables tracking all Centers for Medicare & Medicaid Services (CMS) National Plan and Provider Enumeration System (NPPES) unique identification numbers (NPI) and attributes for covered health providers. Available on the Snowflake Marketplace, this dataset offers:

  • Address and contact information

  • License numbers across states

  • Other identifiers

  • Business names

  • Provider taxonomies

  • A full taxonomy table broken down by code, grouping, and classification

The dataset is updated weekly with delta changes and undergoes a full refresh monthly, ensuring up-to-date information.

Dataset Features

  • Comprehensive Coverage: Includes all HIPAA-covered healthcare providers

  • Regular Updates: Weekly delta updates and monthly full refreshes

  • Rich Provider Information: Detailed attributes for each provider

  • Normalized Structure: Organized into relational tables for efficient querying

Data Quality and Maintenance

Dataplex Consulting & Data Products prioritizes data quality through:

  • Automated data quality checks in all pipelines

  • Daily monitoring of ingestion and ETL jobs

  • Timely delivery of high-quality data designed for seamless ingestion

Business Applications

Users can query various provider attributes, including:

  • Specialization

  • Location

  • Address

  • Taxonomy

  • Licenses

  • Identifiers

  • Contact data

Example Use Cases

  1. Identify all active providers with a specific primary taxonomy

  2. Find all dental providers in a particular city

  3. Retrieve all license numbers and states for a specific provider

  4. Discover recently deactivated providers

circle-check
circle-check

Data Structure

The dataset is organized into several interconnected tables:

  • PROVIDERS

  • PROVIDERS_ADDRESSES

  • PROVIDERS_LICENSES

  • PROVIDERS_IDENTIFIERS

  • PROVIDERS_TAXONOMIES

  • TAXONOMIES

Entity Relationship Diagram

CMS NPPES Provider Entity Relationship

Platform Schema Reference

This dataset is available on both Snowflake and Databricks. The table names are the same, but the schema prefix differs:

Platform
Schema
Example

Snowflake

dwv

dwv.providers

Databricks

npi_dwv

npi_dwv.providers

The examples below show queries for both platforms using tabs.

Sample Queries

1. Find providers with a specific primary taxonomy

2. Query dental providers in Houston, Texas


Get Started

circle-check

Includes

All provider tables, weekly updates, full documentation

Support

Email support included

Cancellation

Cancel anytime, no long-term commitment

circle-check

Choose Your Platform

Support and Contact

For questions or assistance with the CMS NPI Provider Dataset, please contact:

Email: [email protected]envelope

About Dataplex

Dataplex Consulting & Data Products delivers turnkey, analytics-ready data products that make complex public and commercial data easy to use across modern data platforms. Our data pipelines include automated quality checks and active monitoring to ensure timely, reliable, and well-structured data that is ready for downstream analytics, machine learning, and operational use.

In addition to data products, Dataplex provides data engineering and analytics consulting services to organizations of all sizes. We bring deep, hands-on experience supporting both early-stage companies and large enterprises, helping teams build scalable data platforms, improve data reliability, and become more data-driven.

Last updated