stethoscopeHRSA Healthcare Resources Dataset

About the Dataset

The HRSA Healthcare Resources Dataset provides comprehensive, county-level access to US healthcare workforce, shortage area designations, and health center locations from the Health Resources and Services Administration. Four government datasets are combined into a single database: physician and nursing workforce counts (AHRF), medically underserved area designations (MUA/MUP), health professional shortage area scores (HPSA), and 14,000+ federally qualified health center sites.

Quick Access

Tables: AHRF, SHORTAGE_DESIGNATIONS, HPSA, HEALTH_CENTER_SITES Sources: 4 government data sources from data.hrsa.gov Update Frequency: AHRF annual, Shortage Designations monthly, HPSA and Health Center Sites quarterly Geography: All 3,200+ US counties

Overview

The HRSA Healthcare Resources Dataset provides comprehensive access to US healthcare workforce and shortage data including:

  • AHRF (Area Health Resources Files) - Physician and nursing workforce counts with per-100k population rates by county

  • SHORTAGE_DESIGNATIONS - Medically Underserved Areas and Populations (MUA/MUP) designations by county

  • HPSA - Health Professional Shortage Area scores for Primary Care, Dental, and Mental Health by county

  • HEALTH_CENTER_SITES - FQHC and Look-Alike service delivery site locations with addresses and coordinates

Metadata Tables

Every Dataplex data product includes these standard metadata tables:

Table
Purpose

FEEDS

Dataset catalog — available tables, descriptions, update dates

FEEDS_FILES

Batch load history with is_latest flag for data freshness

CHANGELOG

Change log — data loads, schema changes, corrections

DATA_DICTIONARY

Column descriptions for all tables

Entity Relationship Diagram

spinner

All four data tables join on fips_county for cross-source county-level analysis. Each table links to FEEDS and FEEDS_FILES via feed_id and feeds_files_id for data lineage and freshness tracking.

Data Tables

AHRF (Area Health Resources Files)

Health workforce data by US county — physicians, nurses, and population with per-100k rates. Unpivoted to latest and previous data periods for trend analysis.

Key Features:

  • ~3,235 rows per vintage (one per US county)

  • Physician and nursing counts with population-adjusted rates

  • Latest and previous data years for year-over-year comparison

  • Annual updates from HRSA (~Q1 each year)

Column Reference

Column
Type
Description

fips_county

VARCHAR

Five-digit Federal county code (state FIPS + county FIPS)

county_name

VARCHAR

County name with state abbreviation (e.g., New York, NY)

state_name

VARCHAR

Full state name

state_abbrev

VARCHAR

Two-letter USPS state abbreviation

fips_state

VARCHAR

Two-digit state FIPS code

vintage_year

VARCHAR

AHRF release identifier (e.g., '2024-2025'). First four digits = latest data year in that release.

data_year

NUMBER

Calendar year for this data point (e.g., 2024). Query WHERE data_year = 2024 for specific years.

total_mds

NUMBER

Non-federal physicians (MDs) in the county

rn_full_time

NUMBER

Full-time registered nurses

rn_part_time

NUMBER

Part-time registered nurses

rn_total

NUMBER

Total registered nurses (full-time + part-time)

lpnlvn_full_time

NUMBER

Full-time licensed practical/vocational nurses

lpnlvn_part_time

NUMBER

Part-time licensed practical/vocational nurses

lpnlvn_total

NUMBER

Total LPN/LVNs (full-time + part-time)

nursing_workforce_total

NUMBER

All nurses: RN + LPN/LVN, full and part-time

population

NUMBER

County population estimate (NULL for some territories)

mds_per_100k

NUMBER

Physicians per 100,000 population

nurses_per_100k

NUMBER

Total nursing workforce per 100,000 population

feed_id

VARCHAR

FK to FEEDS — identifies which dataset this row belongs to

feeds_files_id

VARCHAR

FK to FEEDS_FILES — identifies which batch loaded this data

created_at

TIMESTAMP

When the source data was loaded into the warehouse

updated_at

TIMESTAMP

When the table was last rebuilt

SHORTAGE_DESIGNATIONS (MUA/MUP)

Medically Underserved Areas and Populations by county — federal shortage area designations with population counts.

Key Features:

  • ~3,200 rows (one per US county)

  • MUA/MUP designation counts and affected population

  • Derived from HRSA shapefiles with geospatial county aggregation

  • Monthly updates from HRSA

Column Reference

Column
Type
Description

fips_county

VARCHAR

Five-digit Federal county code

mua_designation_count

NUMBER

Number of MUA/MUP designations in the county

mua_population

NUMBER

Population in Medically Underserved Areas within the county

feed_id

VARCHAR

FK to FEEDS — identifies which dataset this row belongs to

feeds_files_id

VARCHAR

FK to FEEDS_FILES — identifies which batch loaded this data

created_at

TIMESTAMP

When the source data was loaded into the warehouse

updated_at

TIMESTAMP

When the table was last rebuilt

HPSA (Health Professional Shortage Areas)

Health Professional Shortage Areas by county and discipline — Primary Care, Dental, and Mental Health shortage scores aggregated to county level.

Key Features:

  • ~18,000 designation records across 3 disciplines

  • HPSA scores (0-26 scale, higher = more severe shortage)

  • Provider shortage estimates (additional providers needed)

  • Quarterly updates from HRSA with snapshot-based history

Column Reference

Column
Type
Description

fips_county

VARCHAR

Five-digit Federal county code

county_name

VARCHAR

County name

state_name

VARCHAR

Full state name

state_abbrev

VARCHAR

Two-letter USPS state abbreviation

discipline

VARCHAR

HPSA discipline: PRIMARY_CARE, DENTAL, or MENTAL_HEALTH

snapshot_month

VARCHAR

Month this data snapshot was captured (e.g., '2026-02'). Use for trend analysis across time periods.

hpsa_score

NUMBER

Worst (highest) HPSA score in this county for this discipline (0-26 scale, higher = more severe shortage)

designation_count

NUMBER

Number of active HPSA designations in this county for this discipline

designated_population

NUMBER

Total population in designated shortage areas within this county

underserved_population

NUMBER

Estimated underserved population in this county for this discipline

provider_shortage

NUMBER

Number of additional providers needed to remove the shortage designation

last_designation_update

DATE

Most recent designation update date for this county and discipline

feed_id

VARCHAR

FK to FEEDS — identifies which dataset this row belongs to

feeds_files_id

VARCHAR

FK to FEEDS_FILES — identifies which batch loaded this data

created_at

TIMESTAMP

When the source data was loaded into the warehouse

updated_at

TIMESTAMP

When the table was last rebuilt

HEALTH_CENTER_SITES (FQHC and Look-Alike Sites)

FQHC and Look-Alike service delivery sites — individual facility locations with addresses, coordinates, and site types.

Key Features:

  • ~14,000 individual facility locations

  • Addresses, lat/long coordinates, and operating hours

  • Site status and health center type classification

  • Quarterly updates with snapshot-based history

Column Reference

Column
Type
Description

site_key

VARCHAR

Site identifier (MD5 hash of organization + site name + address)

health_center_type

VARCHAR

Type of health center (e.g., Community Health Center, Migrant Health Center, Look-Alike)

health_center_name

VARCHAR

Organization name operating this site

site_name

VARCHAR

Name of this service delivery site

site_address

VARCHAR

Street address (suppressed for domestic violence shelters)

site_city

VARCHAR

City where the site is located

state_abbrev

VARCHAR

Two-letter USPS state abbreviation

zip_code

VARCHAR

ZIP code (suppressed for domestic violence shelters)

phone

VARCHAR

Site telephone number

operating_hours_per_week

NUMBER

Hours per week the site is open for patient visits

site_status

VARCHAR

Current site status (e.g., Active, Inactive)

fips_county

VARCHAR

Five-digit Federal county code — join key to AHRF, HPSA, and other county-level tables

county_name

VARCHAR

County name

state_name

VARCHAR

Full state name

longitude

NUMBER

Site longitude (WGS84)

latitude

NUMBER

Site latitude (WGS84)

snapshot_month

VARCHAR

Month this data snapshot was captured (e.g., '2026-02'). Use for trend analysis across time periods.

feed_id

VARCHAR

FK to FEEDS — identifies which dataset this row belongs to

feeds_files_id

VARCHAR

FK to FEEDS_FILES — identifies which batch loaded this data

created_at

TIMESTAMP

When the source data was loaded into the warehouse

updated_at

TIMESTAMP

When the table was last rebuilt

Data Quality

Standardization

  • Column mapping: AHRF source columns embed year suffixes (e.g., MD_NF_24, MD_NF_23) that change annually. An automated column mapper produces year-agnostic names (total_mds, mds_per_100k).

  • Type casting: All raw data arrives as VARCHAR and is cast to appropriate types (NUMBER, DATE) in the staging layer using TRY_TO_NUMBER().

  • Geospatial aggregation: Shortage Designation shapefiles are intersected with county boundaries to produce county-level MUA/MUP summaries.

  • HPSA aggregation: Component-level HPSA records are aggregated to county level per discipline, taking the worst (highest) score per county.

  • Deduplication: Each source publishes full files. The pipeline deduplicates to the latest version per primary key.

Data Freshness

Check when data was last updated:

Getting Started

Platform Schema Reference

This dataset is available on both Snowflake and Databricks. Queries use schema-only references — the database is already set by the share or catalog context:

Platform
Schema
Example

Snowflake

DWV

DWV.AHRF

Databricks

hrsa_dwv

hrsa_dwv.ahrf

Discover Available Data

Start with the FEEDS table to see what's available, and FEEDS_FILES to understand data freshness and load history.

Working with Data Lineage

Every data row links to FEEDS_FILES via feeds_files_id, which tells you exactly which batch loaded that data. Use this to filter to the current data version or trace any row back to its source load.

Healthcare Workforce Analysis

FQHC Network Analysis

Complete County Health Profile

Payer Network Adequacy

Tracking Data Changes Over Time

FEEDS_FILES records every batch load with row_count_delta showing what changed. Use this to monitor source data updates and understand how the dataset evolves.

circle-check
circle-check

HRSA Data Sources

Last updated