HRSA Healthcare Resources Dataset
About the Dataset
The HRSA Healthcare Resources Dataset provides comprehensive, county-level access to US healthcare workforce, shortage area designations, and health center locations from the Health Resources and Services Administration. Four government datasets are combined into a single database: physician and nursing workforce counts (AHRF), medically underserved area designations (MUA/MUP), health professional shortage area scores (HPSA), and 14,000+ federally qualified health center sites.
Get Full Access | Snowflake Marketplace | Free Trial
Quick Access
Tables: AHRF, SHORTAGE_DESIGNATIONS, HPSA, HEALTH_CENTER_SITES Sources: 4 government data sources from data.hrsa.gov Update Frequency: AHRF annual, Shortage Designations monthly, HPSA and Health Center Sites quarterly Geography: All 3,200+ US counties
Overview
The HRSA Healthcare Resources Dataset provides comprehensive access to US healthcare workforce and shortage data including:
AHRF (Area Health Resources Files) - Physician and nursing workforce counts with per-100k population rates by county
SHORTAGE_DESIGNATIONS - Medically Underserved Areas and Populations (MUA/MUP) designations by county
HPSA - Health Professional Shortage Area scores for Primary Care, Dental, and Mental Health by county
HEALTH_CENTER_SITES - FQHC and Look-Alike service delivery site locations with addresses and coordinates
Metadata Tables
Every Dataplex data product includes these standard metadata tables:
FEEDS
Dataset catalog — available tables, descriptions, update dates
FEEDS_FILES
Batch load history with is_latest flag for data freshness
CHANGELOG
Change log — data loads, schema changes, corrections
DATA_DICTIONARY
Column descriptions for all tables
Entity Relationship Diagram
All four data tables join on fips_county for cross-source county-level analysis. Each table links to FEEDS and FEEDS_FILES via feed_id and feeds_files_id for data lineage and freshness tracking.
Data Tables
AHRF (Area Health Resources Files)
Health workforce data by US county — physicians, nurses, and population with per-100k rates. Unpivoted to latest and previous data periods for trend analysis.
Key Features:
~3,235 rows per vintage (one per US county)
Physician and nursing counts with population-adjusted rates
Latest and previous data years for year-over-year comparison
Annual updates from HRSA (~Q1 each year)
Column Reference
fips_county
VARCHAR
Five-digit Federal county code (state FIPS + county FIPS)
county_name
VARCHAR
County name with state abbreviation (e.g., New York, NY)
state_name
VARCHAR
Full state name
state_abbrev
VARCHAR
Two-letter USPS state abbreviation
fips_state
VARCHAR
Two-digit state FIPS code
vintage_year
VARCHAR
AHRF release identifier (e.g., '2024-2025'). First four digits = latest data year in that release.
data_year
NUMBER
Calendar year for this data point (e.g., 2024). Query WHERE data_year = 2024 for specific years.
total_mds
NUMBER
Non-federal physicians (MDs) in the county
rn_full_time
NUMBER
Full-time registered nurses
rn_part_time
NUMBER
Part-time registered nurses
rn_total
NUMBER
Total registered nurses (full-time + part-time)
lpnlvn_full_time
NUMBER
Full-time licensed practical/vocational nurses
lpnlvn_part_time
NUMBER
Part-time licensed practical/vocational nurses
lpnlvn_total
NUMBER
Total LPN/LVNs (full-time + part-time)
nursing_workforce_total
NUMBER
All nurses: RN + LPN/LVN, full and part-time
population
NUMBER
County population estimate (NULL for some territories)
mds_per_100k
NUMBER
Physicians per 100,000 population
nurses_per_100k
NUMBER
Total nursing workforce per 100,000 population
feed_id
VARCHAR
FK to FEEDS — identifies which dataset this row belongs to
feeds_files_id
VARCHAR
FK to FEEDS_FILES — identifies which batch loaded this data
created_at
TIMESTAMP
When the source data was loaded into the warehouse
updated_at
TIMESTAMP
When the table was last rebuilt
SHORTAGE_DESIGNATIONS (MUA/MUP)
Medically Underserved Areas and Populations by county — federal shortage area designations with population counts.
Key Features:
~3,200 rows (one per US county)
MUA/MUP designation counts and affected population
Derived from HRSA shapefiles with geospatial county aggregation
Monthly updates from HRSA
Column Reference
fips_county
VARCHAR
Five-digit Federal county code
mua_designation_count
NUMBER
Number of MUA/MUP designations in the county
mua_population
NUMBER
Population in Medically Underserved Areas within the county
feed_id
VARCHAR
FK to FEEDS — identifies which dataset this row belongs to
feeds_files_id
VARCHAR
FK to FEEDS_FILES — identifies which batch loaded this data
created_at
TIMESTAMP
When the source data was loaded into the warehouse
updated_at
TIMESTAMP
When the table was last rebuilt
HPSA (Health Professional Shortage Areas)
Health Professional Shortage Areas by county and discipline — Primary Care, Dental, and Mental Health shortage scores aggregated to county level.
Key Features:
~18,000 designation records across 3 disciplines
HPSA scores (0-26 scale, higher = more severe shortage)
Provider shortage estimates (additional providers needed)
Quarterly updates from HRSA with snapshot-based history
Column Reference
fips_county
VARCHAR
Five-digit Federal county code
county_name
VARCHAR
County name
state_name
VARCHAR
Full state name
state_abbrev
VARCHAR
Two-letter USPS state abbreviation
discipline
VARCHAR
HPSA discipline: PRIMARY_CARE, DENTAL, or MENTAL_HEALTH
snapshot_month
VARCHAR
Month this data snapshot was captured (e.g., '2026-02'). Use for trend analysis across time periods.
hpsa_score
NUMBER
Worst (highest) HPSA score in this county for this discipline (0-26 scale, higher = more severe shortage)
designation_count
NUMBER
Number of active HPSA designations in this county for this discipline
designated_population
NUMBER
Total population in designated shortage areas within this county
underserved_population
NUMBER
Estimated underserved population in this county for this discipline
provider_shortage
NUMBER
Number of additional providers needed to remove the shortage designation
last_designation_update
DATE
Most recent designation update date for this county and discipline
feed_id
VARCHAR
FK to FEEDS — identifies which dataset this row belongs to
feeds_files_id
VARCHAR
FK to FEEDS_FILES — identifies which batch loaded this data
created_at
TIMESTAMP
When the source data was loaded into the warehouse
updated_at
TIMESTAMP
When the table was last rebuilt
HEALTH_CENTER_SITES (FQHC and Look-Alike Sites)
FQHC and Look-Alike service delivery sites — individual facility locations with addresses, coordinates, and site types.
Key Features:
~14,000 individual facility locations
Addresses, lat/long coordinates, and operating hours
Site status and health center type classification
Quarterly updates with snapshot-based history
Column Reference
site_key
VARCHAR
Site identifier (MD5 hash of organization + site name + address)
health_center_type
VARCHAR
Type of health center (e.g., Community Health Center, Migrant Health Center, Look-Alike)
health_center_name
VARCHAR
Organization name operating this site
site_name
VARCHAR
Name of this service delivery site
site_address
VARCHAR
Street address (suppressed for domestic violence shelters)
site_city
VARCHAR
City where the site is located
state_abbrev
VARCHAR
Two-letter USPS state abbreviation
zip_code
VARCHAR
ZIP code (suppressed for domestic violence shelters)
phone
VARCHAR
Site telephone number
operating_hours_per_week
NUMBER
Hours per week the site is open for patient visits
site_status
VARCHAR
Current site status (e.g., Active, Inactive)
fips_county
VARCHAR
Five-digit Federal county code — join key to AHRF, HPSA, and other county-level tables
county_name
VARCHAR
County name
state_name
VARCHAR
Full state name
longitude
NUMBER
Site longitude (WGS84)
latitude
NUMBER
Site latitude (WGS84)
snapshot_month
VARCHAR
Month this data snapshot was captured (e.g., '2026-02'). Use for trend analysis across time periods.
feed_id
VARCHAR
FK to FEEDS — identifies which dataset this row belongs to
feeds_files_id
VARCHAR
FK to FEEDS_FILES — identifies which batch loaded this data
created_at
TIMESTAMP
When the source data was loaded into the warehouse
updated_at
TIMESTAMP
When the table was last rebuilt
Data Quality
Standardization
Column mapping: AHRF source columns embed year suffixes (e.g.,
MD_NF_24,MD_NF_23) that change annually. An automated column mapper produces year-agnostic names (total_mds,mds_per_100k).Type casting: All raw data arrives as VARCHAR and is cast to appropriate types (NUMBER, DATE) in the staging layer using
TRY_TO_NUMBER().Geospatial aggregation: Shortage Designation shapefiles are intersected with county boundaries to produce county-level MUA/MUP summaries.
HPSA aggregation: Component-level HPSA records are aggregated to county level per discipline, taking the worst (highest) score per county.
Deduplication: Each source publishes full files. The pipeline deduplicates to the latest version per primary key.
Data Freshness
Check when data was last updated:
Getting Started
Platform Schema Reference
This dataset is available on both Snowflake and Databricks. Queries use schema-only references — the database is already set by the share or catalog context:
Snowflake
DWV
DWV.AHRF
Databricks
hrsa_dwv
hrsa_dwv.ahrf
Discover Available Data
Start with the FEEDS table to see what's available, and FEEDS_FILES to understand data freshness and load history.
Working with Data Lineage
Every data row links to FEEDS_FILES via feeds_files_id, which tells you exactly which batch loaded that data. Use this to filter to the current data version or trace any row back to its source load.
Healthcare Workforce Analysis
FQHC Network Analysis
Complete County Health Profile
Payer Network Adequacy
Tracking Data Changes Over Time
FEEDS_FILES records every batch load with row_count_delta showing what changed. Use this to monitor source data updates and understand how the dataset evolves.
Ready to access HRSA data?
Snowflake
Databricks
Questions? Contact our team for a walkthrough.
HRSA Data Sources
Last updated

