National Center for State, Tribal, Local and Territorial Public Health Infrastructure and Workforce
National Center For State, Tribal, Local, And Territorial Public Health Infrastructure And Workforce datasets in the CDC Open Data Catalog
This page contains all datasets in the National Center For State, Tribal, Local, And Territorial Public Health Infrastructure And Workforce category of the CDC Open Data Catalog.
Total Datasets in Category: 2 Last Updated: 07/14/2025
CDC Text Corpora for Learners: HTML Mirrors of MMWR, EID, and PCD
Description: The attached ZIP archives are part of the CDC Text Corpora for Learners program. This version, comprised of 33,567 articles, was constructed on 2024-03-01 using source content retrieved on 2024-01-09. The attached three ZIP archives contain the 33,567 articles in 33,576 compiled HTML mirrors of the MMWR Morbidity and Mortality Weekly Report including its series: Weekly Reports, Recommendations and Reports, Surveillance Summaries, Supplements, and Notifiable Diseases, a subset of Weekly Reports, constructed ad hoc; EID Emerging Infectious Diseases; and PCD Preventing Chronic Disease.There is one archive per series. The archive attachments are located in the About this Dataset section of this landing page. In that section when you click Show More, the attachments are located in the section Attachments. The retrieval and organization of the files included making as few changes to raw sources as possible, to support as many downstream uses as possible.
Schema: dwv_pub_health_infra
Table Name: cdc_text_corpora_html_mirrors_mmwr_eid___ut5n_bmc3
Dataset ID: ut5n-bmc3
Category: National Center for State, Tribal, Local, and Territorial Public Health Infrastructure and Workforce
Tags: ai, corpora, corpus, data science, eid
CDC Text Corpora for Learners: MMWR, EID, and PCD Article Metadata
Description: This landing page is part of the CDC Text Corpora for Learners program; this includes the compiled 33,576 CDC Text for Learners HTML mirrors of the MMWR Morbidity and Mortality Weekly Report including its series: Weekly Reports, Recommendations and Reports, Surveillance Summaries, Supplements, and Notifiable Diseases, a subset of Weekly Reports, constructed ad hoc; EID Emerging Infectious Diseases; and PCD Preventing Chronic Disease The data represented here is the tabulated metadata of the combined 33,567 articles of the MMWR, EID, and PCD collections whose contents are organized into three ZIP archived JSON files per collection. The JSON value output formats include UTF-8 HTML, UTF-8 markdown, and ASCII plain text. The JSON files are located in the program's repository. This version was constructed on 2024-03-01 using source content retrieved on 2024-01-09.
Schema: dwv_pub_health_infra
Table Name: cdc_text_corpora_learners_mmwr_eid_pcd___7rih_tqi5
Dataset ID: 7rih-tqi5
Category: National Center for State, Tribal, Local, and Territorial Public Health Infrastructure and Workforce
Tags: corpora, corpus, data science, eid, harvest-cdc-journals
Last updated