Democratizing Data

Democratizing Data builds a community-driven data ecosystem by identifying how datasets are used and reducing barriers to accessing high-quality public data. The initiative enhances the discoverability, usability, and relevance of data for researchers, policymakers, and stakeholder communities. A suite of tools and strategic partnerships supports this work by connecting users to the data, insights, and networks needed to inform decisions and generate impact.

RESOURCES

📋 Technical Report Available

This report introduces a methodology for identifying dataset mentions in research publications and comparing coverage across citation databases. Tracking these mentions helps measure the reach of federal datasets and supports future data investment decisions. The analysis also highlights the range of research topics where these datasets are used across disciplines.

Report Highlights:

  • Code repository for data cleaning and standardization
  • Data schemas by citation database
  • Standardized institution tables using IPEDS identifiers
  • Data visualizations summarizing report findings
Professional data analysis workspace with charts, graphs, and technical documentation representing comprehensive research methodology

Key Areas

FOR FEDERAL AGENCIES

Evidence-based policy making

The platform provides federal agencies with comprehensive insights into how their data assets are being utilized in scientific research. Our machine learning algorithms analyze over 90 million documents to identify dataset citations, research applications, and impact metrics that support evidence-based decision making.

FOR RESEARCHERS

Discover trusted datasets

Researchers can use our platform to discover relevant federal datasets for their work, understand how data have been used in similar studies, and access tools that facilitate responsible data use. Our search and discovery tools help connect researchers with the data they need for high-quality research.

GET INVOLVED

Ways to democratize data

The goal is to develop a community of practice through workshops, webinars, and direct engagement with the user community. The user community built the initial models through a Kaggle competition; a Show Us the Data conference brought together researchers, academic institutions, chief data officers, and publishers; subsequent conferences have built on the ideas. More are planned.

HOW TO LEARN AND CONTRIBUTE

Discover insights from leading researchers and contribute to the future of trusted data through podcasts, presentations, and community engagement.

Learning and understanding data visualization representing knowledge sharing and research insights

Learn from listening to podcasts

Listen to researchers share how they used federal data to drive meaningful insights and policy changes.

Dr. Ray Hart: National Assessment of Educational Progress

Dr. Ray Hart: used National Assessment of Educational Progress (NCES) to examine the effectiveness of large urban schools in overcoming poverty-related challenges. His report, "Mirrors or Windows," challenges myths and provides fresh perspectives on urban education.

Listen Now

Dr. Becca Jablonski: Agricultural Resource Management Survey

Dr. Becca Jablonski: used Agricultural Resource Management Survey (Food And Agricultural Research) to study the gap in support for small producers of artisan and locally produced food, relative to their large-scale counterparts. She explores sustainability policies and shares insights for researchers.

Listen Now

Understanding consumer buying choices

Dr. Chen Zhen: used retail scanner data (from Food And Agricultural Research) to construct panel price indexes to understand how changes in food prices can affect consumer behavior and ultimately public health outcomes.

Listen Now

Dr. Tiffany Oliver: Survey of Earned Doctorates

Dr. Tiffany Oliver: used Survey of Earned Doctorates (NCSES) to explore the journeys of Black women earning STEM PhDs annually. She discusses her research on this demographic's educational history and postgraduation plans.

Listen Now

Dr. Janet Currie: Vital Statistics and the Supplemental Nutrition Program for Women Infants and Children

Dr. Janet Currie: used the Vital Statistics and the Supplemental Nutrition Program for Women Infants and Children (WIC) to investigate the effectiveness of government programs and maternal participation in improving children's health and well-being.

Listen Now

Dr. Julia Lane: UMETRICS and the Survey of Earned Doctorates

Dr. Julia Lane: used UMETRICS and the Survey of Earned Doctorates (NCSES) to explore how research experience influences career choices in STEM, focusing on the impact of gender and race on doctoral education.

Listen Now

Learn from presentations and webinars

Access insights from federal agencies and research organizations through presentations, webinars, and conferences that demonstrate how data drives scientific discovery and policy impact.

Data insights and analytics visualization from presentations and webinars

Federal agencies have shared valuable insights through presentations at the Federal Committee on Survey Methodology and The Council of Professional Associations on Federal Statistics.

Explore our collection of webinars and conferences that showcase how data contributes to the value of science.

Explore Events