The Democratizing Data initiative has a number of associated publications, reports, and additional resources that provide additional information on our guiding principles and processes.
Public data are foundational to our democratic system. People need consistently high-quality information from trustworthy sources. In the new economy, wealth is generated by access to data; government's job is to democratize the data playing field. Yet data produced by the American government are getting worse and costing more. In Democratizing Our Data, Julia Lane argues that good data are essential for democracy. Her book is a wake-up call to America to fix its broken public data system.
Julia Lane, Ernesto Gimeno, Ekaterina Levitskaya, Zheyuan Zhang, and Alberto Zigoni
This article describes how data science techniques—machine learning and natural language processing—can be used to open the black box of government data. It then describes how an incentive structure can be established—using human–computer interaction techniques —to create a new and sustainable data ecosystem. The particular focus is on the United States and on scientific researchers, who are major users of government data. However, the approach can be deployed to other use cases, such as data mentions in newspapers and government reports, and many other countries.
Edited by Julia Lane, Ian Mulvany, and Paco Nathan
This ground-breaking book explores how automating the search for and discovery of datasets can help tackle irreproducibility in social science.
The Show US the Data podcast series is a series of six five-minute podcasts, discussing a variety of data sets from agencies that participated in the Show US the Data pilot study.