List of publications in data science
This is a list of publications in data science, generally organized by order of use in a data analysis workflow.
This list is applied, business-oriented, and cross-disciplinary; for more research-based and fundamental publications, see the list of publications in statistics.
General article inclusion criteria are:
- Papers from notable practitioners or notable professors, either with a Wikipedia page or reference to their notability
- Publications covering common knowledge that all data professionals should have, with references validating this claim
- Highly cited applied statistics and machine learning publications
- Discussion-facilitating papers on the field of data science as a whole
When possible, a reference is used to validate the inclusion of the publication in this list.
History
Statistical Modeling: The Two Cultures
Data Scientist: The Sexiest Job of the 21st Century
50 Years of Data Science
The Composable Data Management System Manifesto
Data collection and organization
Tidy Data
Data Organization in Spreadsheets
Data visualizations
Quantitative Graphics in Statistics: A Brief History
Tooling
Hidden Technical Debt in Machine Learning Systems
A Few Useful Things to Know About Machine Learning