Social data science
Social data science is an interdisciplinary field that addresses social science problems by applying or designing computational and digital methods. As the name implies, Social Data Science is located primarily within the social sciences, but it relies on technical advances in fields like data science, network science, and computer science. The data in Social Data Science is always about human beings and derives from social phenomena, and it can be structured or unstructured. The goal of Social Data Science is to yield new knowledge about social networks, human behavior, cultural ideas and political ideologies.
A social data scientist combines domain knowledge and specialized theories from the social sciences with programming, statistical and other data analysis skills.
Methods
Social data science employs a wide range of quantitative methods, both established methods in social science and new methods developed in computer science and interdisciplinary data science fields such as natural language processing and network science. Social Data Science is closely related to Computational Social Science, but also sometimes includes qualitative data and mixed digital methods.
Common social data science methods include:
Quantitative methods:
- Machine learning
- Deep learning
- Social network analysis
- Randomized controlled trials
- Natural language processing, especially through text as data
- Surveys
Qualitative methods:
- Interviewing
- Observation
- Ethnography
- Content analysis
- Discourse analysis
Mixed digital methods:
- Controversy mapping
- Spatial analysis
- Quali-quantitative methods
- Computational ethnography
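As a rough illustration of one listed method, social network analysis, the following minimal sketch computes degree centrality (the share of other people each person is directly tied to) for a small undirected network. The function name, the node names and the ties are hypothetical and serve only as an example; real studies typically rely on dedicated network libraries.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Return each node's degree divided by (n - 1) for an undirected graph."""
    neighbors = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-loops
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}


# Hypothetical friendship ties among four people (illustrative data only).
ties = [("ana", "ben"), ("ana", "cal"), ("ana", "dee"), ("ben", "cal")]
centrality = degree_centrality(ties)
```

In this toy network "ana" is tied to everyone else and therefore receives the highest score, while "dee", with a single tie, receives the lowest.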
Data
Social data scientists use both digitized data and natively digital data. Since such data often take the form of found data that were originally produced for purposes other than research, data scraping, cleaning and other forms of preprocessing and data mining occupy a substantial part of a social data scientist's job.
Sources of SDS data include:
- Text data
- Sensor data
- Register data
- Survey data
- Geo-location data
- Observational data
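The cleaning and preprocessing of found text data mentioned above can be sketched as follows. This is a minimal example under stated assumptions: the posts are invented, the helper name `clean_and_tokenize` is hypothetical, and the cleaning rules (lowercasing, removing URLs, keeping only alphabetic tokens) are one simple choice among many.

```python
import re
from collections import Counter

# Assumed cleaning rules: strip URLs, keep lowercase alphabetic tokens.
URL_RE = re.compile(r"https?://\S+")
TOKEN_RE = re.compile(r"[a-z']+")


def clean_and_tokenize(text):
    """Lowercase the text, remove URLs, and split it into word tokens."""
    text = URL_RE.sub(" ", text.lower())
    return TOKEN_RE.findall(text)


# Hypothetical social media posts (illustrative data only).
posts = [
    "Loving the new park! https://example.org/park",
    "The park is closed again...",
]

# Count how often each token occurs across all posts.
counts = Counter(tok for post in posts for tok in clean_and_tokenize(post))
```

The resulting token counts are the kind of simple "text as data" representation that later analysis steps, such as topic or sentiment models, would build on.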
Relations to other fields
Social sciences
Social data science is part of the social sciences along with established disciplines and newer interdisciplinary fields like behavioral science, criminology, international relations, and cognitive science. As such, its fundamental units of study are social relations, human behavior and cultural ideas, which it investigates by using quantitative and/or qualitative data and methods to develop, test and improve fundamental theories concerning the nature of the human condition. SDS also differs from traditional social science in two ways.
- First, its primary object is digitized phenomena and data in the widest sense of this word, ranging from digitized text corpora to the footprints gathered by digital platforms and sensors.
- Second, more than simply applying existing quantitative and qualitative social science methods, social data science seeks to develop and disrupt these through the import and integration of state-of-the-art data science techniques.
Data Science
Computational Social Science
Like computational social science, social data science uses data science methods to solve social science problems. This includes the reappropriation and refinement of methods developed by data scientists to better fit the questions and data of the social sciences as well as their specialized domain knowledge and theories. Unlike computational social science, social data science also includes critical studies of how digital platforms and computational processes affect wider society and of how computational and non-computational approaches integrate and combine.
Digital Methods
While most social data science researchers are closely affiliated with or part of computational social science, some qualitatively oriented social data scientists are influenced by the fields of digital humanities and digital methods that emerged from science and technology studies. As in digital methods, the aim here is to repurpose the 'methods of the medium' to study digitally mediated society and to engage in ongoing discussions about bias in science and society by bringing computational social science and Digital Methods into dialogue. SDS is also related to digital sociology and digital anthropology, but to a higher degree aspires to augment qualitative data and digital methods with state-of-the-art data science techniques.
History of the field
The origin of the term "social data science" coincided with the emergence of a number of research centers and degree programs. In 2016, the Copenhagen Center for Social Data Science - the first academic institution using the SDS name - was launched at the University of Copenhagen. The plan for an interdisciplinary center working at the intersection of the social and computational sciences was rooted in the Copenhagen Networks Study, conducted from 2011 to 2016 by researchers from the Technical University of Denmark and the University of Copenhagen. The University of Oxford and the University of Copenhagen were among the first research institutions to offer degree programmes in SDS. In 2018, the University of Oxford launched the one-year MSc in Social Data Science, which was followed by the two-year master's programme at the University of Copenhagen in 2020. Since then, an increasing number of universities have begun to offer graduate programs or specializations in social data science.
Social data science emerged following the increasing availability of digitized social data, sometimes referred to as Big Data, and the ability to apply computational methods to this data at a low cost, which has offered novel opportunities to address questions about social phenomena and human behavior. While the origin of social data can be traced back to the 1890s, the social data boom in the 21st century is a direct consequence of the increasing availability of consumer data resulting from the advent of e-commerce. Subsequent waves of availability of unstructured social data include the Amazon.com review system and Wikipedia, and more recently, social media, which has played a key role in the emergence of the digital attention economy and big tech.
Criticism and debates
Data scientists have played a vital role in the data revolution, both during the original tech-optimist phase, when big data and the Internet were seen as the solution to many societal and scientific problems, and as participants in the tech-lash that followed in its wake as a result of, among other things, the Facebook–Cambridge Analytica data scandal. Social data science researchers and research projects have been especially impactful in debates and criticism revolving around:
- Surveillance capitalism
- Digital disinformation
- Algorithmic bias
- The replication and validity crisis in the social sciences
- Ethics and privacy
- Data governance
Impact and examples
- Nature Human Behaviour
- Nature Computational Science
- The Journal of Computational Social Science
- Big Data and Society
- Science Advances
- Nature Communications
- Scientific Reports
- PLOS ONE
Education and Research Institutions
There are multiple specific definitions of social data science, but several institutions around the world currently offer degree and research programs under the rubric of Social Data Science.
Education
- M.Sc. in Social Data Science - University of Copenhagen
- M.Sc. in Social Data Science and Policy - Amrita Vishwa Vidyapeetham
- MSc in Social Data Science - University of Oxford
- MSc in Social and Economic Data Science - University of Konstanz
- BSc in Social Data Science - University of Hong Kong
- M.S. in Social Data Science - Pohang University of Science and Technology
- P.Grad.Dip in Social Data Science - University of Dublin
- MSc Applied Social Data Science - The London School of Economics
- Master of Science in Social Data Science - Central European University
- MSc Social Data Science - University of Essex
- MSc in Techno-Anthropology - University of Aalborg
- MSc Social Data Science - University College Dublin
- BSc in Social Data Science - Witten/Herdecke University
- Quantitative Analysis and Social Data Science - KU Leuven
- Human and Social Data Science MSc - University of Sussex
- International Diploma in Data Science for the Social Sciences - Facultad Latinoamericana de Ciencias Sociales