Social Science One

Social Science One
Name	Social Science One
Formation	2018
Founders	Harvard University; Microsoft Research (collaboration origins); private-sector partners
Type	Research consortium
Location	Cambridge, Massachusetts; also associated with Menlo Park, California and Palo Alto, California
Focus	Social science research, data sharing, computational social science

Contents

Background and Origins
Structure and Governance
Data Partnerships and Releases
Research Projects and Outcomes
Privacy, Ethics, and Legal Issues
Criticisms and Controversies

Social Science One is a research consortium founded in 2018 to enable large-scale empirical research by pairing academic teams with proprietary datasets from private-sector technology platforms. It was created following public debates and legal actions involving major technology firms and academic institutions, with the goal of fostering collaboration between scholars and companies while addressing access, privacy, and reproducibility challenges. The initiative has been associated with collaborations involving leading universities, major technology companies, and interdisciplinary research groups across the United States and Europe.

Background and Origins

Social Science One emerged in the aftermath of high-profile events and inquiries concerning data use by technology platforms, including scrutiny connected to Cambridge Analytica, litigation such as actions involving Facebook, Inc., and investigations by regulators like the Federal Trade Commission. The project was announced amid conversations among scholars from institutions including Harvard University, Princeton University, Yale University, University of Pennsylvania, and technology leaders from firms with operations in Menlo Park, California and Seattle, Washington. Influences on its creation included prior collaborations and reports from entities such as The Brookings Institution, The Alan Turing Institute, National Science Foundation, and advisory panels convened after the revelations that involved actors like Christopher Wylie and companies tied to SCL Group.

Structure and Governance

The consortium was organized as a partnership model linking academic teams, corporate data stewards, and independent governance structures. Governing participants included representatives from universities such as Harvard University, Columbia University, Stanford University, and research organizations like The Center for European Policy Studies and Data & Society Research Institute. The oversight apparatus drew on principles from frameworks advocated by bodies like OECD and Council of Europe and included ethics review mechanisms akin to processes at institutional review boards at University of Chicago and Massachusetts Institute of Technology. Funding and administrative support involved philanthropy and institutional commitments similar to foundations like Ford Foundation and Open Society Foundations in collaborative models elsewhere.

Data Partnerships and Releases

A core activity was negotiating data-sharing agreements between academic teams and platform providers. Partner firms with which processes were publicly discussed included companies headquartered in Menlo Park, California and Mountain View, California, with legal contexts influenced by precedents such as litigation against Facebook, Inc. and regulatory actions by bodies like the European Commission and national data protection authorities including Information Commissioner's Office in the United Kingdom. Data releases were structured to balance research requirements described by investigators from Princeton University, University of Michigan, University of California, Berkeley, New York University, and University of Oxford, while incorporating controls used in other collaborations, such as data enclaves similar to those at Inter-university Consortium for Political and Social Research and access agreements modeled after practices at RAND Corporation.

Research Projects and Outcomes

Academic teams conducted studies on topics intersecting technology platforms and public life, involving scholars from groups at Harvard Kennedy School, Columbia University Mailman School of Public Health, Stanford Graduate School of Business, Yale School of Management, London School of Economics, University of Cambridge, and University of Toronto. Projects produced findings concerning information flows, media ecosystems, and civic engagement that referenced casework around events such as the 2016 United States presidential election, the Brexit referendum, and the role of platforms during public-health crises like the COVID-19 pandemic. Outcomes were disseminated via venues such as Science, Nature, Proceedings of the National Academy of Sciences, and conferences like International Communication Association and Association for Computing Machinery.

Privacy, Ethics, and Legal Issues

The initiative operated at the intersection of privacy law and research ethics, engaging with statutes and regimes such as the General Data Protection Regulation and litigation trends shaped by decisions of courts in United States District Court for the Northern District of California and rulings involving companies similar to Facebook, Inc.. Ethical deliberations involved scholars affiliated with institutions like Georgetown University and University of California, Los Angeles and adhered to norms exemplified by guidelines from American Association for Public Opinion Research and institutional review boards at Duke University. Technical measures for privacy protection drew on methods similar to differential privacy research at Microsoft Research and synthetic-data approaches discussed at venues like NeurIPS and ICML.

Criticisms and Controversies

The consortium faced critique from academics, journalists, and civic groups including commentators associated with The New York Times, The Washington Post, The Guardian, and think tanks such as Brennan Center for Justice and Electronic Frontier Foundation. Criticisms targeted transparency, access constraints, and perceived delays, echoing earlier debates involving Cambridge Analytica and inquiries by bodies like the U.S. Congress. Legal tensions paralleled disputes seen in cases involving corporate disclosure practices addressed by the Securities and Exchange Commission and academic debates on reproducibility promoted by organizations like Center for Open Science. Despite controversies, proponents compared the effort to other public–private research collaborations undertaken with institutions such as IBM Research, Google Research, and Apple in domains where proprietary data posed research barriers.

Category:Research consortia Category:Computational social science