Microsoft Academic (archive)

Microsoft Academic (archive)
Name	Microsoft Academic (archive)
Owner	Microsoft
Launched	2009
Dissolved	2021
Type	Academic search engine and bibliographic database

Contents

Microsoft Academic (archive) was a scholarly search service and bibliographic database developed by Microsoft Research and later maintained by Microsoft that provided indexing of scientific literature, citation graphs, and scholarly metadata. It operated alongside services such as Google Scholar, PubMed, Scopus, Web of Science and competed with platforms like Semantic Scholar, arXiv, ResearchGate and Academia.edu in discoverability and citation analysis. The project connected entities including Microsoft Research Cambridge, Microsoft Research Redmond, Allen Institute for AI, Cornell University and Stanford University through shared bibliometric practices and metadata standards.

History and development

Microsoft Research launched early iterations tied to projects led by researchers at Microsoft Research Redmond and Microsoft Research Cambridge, building on earlier initiatives such as MSN Search, Bing enhancements and the academic search prototype associated with Vannevar Bush-inspired scholarly discovery. Development involved collaborations with teams familiar with DBLP, CiteSeerX and Microsoft Academic Graph work that paralleled efforts at Google Research, IBM Research and Yahoo! Research. Over time the platform evolved through product phases that referenced initiatives at University of Washington, Carnegie Mellon University and University of California, Berkeley before Microsoft announced discontinuation that affected partnerships with institutions like National Institutes of Health, European Research Council and Wellcome Trust.

The service offered citation network features, author disambiguation, subject categorization, and API access that implicated technologies from Microsoft Azure, Azure Cosmos DB, SQL Server and machine learning frameworks from PyTorch, TensorFlow and CNTK. Its architecture centered on the Microsoft Academic Graph schema, integrating entity types similar to ORCID, CrossRef, DOI and PubMed ID identifiers while supporting export formats adopted by Zotero, EndNote and Mendeley. Search capabilities leveraged natural language processing techniques researched at Microsoft Research Redmond, University of Cambridge and Massachusetts Institute of Technology to power relevance ranking comparable to systems at Google Scholar and Semantic Scholar.

Coverage included journals, conference proceedings, books, patents and preprints from publishers and venues such as Nature (journal), Science (journal), Proceedings of the National Academy of Sciences, IEEE, ACM, Springer Nature and Elsevier. Indexing drew on ingestion pipelines similar to CrossRef metadata harvesting, open repositories like arXiv and institutional repositories housed at Harvard University, MIT Libraries and Oxford University. Subject coverage mapped to taxonomies used by Medical Subject Headings, Library of Congress Classification and domain ontologies developed at European Bioinformatics Institute and National Center for Biotechnology Information for biomedical and life sciences content.

Researchers in fields including biology, computer science, physics, chemistry and medicine used the platform alongside citation tools developed by Clarivate Analytics and metrics from Altmetric and Dimensions (database). Librarians at institutions such as University of California, University of Cambridge and University of Oxford evaluated the service for collection development and discovery while bibliometricians compared its citation counts to those from Scopus and Web of Science. The platform received attention from media outlets like The New York Times, The Guardian and Nature (journal) and was discussed at conferences including International Conference on Science and Technology Indicators, SIGIR and The Web Conference.

Microsoft announced the service closure which prompted archiving efforts by stakeholders including Internet Archive, CrossRef and academic libraries at Stanford University and Harvard University to preserve portions of the Microsoft Academic Graph for reproducibility in projects tied to Open Science and long-term data stewardship practices advocated by organizations such as SPARC and DataCite. The legacy influenced successor efforts and integrations by Semantic Scholar, OpenAlex and repository infrastructures at Zenodo and continues to inform bibliometric research, policy discussions at European Commission and data sharing standards promoted by FORCE11 and Research Data Alliance.

Category:Bibliographic databases Category:Microsoft services Category:Academic search engines