LLMpediaThe first transparent, open encyclopedia generated by LLMs

DataCite

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Elsevier Hop 3
Expansion Funnel Raw 1 → Dedup 1 → NER 0 → Enqueued 0
1. Extracted1
2. After dedup1 (None)
3. After NER0 (None)
Rejected: 1 (not NE: 1)
4. Enqueued0 ()
DataCite
NameDataCite
Formation2009
TypeNonprofit organization
HeadquartersUnknown
Region servedInternational

DataCite is a global nonprofit organization coordinating persistent identifier services for research data, supporting open science, reproducible research, and scholarly communication. The organization collaborates with publishers, libraries, archives, funders, and research infrastructures to enable discovery, citation, and reuse of datasets across platforms such as Crossref, ORCID, Zenodo, and Figshare. DataCite's activities intersect with major initiatives and institutions including the International Science Council, Research Data Alliance, European Commission, National Science Foundation, and World Data System.

History

Founded in 2009, DataCite emerged from collaborations among institutions including the British Library, Deutsche Nationalbibliothek, and the California Digital Library influenced by international efforts such as the Royal Society, CERN, and the Organisation for Economic Co-operation and Development. Early milestones involved partnerships with registries like Crossref, DOI Foundation, and Handle System stewards, alongside projects funded by the European Union, Horizon 2020, and national agencies such as the German Research Foundation and the Australian Research Council. Over time the organization engaged with repositories and infrastructures including Dryad, PANGAEA, EMBL-EBI, and DANS to scale DOI assignment, metadata aggregation, and preservation workflows in coordination with initiatives like FORCE11, CODATA, and the Global Biodata Coalition.

Mission and Governance

DataCite's mission emphasizes persistent identification, standardized metadata, and interoperability supporting stakeholders such as libraries, archives, museums, funders, and publishers including Elsevier, Springer Nature, Wiley, and Taylor & Francis. Governance incorporates representatives from member organizations, national consortia, university presses, and data centers such as Harvard Library, Stanford University, ETH Zurich, and the National Institute of Standards and Technology, operating within frameworks influenced by the DOI Foundation, ICANN, and the International Organization for Standardization. Strategic planning aligns with policy priorities from the European Research Council, Wellcome Trust, Gates Foundation, and UK Research and Innovation while liaising with standards bodies like W3C and ISO technical committees.

DOI Registration and Metadata Services

DataCite provides DOI registration services in collaboration with registration agencies and service providers like Crossref, mEDRA, and CNMARC, implementing metadata schemas interoperable with schema.org, Dublin Core, and CERIF. Metadata services support bibliographic attributes used by ORCID, Scopus, PubMed, and Web of Science for attribution and discovery, and integrate with repository platforms such as DSpace, Invenio, EPrints, and Islandora. The organization maintains persistent resolution using infrastructures related to the Handle System, DOI Foundation, Internet Archive, CLOCKSS, and Portico to ensure long-term access for datasets deposited in repositories including Zenodo, Figshare, Dryad, PLOS, and arXiv.

Members and Community

Members include national libraries, university libraries, research institutions, commercial publishers, and data centers from regions represented by institutions such as the Library of Congress, British Library, Bibliothèque nationale de France, Deutsche Nationalbibliothek, National Diet Library, and National Library of China. Community engagement occurs via working groups, taskforces, and conferences involving collaborators such as Research Data Alliance, FORCE11, European University Association, Association of Research Libraries, and the Confederation of Open Access Repositories. Membership activities link to funders and consortia like Coalition S, Plan S, the Dutch Research Council, Japan Society for the Promotion of Science, and the National Institutes of Health, fostering interoperable practices across infrastructures including EMBL-EBI, SRA, ICPSR, and UK Data Service.

Technical Infrastructure and Standards

Technical infrastructure relies on metadata schemas, APIs, and protocols interoperable with HTTP, OAI-PMH, RESTful services, JSON-LD, and RDF to interconnect with systems such as ORCID, Crossref, DataCite Commons, and institutional repositories at Harvard, MIT, and Oxford. Standards alignment involves collaborations with W3C, ISO, NISO, and the Research Data Alliance to define persistent identifier best practices, machine-actionable metadata, and FAIR principles promoted by the FORCE11 community, ELIXIR, and the GO FAIR initiative. The platform integrates with indexing engines and discovery services like Google Scholar, BASE, DataCite Search, and Europe PMC while depending on authentication and authorization frameworks from eduGAIN, Shibboleth, and OAuth ecosystems used by JSTOR, Scopus, and ProQuest.

Impact and Use Cases

DataCite-enabled DOIs underpin citation practices across journals and publishers including Nature, Science, PLOS, Elsevier, and Springer Nature, supporting reproducible workflows in projects at CERN, Human Genome Project, NASA, WHO, and IPCC assessments. Use cases span data citation for grants managed by the National Science Foundation, European Commission projects, and funders like the Wellcome Trust; repository linking for Dryad, Zenodo, and PANGAEA; and integration in scholarly profiles such as ORCID, ResearchGate, and Google Scholar. The service enhances metadata-driven discovery for initiatives like the Global Biodiversity Information Facility, Copernicus, NASA EOSDIS, EMBL-EBI, and the World Data System, contributing to metrics and assessment practices involving Altmetric, Dimensions, Clarivate Analytics, and OpenAIRE.

Category:Identifiers