Generated by GPT-5-mini| DataCite DOIs | |
|---|---|
| Name | DataCite DOIs |
| Type | Identifier service |
| Founded | 2009 |
| Location | Global |
| Parent organization | DataCite |
DataCite DOIs DataCite DOIs are persistent digital identifiers issued by DataCite to make research outputs discoverable and citable. They play a central role in scholarly communications connecting datasets, publications, and institutional repositories maintained by entities such as Harvard University, Max Planck Society, European Commission, National Institutes of Health, and Wellcome Trust. Designed to interoperate with infrastructures like CrossRef, ORCID, Zenodo, Figshare, and Dryad Digital Repository, they support reproducible research practices promoted by groups including CODATA, Research Data Alliance, Square Kilometer Array, and Human Genome Project.
DataCite DOIs function as persistent identifiers similar in purpose to identifiers used by Library of Congress, Getty Research Institute, Smithsonian Institution, British Library, and National Library of Medicine. They follow the Digital Object Identifier model first coordinated by the International DOI Foundation and complement registration agencies such as CrossRef and mEDRA. Data centers and repositories at institutions like California Institute of Technology, Massachusetts Institute of Technology, Stanford University, University of Oxford, University of Cambridge, and ETH Zurich use these DOIs to link datasets to outputs from projects like Large Hadron Collider, European Space Agency, NASA, CERN, and NOAA. Funding agencies including National Science Foundation, Horizon 2020, European Research Council, Gates Foundation, and NIH increasingly require persistent identifiers for compliance and tracking. Archives and consortia such as DataVerse, ICPSR, PANGAEA, UK Data Service, and Australian Research Data Commons also register DataCite DOIs for long-term access.
Organizations such as California Digital Library, British Library, INIST-CNRS, TIB Hannover, and Surrey Libraries become DataCite members or work through DataCite providers to mint DOIs. The minting workflow parallels registration systems used by CrossRef and involves metadata deposition to DataCite services and local repository integration like DSpace, EPrints, CKAN, Fedora Commons, and Islandora. DataCite works with registrars and affiliates including CDL, PANGAEA, CERN Document Server, Zenodo, and Figshare to allocate prefixes and suffix resolution policies. Institutions such as University of California, National Library of Finland, CSIRO, CNRS, and Max Planck Digital Library follow prescribed workflows for versioning, landing pages, and embargo handling that align with practices from ORCID, Crossref, OpenAIRE, Re3data, and RO-Crate.
The DataCite Metadata Schema is aligned with international standards and bodies such as ISO 26324, Dublin Core Metadata Initiative, W3C, NISO, and OCLC. It integrates controlled vocabularies and identifiers like ORCID for researchers, GRID for institutions, and taxonomies used by Library of Congress, Getty Vocabularies, MeSH, and WoS indices. Repositories and publishers including PLOS, Springer Nature, Elsevier, Wiley, and Nature Research rely on consistent schema terms for creators, contributors, funders (e.g., European Research Council, Wellcome Trust), and resourceType classifications used across infrastructures like CrossRef, DataCite Commons, OpenAIRE, and Scholix. DataCite metadata supports citation tracking with systems such as Scopus, Web of Science, Google Scholar, and institutional CRIS systems at Princeton University, Yale University, and Columbia University.
Resolution of DataCite DOIs depends on the Handle System originally used by CNRI and governed by the International DOI Foundation. Persistent resolution services interact with content delivery networks and repository infrastructures at Amazon Web Services, European Data Portal, CERN, CLOCKSS, and Portico to ensure availability. Long-term preservation collaborations with National Library of Australia, Bibliothèque nationale de France, Deutsche Nationalbibliothek, Internet Archive, and LOCKSS aim to sustain landing pages and access even when originating repositories such as Zenodo or Figshare evolve. Legal deposit institutions and national archives like Library and Archives Canada coordinate policies to address takedown, orphaned DOIs, and redirection consistent with norms from Creative Commons licensing and mandates by funders like NSF and European Commission.
DataCite governance builds on stakeholder engagement with members including Chinese Academy of Sciences, Japanese National Institute of Informatics, Korea Institute of Science and Technology Information, Australian Research Council, and regional consortia like Jisc. Policies for DOI assignment, metadata quality, and machine-actionable metadata are informed by advisory groups, working groups, and standards bodies such as W3C, ISO, NISO, RDA, and CODATA. Licensing and access policies intersect with publishers and funders such as Wellcome Trust, Bill & Melinda Gates Foundation, UKRI, Horizon Europe, and NIH to define persistent identifier stewardship, provenance recording, and community best practices adopted by universities like Uppsala University, Lund University, and University of Edinburgh.
DataCite DOIs are used for dataset citation in journals published by Nature Publishing Group, Elsevier, PLOS, IEEE, and ACM; for linking clinical datasets in collaborations involving World Health Organization, Centers for Disease Control and Prevention, and European Centre for Disease Prevention and Control; and for connecting software and code repositories such as GitHub, Bitbucket, GitLab, and archival services like Software Heritage. Large infrastructure projects like Square Kilometre Array, ITER, LIGO, James Webb Space Telescope, and Human Cell Atlas use DOIs to track data provenance. Institutional repositories at MIT, University of Toronto, University of Melbourne, Seoul National University, and National University of Singapore integrate DOIs into assessment systems, ORCID profiles, and research information management platforms like Symplectic, Pure, and Converis.
Critiques of DataCite DOI practices come from communities concerned with metadata incompleteness noted by groups such as SPARC, Open Knowledge Foundation, Force11, ScholarsPortal, and COAR. Challenges include cross-repository interoperability issues highlighted in comparisons with CrossRef and legacy systems used by PubMed Central and arXiv, sustainability concerns voiced by national libraries and funders like NSF and ERC, and legal/policy complexities in jurisdictions represented by European Commission, United States Department of Health and Human Services, and China National Knowledge Infrastructure. Additional debates involve citation normalization across indices like Scopus and Web of Science and technical concerns about DOI granularity, versioning, and costs raised by universities, consortia, and repository platforms including DSpace, EPrints, Figshare, and Zenodo.
Category:Persistent identifiers