LLMpediaThe first transparent, open encyclopedia generated by LLMs

IEEE DataPort

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: IEEE Xplore Hop 4
Expansion Funnel Raw 98 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted98
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
IEEE DataPort
NameIEEE DataPort
TypeData repository
OwnerIEEE
Launched2016
CountryUnited States
Websitedata.ieee.org

IEEE DataPort

IEEE DataPort is a digital repository and data management service operated by IEEE that provides storage, sharing, and publication for research datasets. It serves researchers, institutions, and industry users by integrating dataset hosting with citation, metadata standards, and DOI assignment to facilitate reproducible research and data-driven collaboration. The platform interfaces with academic publishers, funding agencies, and professional societies to support dataset discoverability and long-term preservation.

Overview

IEEE DataPort aligns with data stewardship practices established by organizations such as National Science Foundation, European Commission, Wellcome Trust, OpenAIRE, and Research Data Alliance. It interoperates with identifier systems including Digital Object Identifier and metadata schemas endorsed by CrossRef, DataCite, and ORCID. The service complements repositories like Figshare, Dryad, Zenodo, ICPSR, and Zenodo-adjacent infrastructures while aligning with initiatives from FAIR Data Principles advocates and standards bodies such as ISO committees and NIST. Stakeholders include universities like Massachusetts Institute of Technology, Stanford University, University of Cambridge, research institutes such as CNRS, and corporations including Siemens, IBM, and Microsoft Research.

History and Development

Development traces to IEEE governance and programmatic initiatives parallel to efforts by IEEE Standards Association, IEEE Xplore, and philanthropic and research funding trends from National Institutes of Health and Bill & Melinda Gates Foundation. Early design choices reflected recommendations from panels including contributors from DARPA, European Research Council, JISC, and representatives of consortia such as World Data System and Data Conservancy. Launch milestones coincided with conferences like IEEE International Conference on Big Data, IEEE International Conference on Robotics and Automation, and IEEE Global Humanitarian Technology Conference. Architectural evolution referenced technologies advanced by Amazon Web Services, Google Cloud Platform, and containerization trends championed by Docker and Kubernetes.

Platform Features and Services

IEEE DataPort offers DOI minting through CrossRef and DataCite workflows, metadata enrichment compatible with Dublin Core and schema.org, and persistent identifiers linked to ORCID researchers. Features parallel tools from GitHub for versioning and Jupyter Notebook integration for computational reproducibility, and support for large datasets akin to capabilities demonstrated by AWS S3 and Google Cloud Storage. The service facilitates peer review integration with journals such as IEEE Transactions on Pattern Analysis and Machine Intelligence, Nature, Science, and publication platforms like Springer Nature and Elsevier. Security and compliance policies reference standards from ISO 27001, SOC 2, and frameworks used by National Institute of Standards and Technology. DataPort provides analytics comparable to metrics in Altmetric and repository dashboards used by Figshare.

Data Submission and Curation

Submission workflows incorporate metadata templates influenced by DataCite Metadata Schema, with curation practices modeled after repositories like ICPSR and Dryad. Contributors from institutions such as Harvard University, University of Oxford, Princeton University, and national laboratories like Argonne National Laboratory and Los Alamos National Laboratory have deposited datasets, often accompanied by code hosted on GitHub or linked to computational artifacts using Binder and JupyterHub. Curation roles mirror professional archivists in organizations including Library of Congress, The British Library, and National Archives and Records Administration. Peer review of datasets has been promoted in coordination with journals such as PLOS ONE, Scientific Data, and Data Science Journal.

Access, Licensing, and Usage Policies

Access models range from open access options consistent with mandates from Plan S and funders like Wellcome Trust to restricted access arrangements aligned with institutional policies at University of California campuses and governmental requirements like those from U.S. Department of Energy. Licensing choices include Creative Commons variants and bespoke agreements reflecting intellectual property practices at corporations such as Intel and Qualcomm. Usage tracking and citation practices follow guidance from DataCite, CrossRef, and bibliometric services like Web of Science and Scopus. Data governance considerations reference ethical frameworks used by Belmont Report-aligned review boards and compliance regimes such as HIPAA and GDPR.

Community, Partnerships, and Impact

IEEE DataPort engages with academic communities at conferences including NeurIPS, ICML, CVPR, and SIGGRAPH, and partners with professional societies such as ACM, AAAS, SIAM, and regional IEEE sections. Collaborations extend to infrastructure providers like Internet2 and initiatives such as CERN Open Data and Human Cell Atlas for domain-specific datasets. Impact is measured through dataset reuse in publications indexed by Google Scholar, PubMed, and citations tracked by CrossRef, contributing to reproducibility efforts advocated by organizations such as Committee on Publication Ethics and Center for Open Science. Community programs mirror training and outreach by Data Carpentry, Software Carpentry, and institutional libraries at University of Toronto and University of Melbourne to increase data literacy and stewardship.

Category:Data repositories