Generated by GPT-5-mini| PANGAEA (data publisher) | |
|---|---|
| Name | PANGAEA (data publisher) |
| Established | 1987 |
| Location | Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research; University of Bremen; MARUM |
PANGAEA (data publisher) is an international digital data repository focused on georeferenced earth and environmental science data, serving researchers, institutions, and publishers worldwide. It facilitates long-term preservation and dissemination of observational, experimental, and model-derived datasets, supporting reproducible research and citation practices across disciplines. The repository interfaces with academic infrastructures, funding agencies, and scholarly publishers to enable data sharing and integration.
PANGAEA operates as a curated open data archive hosting datasets from disciplines such as oceanography, climatology, paleoclimatology, geochemistry, marine biology, glaciology, seismology, hydrology, and remote sensing, with metadata compatible with indexing services like DataCite, Crossref, ORCID, GBIF, and GEOSS. The platform assigns persistent identifiers and follows community standards used by organizations including the World Data System, International Oceanographic Commission, Intergovernmental Oceanographic Commission, European Space Agency, National Oceanic and Atmospheric Administration, United Nations Environment Programme and research infrastructures such as ERIC-level consortia. PANGAEA integrates with laboratory networks, national research centers such as the Alfred Wegener Institute, academic publishers like Nature Research, Elsevier, and Springer Nature, and aggregators including Copernicus and Paleoceanography.
Founded in the late 1980s at the Alfred Wegener Institute, the archive evolved through collaborations with the University of Bremen, MARUM, and European research projects such as EuroGOOS, EU FP6, Horizon 2020, and ESFRI initiatives. Early development drew on partnerships with data centers like World Data Center for Marine Environmental Sciences and standards groups such as ISO 19115, OGC, and Dublin Core. Over decades the infrastructure expanded through integration with projects including EMODnet, SeaDataNet, GEOTRACES, PAGES, ICSC, SCAR, and international programs like Global Ocean Observing System and International Continental Scientific Drilling Program.
PANGAEA provides data ingestion, curation, DOI assignment, long-term preservation, and discovery services supporting submissions from principal investigators, research cruises, observatories, and laboratory studies. It performs quality control, metadata enhancement, formatting into standards such as NetCDF, CSV, and ISO 19139, and links datasets to publications in journals like Science, Nature, Geophysical Research Letters, Journal of Geophysical Research, Deep-Sea Research, and Marine Ecology Progress Series. The repository supports data citation practices consistent with guidelines from bodies like the Joint Declaration of Data Citation Principles, CODATA, RDA, and funding agencies such as the European Commission and National Science Foundation.
The platform enforces curated access, embargo options, and licensing workflows compatible with Creative Commons licenses and community agreements from organizations like DataCite and OpenAIRE. It implements data usage terms aligned with mandates from European Research Council, Horizon Europe, German Research Foundation, National Institutes of Health, and publisher requirements from PLOS and Wiley. PANGAEA supports metadata transparency required by initiatives such as FAIR principles, Open Data Charter, GO FAIR, and reporting frameworks used by Intergovernmental Panel on Climate Change assessments.
PANGAEA’s architecture combines database backends, archival storage, and web services with APIs that interoperate with platforms such as GitHub, Zenodo, Figshare, Linked Open Data endpoints, and catalogues like DataONE. It adheres to metadata standards including ISO 19115, Darwin Core, EML, and uses controlled vocabularies and ontologies promoted by GBIF, BODC, PANGAEA Controlled Vocabularies Consortium and community efforts led by W3C, OGC, and IETF. Technical features include RESTful services, OAI-PMH harvesting, checksum-based preservation, and integration with identity services like ORCID and Shibboleth.
Users range from individual researchers and principal investigators to large consortia, research infrastructures, and governmental programs including the European Commission, ESA, NOAA, NASA, DFG, BMBF, and international initiatives like IPCC and UNESCO. PANGAEA partners with observatory networks, marine institutes, and data aggregators such as Alfred Wegener Institute, GEOMAR, Helmholtz Centre for Ocean Research Kiel, British Antarctic Survey, SCRIPPS Institution of Oceanography, Woods Hole Oceanographic Institution, CNRS, CSIC, AWI, IFREMER, IMR, NIOZ, NERSC, EMSO, and ICOS.
PANGAEA has contributed to high-impact studies and reports by enabling reuse of datasets used in publications in Nature, Science Advances, PNAS, Geology, Quaternary Research, and assessments by IPCC. Notable datasets include long-term oceanographic time series from observatory programs, paleoclimate proxy compilations used in PAGES syntheses, GEOTRACES trace-metal profiles, and multidisciplinary cruise data supporting campaigns like RV Polarstern expeditions, RRS James Clark Ross voyages, and RV Sonne studies. The repository’s role in data preservation and citation has been recognized in community guidelines from CODATA, RDA, and major funding agencies, fostering reproducibility in earth and environmental sciences.
Category:Data repositories