LLMpediaThe first transparent, open encyclopedia generated by LLMs

PANGEA (data repository)

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Global Drifter Program Hop 4
Expansion Funnel Raw 1 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted1
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
PANGEA (data repository)
NamePANGEA (data repository)
TypeData repository
Founded2004
OwnerAlfred Wegener Institute (listed operator)
CountryGermany
AccessOpen access (selective embargo)
DisciplinesEarth sciences, environmental science, marine geology, geophysics

PANGEA (data repository)

PANGEA is an open data repository focused on geoscientific and environmental datasets, used for archiving and sharing observational, experimental, and model data from field campaigns, cruises, and laboratory studies. It supports data deposit and dissemination for projects associated with institutions such as the Alfred Wegener Institute, the European Commission, the German Research Foundation, and international programs like the International Ocean Discovery Program and the Intergovernmental Panel on Climate Change. The repository interoperates with services and infrastructures used by the European Space Agency, the National Oceanic and Atmospheric Administration, and global initiatives including the Group on Earth Observations.

Overview

PANGEA provides persistent identifiers, metadata standards, and long-term preservation for datasets produced by researchers affiliated with organizations like the Helmholtz Association, the Max Planck Society, the British Geological Survey, and the Smithsonian Institution. Its platform implements community standards promoted by the Research Data Alliance, the DataCite consortium, and the World Data System, enabling citation practices consistent with guidelines from the Committee on Publication Ethics and the International Council for Science. Users deposit datasets that complement journal publications in outlets such as Nature, Science, the Journal of Geophysical Research, and Earth and Planetary Science Letters, while linking to projects funded by the European Research Council and Horizon programs.

History and development

Established in 2004 under auspices connected to the Alfred Wegener Institute and German polar research programs, the repository evolved through collaborations with European infrastructures including the European Plate Observing System and the European Marine Observation and Data Network. Early governance involved partnerships with institutions like the Federal Institute for Geosciences and Natural Resources and the German Climate Computing Center, and later integration efforts referenced standards used by NASA, the British Antarctic Survey, the Scripps Institution of Oceanography, and the Lamont–Doherty Earth Observatory. Over successive phases the platform adopted metadata schemes from the International Organization for Standardization and the Open Geospatial Consortium, enabling alignment with initiatives such as the Global Biodiversity Information Facility and the Ocean Biogeographic Information System.

Data types and scope

The repository accepts a wide range of geoscientific outputs, including oceanographic profiles collected on research vessels like the RRS Discovery and RV Polarstern, sediment core descriptions used by the Integrated Ocean Drilling Program, geochemical datasets from labs at ETH Zurich and the University of Cambridge, and geophysical surveys associated with organizations such as the Geological Survey of Norway and the US Geological Survey. It hosts datasets supporting studies by research groups linked to universities including the University of Oxford, Massachusetts Institute of Technology, Humboldt University of Berlin, and the University of Tokyo, covering fields represented by journals like Geology, Paleoceanography, and Marine Geology. Data types span bathymetry, seismic reflection records, multi-beam sonar, isotopic measurements from laboratories such as the Max Planck Institute for Chemistry, and time series from observatories operated by the International Arctic Research Center and the Pacific Marine Environmental Laboratory.

Submission and curation policies

Submission workflows mirror practices used by repositories such as PANGAEA, Dryad, and Zenodo, requiring metadata compliant with standards advocated by DataCite, ISO, and the Global Change Master Directory. Curatorial review involves checks comparable to those at the British Library and the National Aeronautics and Space Administration’s Earth Science Data and Information System, ensuring provenance tracking aligned with the Open Archives Initiative and persistent identifier registration through Crossref and DataCite. Deposit agreements reflect licensing options familiar to contributors to repositories backing publications in Proceedings of the National Academy of Sciences, Frontiers, and Elsevier journals, and provide embargo mechanisms used by grant programs from the European Research Council, the Wellcome Trust, and the German Research Foundation.

Access, tools, and interoperability

Access mechanisms support programmatic retrieval through APIs and services interoperable with the Open Geospatial Consortium, the Network Common Data Form standards used by the National Center for Atmospheric Research, and the Climate and Forecast conventions promoted by the World Meteorological Organization. The repository integrates with data discovery platforms such as Copernicus, the Global Earth Observation System of Systems, and the GEO portal, and can be linked from research data management systems implemented at institutions like ETH Zurich, University of California, Berkeley, and the Barcelona Supercomputing Center. Tools for visualization and download interface with software ecosystems including MATLAB, R packages maintained by the Comprehensive R Archive Network, Python libraries used by the Scientific Python community, and GIS applications like Esri ArcGIS and QGIS.

Governance and funding

Governance structures reflect oversight similar to arrangements at the Helmholtz Association and the Alfred Wegener Institute, with advisory input from scientific committees comprising members associated with the European Commission, the International Council for Science, and national funding agencies such as the German Federal Ministry of Education and Research. Funding streams have included institutional support, project grants from Horizon Europe and the European Research Council, and collaborations with agencies like the National Science Foundation and the German Research Foundation, paralleling financial models used by infrastructures such as EUDAT and the European Open Science Cloud. Collaborative agreements and memoranda of understanding have linked the repository to partner organizations including the International Oceanographic Commission, the Intergovernmental Oceanographic Commission, and major research universities worldwide.

Category:Earth sciences data repositories