Generated by GPT-5-mini| DPHEP | |
|---|---|
| Name | DPHEP |
| Formation | 2009 |
| Type | International collaboration |
| Region served | Worldwide |
| Headquarters | CERN |
| Field | High-energy physics data preservation |
DPHEP is an international collaboration focused on long-term preservation and reuse of data from high-energy physics experiments. Founded through initiatives at major laboratories and research councils, the project brings together experts from particle accelerators, detection collaborations, research infrastructures, funding agencies, and archival institutions to address the technical, organizational, and policy challenges of preserving experimental data. DPHEP convenes stakeholders from laboratories such as CERN, Fermilab, KEK, DESY, and institutes including INFN, SLAC, and IHEP to develop practices that enable reproducible analysis, software preservation, and sustained access to datasets.
DPHEP originated from concerns raised after retirements and shutdowns of experiments at facilities like the Large Hadron Collider, Tevatron, and LEP about loss of valuable datasets. Early meetings involved representatives from CERN, Fermilab, DESY, KEK, and SLAC National Accelerator Laboratory, with involvement from research councils such as the European Research Council and the U.S. Department of Energy. Workshops in Geneva and Fermilab led to formal working groups that produced community reports and recommendations, influenced by archival practice at institutions like the National Archives and Records Administration and digital preservation initiatives such as those by the Digital Preservation Coalition. DPHEP’s trajectory intersected with policy developments at organizations like the European Organization for Nuclear Research and global dialogues at forums including the International Council for Science.
DPHEP’s mission encompasses safeguarding scientific value and enabling future reuse of datasets produced by collaborations such as ATLAS, CMS, LHCb, and ALICE. Objectives include defining preservation levels, establishing metadata standards, maintaining software stacks linked to datasets, and advising funders such as the Science and Technology Facilities Council and the National Science Foundation on sustainable stewardship. The collaboration works to align practices with infrastructure providers like Zenodo, INSPIRE-HEP, and national data centers including CERN Open Data Portal partners, while engaging with governance bodies such as the European Strategy Group for High Energy Physics to integrate preservation into experimental lifecycle planning.
DPHEP is organized around an international steering board, technical working groups, and institutional members drawn from accelerator laboratories, university groups, and computing centers. Membership includes major experimental collaborations (e.g., BaBar, Belle II, H1), national laboratories (e.g., Brookhaven National Laboratory, Argonne National Laboratory), and research infrastructures like GRIDPP and OpenAIRE. The steering board liaises with funding agencies such as the Swiss National Science Foundation and multilateral bodies like the European Commission, while technical groups coordinate with archival organizations such as the International Council on Archives and standards bodies including the Open Grid Forum and W3C to ensure interoperability.
DPHEP articulated a tiered model of preservation levels ranging from basic documentation to full reproducibility, drawing on practices from collaborations like OPAL and ZEUS. The models emphasize capture of provenance and metadata consistent with schemas adopted by INSPIRE-HEP and catalogue standards promoted by ISO committees and the Research Data Alliance. Technical standards center on containerization and virtualization technologies from projects at CERN IT and software preservation approaches employed by ROOT and Geant4 user communities. DPHEP advocates persistent identifiers via systems like DOI and aligns with open metadata initiatives exemplified by ORCID and DataCite to ensure dataset citation and researcher attribution.
DPHEP coordinated community reports and action plans, ran topical workshops with partners such as IHEP Beijing and TRIUMF, and supported pilot preservation efforts for datasets from experiments including ALEPH, CDF, and D0. Activities include development of exemplar repositories, integration of preserved software environments using technologies pioneered at Swansea University and computing deployments like CERNVM and Docker, and promotion of best practices through training linked to schools such as the CERN Summer Student Programme and the T2K collaboration training events. DPHEP also engaged in policy advocacy with funders like the European Research Council and facilitated interoperability trials with services such as Zenodo and national research infrastructures like GESIS.
DPHEP’s influence is evident in adoption of preservation plans in experiment proposals to bodies like the European Strategy Group for High Energy Physics and in integration of data release practices at portals maintained by CERN Open Data Portal and national laboratories such as Fermilab Scientific Data. Legacy outcomes include standardized preservation level frameworks used by collaborations including ATLAS, CMS, BaBar, and Belle II; widespread use of persistent identifiers via DataCite; and increased coordination between particle physics and archival communities represented by organizations like the Digital Preservation Coalition and the International Council on Archives. DPHEP’s work continues to inform reproducibility efforts in experimental physics, influence training at institutions like Imperial College London and University of Oxford, and shape policies at agencies such as the National Science Foundation and the European Commission.
Category:Data preservation Category:High-energy physics organizations