Merritt (repository)

Merritt (repository)
Name	Merritt
Type	Digital preservation repository
Owner	California Digital Library
Country	United States
Established	2012

Contents

Overview
History and Development
Architecture and Features
Content Types and Collections
Access, Preservation, and Metadata
Use Cases and Integration
Governance, Funding, and Sustainability

Merritt (repository)

Merritt is a distributed digital repository service developed and operated by the California Digital Library to support long‑term storage, preservation, and access for scholarly and cultural content. It provides a managed environment for ingest, versioning, replication, and dissemination of digital objects created by universities, libraries, museums, and research projects across the University of California system and partner organizations. Merritt integrates with identity, workflow, and discovery services to connect content with institutional infrastructures such as DSpace, Fedora (repository), Orcid, and California Digital Library systems.

Overview

Merritt functions as a preservation and access platform that emphasizes replication, fixity, and metadata provenance to ensure digital objects remain usable over time. Institutions use Merritt alongside systems like Figshare, Zenodo, Internet Archive, JSTOR, and Portico to satisfy archival commitments, funder mandates from agencies such as the National Science Foundation and National Institutes of Health, and publisher deposit workflows for organizations including Elsevier and Springer Nature. Merritt supports common scholarly outputs produced by entities such as Los Alamos National Laboratory, Smithsonian Institution, and campus libraries within the Association of Research Libraries network.

History and Development

Merritt was developed by the California Digital Library in response to preservation needs articulated by the University of California libraries and partners, following precedents set by projects such as LOCKSS and PREMIS. Initial design work drew on preservation research at institutions like UC Berkeley, Stanford University, and collaborations with vendors and projects including Amazon Web Services and DuraSpace. Public deployments began in the 2010s as repositories and publishers sought sustainable alternatives to proprietary archival solutions; Merritt evolved through iterative releases that incorporated standards from ISO 16363, OAIS (Open Archival Information System), and the Digital Preservation Coalition guidance.

Architecture and Features

Merritt’s architecture emphasizes modular services for ingest, archival storage, replication, and access. Core components interoperate with identity and authentication systems such as Shibboleth and OpenID Connect, and with metadata registries including Schema.org and Dublin Core. Storage backends have been implemented using object stores comparable to Amazon S3 patterns and software components used by LOCKSS and Fedora (repository). Features include content versioning, cryptographic checksums for fixity modeled after NIST recommendations, replication across geographically separated datacenters, and automated workflows that echo practices from DataCite PID assignment and ORCID integration. Merritt exposes APIs for programmatic access that facilitate interoperability with discovery systems like Blacklight and harvesting protocols used by OAI-PMH endpoints.

Content Types and Collections

Merritt ingests a wide range of scholarly and cultural formats: textual monographs and articles similar to collections held by Project Gutenberg, research datasets akin to holdings at Dryad, geospatial data used in projects with US Geological Survey, audiovisual materials comparable to Library of Congress collections, and software artifacts like packages archived by Software Heritage. Collections managed via Merritt often represent institutional repositories at campuses such as UCLA, UC Berkeley, and UC San Diego, research centers funded by National Endowment for the Humanities, and data produced under grants from NASA and National Oceanic and Atmospheric Administration.

Access, Preservation, and Metadata

Merritt implements preservation workflows grounded in community standards such as PREMIS, METS, and BagIt packaging profiles. Metadata strategies integrate descriptive and technical metadata drawn from schemas used by Dublin Core, MODS, and DataCite to support discovery and rights management consistent with policies from organizations like Creative Commons and legal frameworks influenced by United States Copyright Office. Preservation preservation measures include periodic fixity checks, format identification routines comparable to PRONOM and DROID processes, and migration pathways aligned with guidance from the Library of Congress digital preservation outreach programs.

Use Cases and Integration

Institutions deploy Merritt for a variety of needs: institutional repository backends for campuses such as UC Davis, data management for laboratories funded by National Institutes of Health, supplement hosting for publishers like PLOS and PeerJ, and curated collections for museums partnered with Getty Research Institute. Integrations enable submission and publication workflows tied to editorial systems like Editorial Manager, research data management platforms such as KiltHub, and preservation audit workflows used by consortia including the Digital Preservation Network. APIs allow batch ingest from content management systems used by academic units, and link resolvers employed by library services such as WorldCat.

Governance, Funding, and Sustainability

Governance of Merritt resides with the California Digital Library and involves advisory participation from campus libraries across the University of California system, partner institutions, and stakeholder consortia. Funding models combine institutional allocations, grant support from agencies like the Andrew W. Mellon Foundation and Institute of Museum and Library Services, and service contracts with university clients. Sustainability planning draws on models used by organizations such as Portico and LOCKSS to balance cost recovery, community governance, and technical evolution while meeting audit criteria from standards bodies including ISO and peer review frameworks administered by the Digital Preservation Coalition.

Category:Digital preservation repositories