LLMpediaThe first transparent, open encyclopedia generated by LLMs

MetaCyc

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Identifiers.org Hop 6
Expansion Funnel Raw 70 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted70
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
MetaCyc
NameMetaCyc
TitleMetaCyc
ProducerBioCyc Project
CountryUnited States
LanguagesEnglish
CostFree for academic use
DisciplinesBiochemistry, Systems Biology, Genomics
Temporal1990s–present

MetaCyc

MetaCyc is a curated database of metabolic pathways and enzymes derived from experimentally verified biochemical literature. It serves as a reference resource for pathway analysis, metabolic reconstruction, and comparative genomics used by researchers at institutions such as National Institutes of Health, University of California, San Diego, and companies like Genentech and Illumina. The resource is integrated with computational platforms and educational projects including EcoCyc, KEGG, and Reactome.

Overview

MetaCyc summarizes experimentally elucidated metabolic pathways and enzymatic reactions drawn from primary literature authored by scientists affiliated with institutions such as Harvard University, Stanford University, Massachusetts Institute of Technology, and European Molecular Biology Laboratory. It catalogs small-molecule transformations, cofactors, transport events, and regulatory information relevant to organisms studied at centers like Max Planck Society and Broad Institute. MetaCyc interoperates with standards and ontologies developed by communities including Gene Ontology Consortium, Systems Biology Markup Language, and Open Biological and Biomedical Ontology groups.

History and Development

Development began in the 1990s under initiatives led by researchers associated with SRI International and later coordinated by groups at Lawrence Berkeley National Laboratory and University of California, San Diego. Major milestones include curated releases that incorporated pathways from classic biochemical studies by investigators at Rockefeller University and pathway compilations influenced by databases such as Swiss-Prot and BRENDA. Collaborative funding and partnerships involved organizations like National Science Foundation and Howard Hughes Medical Institute supporting expansion and integration with projects including BioCyc and Encyclopedia of Life.

Content and Scope

MetaCyc contains thousands of pathways and reactions curated from peer-reviewed studies in journals such as Journal of Biological Chemistry, Nature, Science, Proceedings of the National Academy of Sciences, and EMBO Journal. Entries link enzymes to gene products studied in model organisms like Escherichia coli, Saccharomyces cerevisiae, Arabidopsis thaliana, and Homo sapiens research. The database covers metabolites characterized in collections such as PubChem, ChEBI, and Human Metabolome Database, and integrates taxonomic context referencing authorities like NCBI Taxonomy and Catalogue of Life.

Data Model and Curation

The MetaCyc data model represents pathways, reactions, enzymatic activities, and small molecules with structured attributes inspired by standards from Systems Biology Ontology and SBML formats used by consortia including BioModels Database. Curation workflows rely on expert annotators trained at institutions associated with European Bioinformatics Institute and Cold Spring Harbor Laboratory who extract evidence from experimental reports by researchers at University of Cambridge, Yale University, University of Oxford, and Caltech. Provenance tracking relates entries to publications indexed in PubMed and curated terminologies such as UniProt identifiers.

Access and Tools

MetaCyc is accessible through web interfaces and programmatic APIs used by computational platforms like Cyc, Pathway Tools, and analysis suites developed at Lawrence Livermore National Laboratory. Users integrate MetaCyc content with visualization tools such as those from Cytoscape and pathway enrichment workflows implemented in software by groups at EMBL-EBI and Cold Spring Harbor Laboratory. Educational and training resources reference curricula from MIT OpenCourseWare and workshops hosted at Gordon Research Conferences.

Applications and Use Cases

Researchers apply MetaCyc for metabolic network reconstruction in projects at DOE Joint Genome Institute, strain engineering efforts at Amyris, and synthetic biology designs performed at iGEM Foundation teams. Clinical researchers use pathway context in studies at Mayo Clinic and Johns Hopkins University for biomarker discovery and metabolic disease interpretation. Ecologists and microbiologists reference pathways when characterizing environmental metagenomes collected by expeditions organized by Woods Hole Oceanographic Institution and analyses supported by National Oceanic and Atmospheric Administration datasets.

Licensing and Availability

MetaCyc content is distributed under terms permitting academic access with licensing arrangements coordinated by the BioCyc Project and organizations such as SRI International and The Salk Institute. Commercial use typically requires agreements with the database maintainers and associated institutions like Lawrence Berkeley National Laboratory; mirror and integration policies align with practices at European Molecular Biology Laboratory and National Center for Biotechnology Information. Data export formats support integration with community standards endorsed by Global Alliance for Genomics and Health.

Category:Biological databases Category:Metabolism Category:Bioinformatics