LLMpediaThe first transparent, open encyclopedia generated by LLMs

MetaboLights

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: EMBL-EBI Hop 4
Expansion Funnel Raw 66 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted66
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
MetaboLights
TitleMetaboLights
DisciplineMetabolomics
CountryUnited Kingdom
Established2012
PublisherEuropean Bioinformatics Institute
AccessOpen

MetaboLights is an open‑access, general‑purpose repository for metabolomics studies, serving as a central archive for experimental raw data, processed results, and associated metadata. Launched and maintained within a network of European life‑science organizations, it connects datasets to biochemical resources, analytical platforms, and biological specimens. The resource supports reproducible research by integrating with standards, ontologies, and community initiatives across biological and chemical databases.

Overview

MetaboLights was developed by teams at the European Bioinformatics Institute European Bioinformatics Institute, in collaboration with partners such as the European Molecular Biology Laboratory European Molecular Biology Laboratory, the Wellcome Trust Wellcome Trust, and the European Commission European Commission. The project aligns with international standards bodies including the Metabolomics Society Metabolomics Society, the Human Metabolome Project Human Metabolome Project, and the International Society for Computational Biology International Society for Computational Biology. It interoperates with resources like the Protein Data Bank Protein Data Bank, the Gene Expression Omnibus Gene Expression Omnibus, and the Sequence Read Archive Sequence Read Archive to enable cross‑omics integration. Institutional adopters and funders such as the National Institutes of Health National Institutes of Health, the Biotechnology and Biological Sciences Research Council Biotechnology and Biological Sciences Research Council, and the European Research Council European Research Council have encouraged deposition to enhance transparency and reuse. MetaboLights sits within broader infrastructure efforts including ELIXIR ELIXIR, the Global Bioinformatics Institute Global Bioinformatics Institute, and the Research Data Alliance Research Data Alliance.

Data Content and Structure

Datasets archived in MetaboLights typically include raw mass spectrometry files, nuclear magnetic resonance spectra, chromatograms, identification tables, and experimental metadata linking samples to biobanks and clinical cohorts such as UK Biobank UK Biobank and the 1000 Genomes Project 1000 Genomes Project where appropriate. MetaboLights leverages controlled vocabularies and ontologies like the Chemical Entities of Biological Interest Chemical Entities of Biological Interest (ChEBI), the Gene Ontology Gene Ontology, and the Experimental Factor Ontology Experimental Factor Ontology to annotate compounds, pathways, and experimental variables. The repository adopts data models influenced by standards bodies including the Metabolomics Standards Initiative Metabolomics Standards Initiative and the Minimum Information About a Metabolomics Experiment guidelines endorsed by the International Organization for Standardization International Organization for Standardization. Structural links map entries to compound catalogs such as PubChem PubChem, ChEMBL ChEMBL, and UniProt UniProt for protein associations, while pathway cross‑references point to KEGG KEGG, Reactome Reactome, and MetaCyc MetaCyc resources. Taxonomic and organismal metadata rely on references like NCBI Taxonomy NCBI Taxonomy and the Catalogue of Life Catalogue of Life.

Submission and Curation Process

Researchers submit experiments via a web interface that guides depositors through metadata capture, file upload, and consent declarations consistent with funder policies such as those of the Wellcome Trust Wellcome Trust and the European Research Council European Research Council. Submissions are reviewed by curators with expertise connected to institutions like the University of Cambridge University of Cambridge metabolomics groups, the Max Planck Society Max Planck Society, and the Francis Crick Institute Francis Crick Institute to ensure compliance with standards from the Metabolomics Society Metabolomics Society and community checklists used by journals such as Nature Nature, Science Science, and PLOS Biology PLOS Biology. Curation workflows include annotation with ontologies from the OBO Foundry OBO Foundry and chemical validation against resources including ChemSpider ChemSpider and the Human Metabolome Database Human Metabolome Database. For sensitive clinical or human data, governance relies on frameworks established by the European Medicines Agency European Medicines Agency and the General Data Protection Regulation General Data Protection Regulation (as interpreted by national data controllers such as the Information Commissioner's Office Information Commissioner's Office).

Access and Tools

MetaboLights provides programmatic access through APIs designed for integration with platforms such as Galaxy Galaxy (computational biology), Cytoscape Cytoscape, and Jupyter notebooks used in collaborations with institutes like the European Molecular Biology Laboratory European Molecular Biology Laboratory and EMBL‑EBI partners. Data discovery leverages indexing by search engines and scholarly platforms including PubMed Central PubMed Central, Europe PMC Europe PMC, and CrossRef CrossRef to link studies to publications from journals like Analytical Chemistry Analytical Chemistry and Metabolomics Metabolomics (journal). Visualization and analysis are facilitated by third‑party tools such as XCMS XCMS, MetFrag MetFrag, and MZmine MZmine and by integration with cheminformatics toolkits like RDKit RDKit and Open Babel Open Babel. Long‑term preservation and identifiers follow practices recommended by DataCite DataCite and the Digital Object Identifier system used by publishers like Wiley John Wiley & Sons and Springer Nature Springer Nature.

Community and Governance

Governance and community engagement draw on stakeholder groups including the Metabolomics Society Metabolomics Society, ELIXIR Nodes such as ELIXIR‑UK ELIXIR‑UK and ELIXIR‑FR ELIXIR‑FR, and advisory panels with representatives from universities like Imperial College London Imperial College London, King’s College London King's College London, and the University of Oxford University of Oxford. Funding and strategic guidance have involved entities such as the European Commission European Commission, the Biotechnology and Biological Sciences Research Council Biotechnology and Biological Sciences Research Council, and philanthropic organizations like the Wellcome Trust Wellcome Trust. Community standards development is coordinated with working groups from the International Society for Computational Biology International Society for Computational Biology and the Research Data Alliance Research Data Alliance, while training and outreach engage conferences like the Metabolomics Society Annual Conference Metabolomics Society Annual Conference and workshops hosted by institutions such as the EMBL European Molecular Biology Laboratory and Cold Spring Harbor Laboratory Cold Spring Harbor Laboratory.

Category:Metabolomics databases