LLMpediaThe first transparent, open encyclopedia generated by LLMs

Genome in a Bottle

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Pacific Biosciences Hop 4
Expansion Funnel Raw 51 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted51
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Genome in a Bottle
NameGenome in a Bottle
Formation2012
TypeConsortium / Reference Material Provider
HeadquartersNational Institute of Standards and Technology
Parent organizationNational Institute of Standards and Technology

Genome in a Bottle is a public–private consortium established to develop well-characterized human genomic reference materials, benchmarks, and standards to enable accurate genomic sequencing and variant calling. It brings together federal agencies, academic centers, clinical laboratories, biotechnology companies, and standards organizations to produce reproducible resources that anchor clinical genomics, population sequencing, and regulatory science. The consortium’s outputs support validation across sequencing platforms, bioinformatics pipelines, and regulatory frameworks used by agencies and manufacturers.

Overview

Genome in a Bottle coordinates production of characterized human DNA samples and benchmark datasets to serve as reference materials for next-generation sequencing validation and performance assessment. Partners include National Institute of Standards and Technology, Food and Drug Administration, National Institutes of Health, Broad Institute, European Molecular Biology Laboratory, Wellcome Sanger Institute, Illumina, Thermo Fisher Scientific, PacBio, Oxford Nanopore Technologies, Guardant Health, 23andMe, Veracyte, Color Genomics, Ambry Genetics, GeneDx, Invitae, Baylor College of Medicine. The consortium integrates expertise from genomic centers, clinical laboratories, and standard-setting bodies such as Clinical Laboratory Improvement Amendments, College of American Pathologists, and International Organization for Standardization advisors.

History and Development

Initiated in the early 2010s amid rapid adoption of Illumina short-read sequencing and emerging long-read platforms like Pacific Biosciences and Oxford Nanopore, the initiative arose from discussions between National Institutes of Health program officers, regulators at the Food and Drug Administration, and technologists at the Broad Institute and National Institute of Standards and Technology. Early milestones included release of high-confidence variant callsets based on samples from well-characterized donors originally sequenced by projects such as the 1000 Genomes Project, Human Genome Project, and pilot efforts linked to Personal Genome Project contributors. Collaborations expanded to include private-sector sequencing providers and clinical diagnostic firms during efforts surrounding the Precision Medicine Initiative and the rise of clinical American College of Medical Genetics and Genomics guidelines for interpretation. The consortium’s profile grew through methods papers, datasets deposited in archives associated with European Nucleotide Archive, Sequence Read Archive, and use in regulatory submissions reviewed by Food and Drug Administration divisions.

Reference Materials and Standards

Genome in a Bottle produces DNA reference materials derived from cell lines and characterized genomes such as those from the International HapMap Project and the 1000 Genomes Project cohorts. Reference products have been used to benchmark variant types including single-nucleotide variants benchmarked against datasets generated by platforms from Illumina, Pacific Biosciences, and Oxford Nanopore Technologies, as well as structural variants characterized through technologies associated with Bionano Genomics and 10x Genomics. Standards development involved engagement with Clinical Laboratory Improvement Amendments stakeholders, the College of American Pathologists, and technical committees from International Organization for Standardization and International Electrotechnical Commission. Reference callsets and performance metrics are widely cited in literature from groups at Broad Institute, Wellcome Sanger Institute, Genome Research Limited, and translational labs at Mayo Clinic and Massachusetts General Hospital.

Methods and Technologies

Analytical methods combine short-read paired-end sequencing from Illumina instruments with long-read single-molecule sequencing from Pacific Biosciences and nanopore technology from Oxford Nanopore Technologies, supplemented by optical mapping from Bionano Genomics, linked-read barcoding formerly commercialized by 10x Genomics, and synthetic long-read approaches developed in collaborations with industry partners. Bioinformatics workflows include alignment and variant calling using toolkits and pipelines from efforts associated with Broad Institute such as GATK, de novo assembly methods rooted in algorithms published by teams at European Bioinformatics Institute and Wellcome Sanger Institute, and structural variant callers benchmarked in community challenges like those organized by the Global Alliance for Genomics and Health. Quality assessment draws on standards authored by committees linked to Clinical Laboratory Improvement Amendments and benchmarking frameworks discussed at symposia hosted by National Institute of Standards and Technology.

Applications and Impact

Reference materials and benchmarks from the consortium underpin analytical validation in clinical genomics laboratories operated by organizations such as Mayo Clinic Laboratories, Quest Diagnostics, and LabCorp, and inform regulatory submissions evaluated by the Food and Drug Administration and policy discussions at Centers for Medicare & Medicaid Services. They have accelerated development of diagnostic assays for rare disease testing in centers like Children’s Hospital of Philadelphia and oncology assays at institutions such as Memorial Sloan Kettering Cancer Center and Dana-Farber Cancer Institute. Population-scale sequencing programs including All of Us Research Program and national initiatives in the United Kingdom Department of Health and Social Care and Health Canada have used these benchmarks to harmonize pipelines. The consortium’s outputs also enabled interoperability in commercial genomics services like 23andMe and clinical laboratories such as Invitae and Ambry Genetics, and have been cited in guidelines from American College of Medical Genetics and Genomics.

Governance and Consortium Structure

Governance is coordinated by staff at National Institute of Standards and Technology in collaboration with steering committees comprising representatives from federal agencies including National Institutes of Health and Food and Drug Administration, academic partners such as Broad Institute, Wellcome Sanger Institute, and European Molecular Biology Laboratory, and industry contributors including Illumina, Thermo Fisher Scientific, Pacific Biosciences, and Oxford Nanopore Technologies. Working groups focused on sequencing technologies, bioinformatics benchmarking, reference material production, and regulatory engagement meet regularly and align with standards committees at International Organization for Standardization and professional societies such as College of American Pathologists. Data release and stewardship follow principles promoted by consortia like the Global Alliance for Genomics and Health and archival practices linked to European Nucleotide Archive and the Sequence Read Archive.

Category:Genomics