LLMpediaThe first transparent, open encyclopedia generated by LLMs

HGMD

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: UCSC Genome Browser Hop 4
Expansion Funnel Raw 83 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted83
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
HGMD
NameHGMD
TitleHuman Gene Mutation Database
DisciplineGenetics
CountryUnited Kingdom
Established1996
ProducerBioinformatics Institute / Institute of Medical Genetics
FormatsCurated database, downloadable datasets

HGMD

HGMD is a curated repository of reported germline mutations in human nuclear genes compiled to support medical genetics research, variant interpretation, and clinical diagnostics. It serves as a reference for clinicians at institutions such as Great Ormond Street Hospital, researchers at Cold Spring Harbor Laboratory, and bioinformaticians at European Bioinformatics Institute and Broad Institute. The resource is used alongside databases maintained by organizations like ClinVar, dbSNP, and Ensembl in workflows run in environments such as National Institutes of Health, Wellcome Trust Sanger Institute, and commercial diagnostics laboratories.

Introduction

HGMD collects peer‑reviewed reports of sequence variants discovered in patients and families, aggregating data from journals published by houses including Nature Publishing Group, Elsevier, and Wiley-Blackwell. Curators extract information from primary literature appearing in periodicals like The New England Journal of Medicine, Lancet, Nature Genetics, and American Journal of Human Genetics. The dataset is referenced in studies at universities such as Harvard University, University of Oxford, Stanford University, and Massachusetts Institute of Technology and is integrated into pipelines using tools developed at institutions like University of California, San Francisco, Johns Hopkins University, and University College London.

History and Development

HGMD originated in the mid‑1990s, founded by investigators associated with the Institute of Medical Genetics and further developed in collaboration with groups at Cardiff University and research networks funded by agencies like Medical Research Council and Wellcome Trust. Early growth paralleled sequencing initiatives conducted at centers such as Sanger Institute and project consortia including the Human Genome Project and 1000 Genomes Project. Over time contributions and citations came from research led by investigators at Max Planck Society, Dana‑Farber Cancer Institute, University of Cambridge, and clinical centers like Mayo Clinic and Cleveland Clinic. The platform evolved alongside standards from organizations such as American College of Medical Genetics and Genomics and collaborations with repositories like UniProt and RefSeq.

Database Content and Classification

Entries in HGMD cover variant classes reported in literature from groups at Cold Spring Harbor Laboratory and Kyoto University to clinics at Mount Sinai Hospital and Charité – Universitätsmedizin Berlin. Categories include disease‑causing mutations, probable pathological variants, and other classifications aligned with guidance from American Society of Human Genetics, European Society of Human Genetics, and variant interpretation frameworks influenced by work at Stanford University and Broad Institute. Content links to gene annotations curated by teams at European Molecular Biology Laboratory, transcript models from GenBank, and functional studies from labs at National Cancer Institute and Rockefeller University. HGMD entries reference phenotype descriptions documented in case reports from hospitals like Boston Children's Hospital and research cohorts from projects such as UK Biobank.

Access, Use and Licensing

Access to HGMD is provided through commercial licenses used by pharmaceutical firms like GlaxoSmithKline, diagnostics companies such as Illumina and Thermo Fisher Scientific, and academic subscriptions held by institutions including University of California, Berkeley, Yale University, and University of Toronto. Non‑commercial academic users and consortia tied to funders like European Commission or National Health Service may obtain terms that differ from corporate agreements negotiated with entities such as Pfizer and Roche. Integration into clinical reports at centers like Great Ormond Street Hospital and industry deployments at Genentech rely on compliance with intellectual property frameworks used by publishers Oxford University Press and databases administered by European Genome-phenome Archive.

Applications in Research and Clinical Genetics

HGMD is applied in variant filtering and pathogenicity assessment in diagnostic labs at Seattle Children's Hospital, research on genotype–phenotype correlations at University of Pennsylvania, and studies of population variation at Imperial College London. It informs computational predictors developed at Google DeepMind, machine learning research from Carnegie Mellon University, and pharmacogenomics investigations at University of California, San Diego. Clinicians referencing case literature from Johns Hopkins Hospital, St. Jude Children's Research Hospital, and Royal Free Hospital use HGMD alongside databases like OMIM and HGNC to interpret rare disease variants and to prioritize targets for functional assays conducted at institutions such as ETH Zurich and University of Tokyo.

Limitations and Criticisms

Critiques of HGMD reflect concerns raised in commentaries from editorial boards of Nature Reviews Genetics and Genome Research about coverage bias toward published variants from centers such as Mayo Clinic and Cleveland Clinic and underrepresentation of data shared in consortia like ExAC and gnomAD. Users at public repositories including ClinVar and contributors from initiatives like Global Alliance for Genomics and Health note that licensing restricts open redistribution compared to open models promoted by Wellcome Trust and European Molecular Biology Organization. Additional limitations cited by analysts at Broad Institute and European Bioinformatics Institute include potential inconsistencies with nomenclature standards maintained by HGNC and variant interpretation guidelines from American College of Medical Genetics and Genomics.

Category:Genetics databases