Zoonomia Project

Zoonomia Project
Name	Zoonomia Project
Type	Comparative genomics consortium
Founded	2017
Founders	Evan Eichler, Seth A. Bordenstein
Headquarters	Seattle, United Kingdom
Fields	Genomics, Evolutionary biology
Members	Consortium of sequencing centers, universities, museums

Contents

Overview
History and development
Methodology and data sources
Major findings and publications
Collaborations and funding
Ethical considerations and data sharing

Zoonomia Project The Zoonomia Project is a large-scale comparative genomics consortium focused on mapping mammalian genome variation to infer evolutionary constraint, function, and disease relevance. The initiative aggregates whole-genome sequences across diverse mammalian lineages to enable comparative analyses that inform studies in Human Genome Project, Darwin's theory of evolution, and Phylogenetics. It integrates data and expertise from sequencing centers, natural history collections, and biomedical research institutions.

Overview

The project assembles genomes from a broad taxonomic spread including representatives from Primates, Rodentia, Chiroptera, Carnivora, Cetartiodactyla, Perissodactyla, Lagomorpha, Monotremata, Marsupialia, Afrotheria, Xenarthra, Eulipotyphla, Scandentia, Dermoptera, Sirenia, Proboscidea, Hyracoidea, Tubulidentata, Macroscelidea, Pinnipedia, Hyaenidae, Felidae, and Canidae. Key institutional participants include Broad Institute, Wellcome Sanger Institute, Howard Hughes Medical Institute, European Molecular Biology Laboratory, National Institutes of Health, Smithsonian Institution, American Museum of Natural History, Museum of Vertebrate Zoology, Natural History Museum, London, Royal Society partners and university groups from Harvard University, University of California, Berkeley, University of Washington, University of Oxford, University of Cambridge, Stanford University, Massachusetts Institute of Technology, Princeton University, and Yale University.

History and development

Conceived amid advances following the Human Genome Project and the rise of high-throughput sequencing, the initiative formalized after pilot studies at Eichler Lab and collaborations with groups associated with Bordenstein Lab. Early milestones paralleled datasets from the Genome 10K initiative, the Bat1K consortium, and the Zoonotic and Emerging Infectious Diseases surveillance efforts. Subsequent phases scaled during funding rounds involving Gordon and Betty Moore Foundation, Howard Hughes Medical Institute, National Science Foundation, and translational partnerships with National Human Genome Research Institute and international consortia such as HUGO and ENA stakeholders. Governance drew on models from International Barcode of Life, Global Biodiversity Information Facility, and the Earth BioGenome Project.

Methodology and data sources

The consortium employs long-read sequencing platforms developed by Pacific Biosciences and Oxford Nanopore Technologies, short-read data from Illumina, chromatin conformation capture from groups using techniques pioneered at Broad Institute and Max Planck Institute for Evolutionary Anthropology, and annotation pipelines inspired by Ensembl, UCSC Genome Browser, and GENCODE. Sample provenance leverages collections from Smithsonian Institution, American Museum of Natural History, Natural History Museum, London, Field Museum, Royal Ontario Museum, California Academy of Sciences, and veterinary contributions from Zoo Atlanta, San Diego Zoo, London Zoo, Bronx Zoo, and regional repositories. Analytical frameworks integrate tools and resources from BLAST, MAFFT, Clustal, PhyML, RAxML, BEAST, PAML, Cactus, ProgressiveCactus, LASTZ, RepeatMasker, BUSCO, SNP calling pipelines, and functional annotation tied to GTEx, ClinVar, dbSNP, OMIM, and ENCODE-derived elements.

Major findings and publications

Consortium outputs include high-resolution maps of conserved elements, catalogs of accelerated regions correlated with trait evolution, and prioritization of variants relevant to human health. Key publications appeared in journals connected with Nature, Science, Cell, Nature Genetics, Genome Research, PNAS, and Nature Communications. Notable findings linked signatures of accelerated evolution to phenotypes studied in model and nonmodel organisms represented by Mus musculus, Rattus norvegicus, Homo sapiens, Pan troglodytes, Gorilla gorilla, Pongo abelii, Macaca mulatta, Callithrix jacchus, Canis lupus familiaris, Felis catus, Equus caballus, Bos taurus, Sus scrofa, Monodelphis domestica, Ornithorhynchus anatinus, Loxodonta africana, Trichechus manatus, Tursiops truncatus, Eptesicus fuscus, Myotis lucifugus, and Pteropus vampyrus. Comparative constraint maps informed candidate loci for diseases catalogued in Alzheimer's disease, Parkinson's disease, Type 2 diabetes mellitus, Schizophrenia, Autism spectrum disorder, Congenital heart defects, and rare Mendelian conditions curated in OMIM. Cross-references tied evolutionary analyses to paleontological records like Laetoli footprints, Messel Pit, Green River Formation, and biogeographic events such as the Cretaceous–Paleogene extinction event.

Collaborations and funding

Collaborative networks extend across academic, museum, and private sectors, including partners such as Broad Institute, Wellcome Sanger Institute, European Bioinformatics Institute, Cold Spring Harbor Laboratory, Fred Hutchinson Cancer Research Center, Vanderbilt University, University of Chicago, Columbia University, New York University, Johns Hopkins University, Karolinska Institutet, Max Planck Society, Chinese Academy of Sciences, RIKEN, Monash University, University of Melbourne, University of Toronto, McGill University, University of British Columbia, and funding from Gordon and Betty Moore Foundation, Howard Hughes Medical Institute, Wellcome Trust, National Institutes of Health, European Commission Horizon 2020, UK Research and Innovation, and philanthropic sources tied to conservation NGOs like World Wildlife Fund and Conservation International.

The consortium addresses sample access and permitting with frameworks referencing Convention on Biological Diversity, Nagoya Protocol, and institutional review boards at participating universities such as Harvard University, University of Oxford, University of Cambridge, Stanford University, and Yale University. Data sharing aligns with deposition norms at GenBank, European Nucleotide Archive, Sequence Read Archive, and community annotation via Ensembl and UCSC Genome Browser, balanced against concerns about indigenous rights involving stakeholders like National Congress of American Indians and regional entities in Brazil, Madagascar, Indonesia, Australia, and Papua New Guinea. Biosecurity dialogues involved consultations with World Health Organization, Centers for Disease Control and Prevention, Food and Agriculture Organization, and legal frameworks from United States Fish and Wildlife Service and CITES-listed species committees. The consortium publishes data release policies modeled on precedents from Human Genome Project and HGP Ethics Working Group.

Category:Comparative genomics

Overview

History and development

Methodology and data sources

Major findings and publications

Collaborations and funding

Ethical considerations and data sharing