LLMpediaThe first transparent, open encyclopedia generated by LLMs

Plant 10KP Project

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Expansion Funnel Raw 86 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted86
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Plant 10KP Project
NamePlant 10KP Project
Start2010s
StatusOngoing
LocationGlobal

Plant 10KP Project

The Plant 10KP Project is a large-scale international initiative to generate reference genomes for ten thousand plant species, aiming to transform comparative genomics across the green lineage. It brings together major botanical institutions, herbaria, sequencing centers, and conservation organizations to sample, sequence, assemble, and make available high-quality genome assemblies for phylogenetic, functional, and conservation applications. The effort links taxonomic expertise with high-throughput sequencing platforms and bioinformatics centers to address gaps in knowledge about plant diversity.

Background and Goals

The project emerged from discussions among researchers affiliated with the Royal Botanic Gardens, Kew, Smithsonian Institution, Missouri Botanical Garden, Chinese Academy of Sciences, and Max Planck Society, seeking to follow precedents set by the Human Genome Project, Earth BioGenome Project, and the 1000 Genomes Project. Its primary goals include producing chromosomally complete assemblies for representative taxa across major clades such as Angiosperms, Gymnosperms, Bryophytes, Pteridophytes, and Charophytes; resolving deep phylogenetic relationships exemplified in controversies like the Amborella trichopoda placement; enabling trait-mapping efforts relevant to crops such as Oryza sativa, Zea mays, Solanum lycopersicum, and Glycine max; and supporting conservation priorities exemplified by work on endangered genera curated at institutions like the New York Botanical Garden and Australian National Herbarium.

Scope and Methodology

The scope spans global floras from temperate regions catalogued in resources like the Flora of China and the Flora of North America to hyperdiverse tropical lineages in the Amazon Rainforest, Congo Basin, and Southeast Asian rainforests. Methodological standards draw on long-read technologies from vendors such as Pacific Biosciences and Oxford Nanopore Technologies, short-read platforms by Illumina, scaffolding using Hi-C from groups including the Broad Institute and optical mapping techniques pioneered by companies like Bionano Genomics. The project adopts assembly pipelines influenced by frameworks used at the Joint Genome Institute and protocols developed by consortia associated with the National Center for Biotechnology Information and the European Molecular Biology Laboratory.

Sampling and Sequencing Efforts

Sampling protocols coordinate collectors from the Royal Botanic Garden Edinburgh, Kew Herbarium, Herbarium Berolinense, and university herbaria at institutions such as Harvard University, University of Oxford, University of Melbourne, and Peking University. Sequencing campaigns prioritize type specimens and fresh collections vetted by taxonomists including curators from the Natural History Museum, London and researchers from the Smithsonian Tropical Research Institute. Sequencing efforts have leveraged high-throughput hubs like the Wellcome Sanger Institute and regional centers at the Chinese Academy of Sciences and the J. Craig Venter Institute, producing assemblies that integrate long reads, linked reads, and proximity ligation data to resolve complex genomes seen in genera such as Ficus, Eucalyptus, Pinus, and Quercus.

Data Management and Accessibility

Data management follows FAIR principles promoted by organizations like the Global Biodiversity Information Facility and repositories maintained by the International Nucleotide Sequence Database Collaboration partners: GenBank, European Nucleotide Archive, and DNA Data Bank of Japan. Metadata standards align with initiatives such as the Darwin Core and the Minimum Information about a Genome Sequence guidelines. The project coordinates with open-data platforms including the Ensembl Plants and the Phytozome resource to ensure assemblies, annotations, and raw reads are discoverable by researchers affiliated with universities like Stanford University, University of California, Berkeley, and Wageningen University.

Scientific Contributions and Discoveries

Results from the project have informed deep-time angiosperm diversification debates linked to studies published by teams at the Royal Society and have clarified genome evolution patterns related to whole-genome duplication events observed in lineages such as Brassicaceae, Fabaceae, and Poaceae. Comparative analyses have elucidated gene-family expansions involved in secondary metabolism relevant to pharmacological research at the National Institutes of Health and agrobiotechnology efforts at the International Rice Research Institute. High-quality assemblies have enabled discoveries about structural variation in conifers like Pinus taeda and repetitive element dynamics studied by groups at the Max Planck Institute for Plant Breeding Research and the Chinese Academy of Agricultural Sciences.

Collaborations and Funding

Collaborators include botanical gardens, university departments, sequencing centers, and conservation NGOs such as the World Wildlife Fund and Conservation International. Funding stems from national science agencies including the National Science Foundation, European Research Council, National Natural Science Foundation of China, philanthropic foundations exemplified by the Gordon and Betty Moore Foundation and the Bill & Melinda Gates Foundation, and institutional investments from partners like the Wellcome Trust.

The project navigates legal frameworks including the Convention on Biological Diversity, the Nagoya Protocol, and national access-and-benefit-sharing regimes administered by agencies such as the United States Department of Agriculture and the Ministry of Ecology and Environment of the People's Republic of China. It works with indigenous communities, botanical networks, and conservation bodies to prioritize endangered taxa protected under listings by the International Union for Conservation of Nature and national red lists. Policies balance open science goals with sovereign rights and benefit-sharing arrangements negotiated with stakeholders including botanical gardens, universities, and governmental institutions.

Category:Genomics projects Category:Botany Category:Conservation biology