Vitis Database — LLMpedia

Vitis Database
Name	Vitis Database
Type	Biological database
Scope	Grapevine genomics and viticulture
Established	2000s
Access	Public / subscription

Contents

Overview
Data Model and Contents
Technology and Architecture
Access and Usage
Curation and Quality Control
Applications and Impact
History and Development

Vitis Database The Vitis Database is a specialized repository for genomic, phenotypic, and breeding information about grapevines. It aggregates sequence data, cultivar records, trait maps, and experimental metadata to support researchers, breeders, and industry stakeholders in viticulture and enology. The resource interfaces with international initiatives and institutions involved in plant genomics, biodiversity, and agricultural research.

Overview

The database serves as a central hub linking resources from projects such as the International Vitis Genetics Consortium, European Cooperative Programme for Plant Genetic Resources (ECPGR), USDA Agricultural Research Service, Institut National de la Recherche Agronomique (INRA), University of California, Davis, and John Innes Centre. It integrates datasets produced by consortia including the 1000 Plants (1KP) Project, the Genome 10K Project, and national sequencing efforts from China Academy of Agricultural Sciences, National Institute of Agricultural Botany, and Commonwealth Scientific and Industrial Research Organisation (CSIRO). The platform supports interoperability with archives like GenBank, European Nucleotide Archive, Sequence Read Archive, and curated resources such as UniProt, EnsemblPlants, and Gramene.

Data Model and Contents

Content types include whole-genome assemblies, transcriptomes, single nucleotide polymorphism (SNP) panels, linkage maps, quantitative trait loci (QTL) records, passport data for cultivars and accessions, and phenotyping assays. Representative linked datasets reference cultivar names cataloged in repositories like Vitis International Variety Catalogue and germplasm collections held by U.S. National Plant Germplasm System and National Plant Germplasm System (NPGS). Trait ontologies align with standards from Plant Ontology Consortium, Crop Ontology, and metadata schemas from MIAPPE. The schema models relationships among genomes, markers, trials, and publications from journals such as Nature Genetics, The Plant Cell, PNAS, BMC Genomics, and Horticulture Research.

Technology and Architecture

The architecture typically combines relational databases (e.g., PostgreSQL) with NoSQL stores and object storage for raw reads. Web services are implemented using frameworks like Django or Flask and API layers conforming to RESTful architecture and GA4GH standards. Visualization and analysis tools integrate libraries from BioPython, R Project for Statistical Computing, Bioconductor, and genome browsers inspired by JBrowse and Ensembl. High-performance computing resources are provided by infrastructures such as XSEDE, European Grid Infrastructure, and cloud platforms from Amazon Web Services, Google Cloud Platform, and Microsoft Azure for scalable sequence alignment, variant calling, and genome assembly pipelines.

Access and Usage

Access policies vary: some modules are open access following principles adopted by FAIR Data Principles, while others require registration or subscription agreements with institutions like Ecole Polytechnique Fédérale de Lausanne or corporate partners in the wine industry. Users include researchers from Cornell University, University of Adelaide, Wageningen University, and breeders employed by organizations such as Foster's Group and Treasury Wine Estates. Common use cases involve marker-assisted selection workflows, genomic selection models described in publications from Frontiers in Plant Science, trial design coordinated with EuroCare Vitis networks, and provenance tracking linked to geographical indications like Champagne (wine) and Bordeaux wine PDOs.

Curation and Quality Control

Curation is overseen by panels of domain experts, including geneticists, ampelographers, and bioinformaticians from institutions such as INRAE, CSIRO, Scuola Superiore Sant'Anna, and Institut Pasteur. Quality control pipelines incorporate methods from GATK, SAMtools, and benchmarking procedures used by Genome in a Bottle and international standards bodies like ISO. Metadata curation follows ontologies and controlled vocabularies maintained by organizations such as Gene Ontology Consortium and MIAPPE to ensure reproducibility cited in articles from Methods in Ecology and Evolution.

Applications and Impact

The resource supports breeding programs that have produced disease-resistant and climate-resilient cultivars evaluated in trials reported by European Commission research programs and national agricultural extension services like USDA Cooperative Extension Service. It underpins studies of pathogen resistance against agents such as Plasmopara viticola and Erysiphe necator, and investigations into metabolite pathways linked to wine aroma discussed in Journal of Agricultural and Food Chemistry. Policymakers and heritage organizations, including UNESCO biosphere reserves and regional appellation authorities, use the database to inform conservation and traceability programs.

History and Development

The database evolved from early grapevine genetics efforts at institutions like Université Montpellier, University of Padua, and Institut Français de la Vigne et du Vin (IFV), incorporating data from the first draft grapevine genome projects led by groups at CNR and Genoscope. Funding and collaborative development have involved grants from bodies such as the European Research Council, National Science Foundation, and national ministries of agriculture. Over time it has assimilated community standards developed by consortia including DivSeek and aligned with data-sharing frameworks promoted by GO FAIR.

Category:Biological databases Category:Botany Category:Agriculture