LLMpediaThe first transparent, open encyclopedia generated by LLMs

FlyBase

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Expansion Funnel Raw 67 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted67
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
FlyBase
NameFlyBase
Founded1992
TypeBiological database
HeadquartersCambridge, Massachusetts; Cambridge, UK
LanguageEnglish
FieldsGenetics, Genomics, Developmental Biology

FlyBase is a curated online database and knowledgebase for genetic and genomic information on the model organism genus Drosophila. It integrates genetic, molecular, phenotypic, cytogenetic, and bibliographic data to support research in genetics, developmental biology, and evolutionary biology. The resource serves experimentalists and computational researchers by linking primary literature, community resources, and genome-scale datasets to standardized gene and allele records.

Overview

FlyBase provides structured records for genes, alleles, stocks, phenotypes, genomes, and references associated with species such as Drosophila melanogaster, Drosophila pseudoobscura, and other drosophilids used by laboratories worldwide. The resource interconnects entries with external repositories and community resources including the National Center for Biotechnology Information, Ensembl Genomes, and the UCSC Genome Browser. FlyBase entries cite primary literature hosted by sources like PubMed, curated from journals such as Genetics, Development, Nature, and Science. It supports standardized vocabularies and ontologies developed in collaboration with projects such as the Gene Ontology and the Sequence Ontology to enable cross-resource interoperability with databases like UniProt and Model Organism Databases.

History and Development

FlyBase originated in the early 1990s as a collaboration among researchers at institutions including the University of Cambridge (UK), the University of Cambridge (Massachusetts Institute of Technology), and the Cold Spring Harbor Laboratory community initiatives to organize genetic information for Drosophila. Early phases paralleled large-scale efforts such as the Human Genome Project and contributed to genome annotation work that interfaced with projects like the Drosophila Genome Project. Over time, FlyBase adapted to incorporate data from high-throughput technologies developed at facilities including the Broad Institute, the Wellcome Sanger Institute, and sequencing centers at the European Molecular Biology Laboratory. Governance and funding have involved agencies and organizations such as the National Institutes of Health, the Wellcome Trust, and university-based consortia.

Data Content and Curation

The database stores gene models, transcript annotations, protein products, mutant allele descriptions, phenotype annotations, and genetic interaction data. Curators extract and standardize information from literature published in venues like Cell, PNAS, Nature Genetics, and specialized Drosophila meetings such as the International Drosophila Research Conference. FlyBase employs controlled vocabularies and ontologies from collaborators such as the Gene Ontology Consortium, the Phenotype And Trait Ontology efforts, and the Sequence Ontology to annotate molecular functions, biological processes, and cellular components. Curation workflows prioritize reproducibility and provenance, linking dataset entries to accessioned resources at repositories like GenBank, ArrayExpress, and the Sequence Read Archive. Quality control integrates community feedback from groups including stock centers like the Bloomington Drosophila Stock Center and consortia such as the Drosophila RNAi Screening Center.

Tools and Features

FlyBase provides web-based search tools, genome browsers, and batch query services that interoperate with tools from Ensembl, the UCSC Genome Browser, and visualization platforms developed by the Open Source Biology community. Features include gene report pages, allele and phenotype summaries, and downloadable annotation files compatible with analysis suites such as Bioconductor and Galaxy (computational biology platform). Programmatic access via APIs and FTP supports integration with pipeline tools used in labs associated with the Max Planck Society, the European Bioinformatics Institute, and academic groups at universities like Harvard University and Stanford University. Additional utilities include orthology mappings to databases such as OrthoDB, pathway cross-references to KEGG and Reactome, and links to protein resources like PDB and InterPro.

Community and Collaboration

FlyBase operates through partnerships with international research communities, stock centers, and ontology consortia. It engages with conferences including the Society for Developmental Biology meetings and workshops at institutions like the Cold Spring Harbor Laboratory to solicit user needs and contributions. Community annotation campaigns have involved collaborations with projects such as the Encyclopedia of DNA Elements working groups and training programs hosted by the European Molecular Biology Organization. FlyBase staff coordinate with curators at model organism databases including WormBase, ZFIN, and Mouse Genome Informatics to harmonize annotation standards and cross-reference orthologous information.

Impact and Applications

FlyBase underpins research in neurobiology, developmental genetics, and evolutionary studies by enabling reproducible access to curated Drosophila information cited in high-impact studies published in outlets like Nature Neuroscience, Current Biology, and Trends in Genetics. Its datasets support genome editing projects using tools derived from discoveries at institutions such as the University of California, Berkeley and the Broad Institute, contribute to large-scale screens coordinated by centers like the Center for Integrative Genomics, and inform translational models for human disease curated in resources such as OMIM and ClinVar. FlyBase’s interoperable data model has influenced standards adopted by international bioinformatics infrastructures including the European Bioinformatics Institute and the Global Alliance for Genomics and Health.

Category:Biological databases