Influenza Research Database

Influenza Research Database
Name	Influenza Research Database
Formation	2004

Contents

Overview
History and Development
Data Content and Curation
Tools and Analytical Features
Access, Licensing, and Data Submission
Collaborations and Funding
Impact and Applications

Influenza Research Database is a publicly accessible bioinformatics resource for influenza virus sequence, annotation, and analysis that supports research on influenza A, B, C, and D viruses. The database aggregates sequence records, antigenic data, metadata and curated literature to serve researchers working in virology, epidemiology, immunology and public health. It interoperates with major sequence repositories, surveillance programs, and research consortia to enable comparative genomics, vaccine design and outbreak investigation.

Overview

The resource integrates sequence repositories such as GenBank, EMBL-EBI, DDBJ, and links to surveillance initiatives like Global Influenza Surveillance and Response System, Centers for Disease Control and Prevention, World Health Organization and European Centre for Disease Prevention and Control. It cross-references curated resources including UniProt, Protein Data Bank, NCBI Taxonomy, and ontologies from Gene Ontology and Sequence Ontology. The platform supports data standards promoted by organizations such as International Committee on Taxonomy of Viruses, Global Initiative on Sharing All Influenza Data, Public Health England, and collaborates with institutions including Harvard University, Johns Hopkins University, University of Oxford, Imperial College London and Centers for Disease Control and Prevention branches. Users range from investigators at National Institutes of Health intramural programs to clinicians at Mayo Clinic, modelers at Los Alamos National Laboratory, and public health officials at Ministry of Health (United Kingdom) equivalents.

History and Development

The platform originated amid post-2003 investments in pathogen informatics that followed outbreaks like the 2009 swine flu pandemic and built on infrastructure exemplified by projects at National Center for Biotechnology Information and international nodes such as European Bioinformatics Institute. Early development involved collaborations with Department of Health and Human Services, Biomedical Advanced Research and Development Authority, and academic groups at University of Wisconsin–Madison and St. Jude Children's Research Hospital. Subsequent enhancements incorporated lessons from responses to events including H5N1 avian influenza outbreaks, 2009 H1N1 pandemic, and zoonotic spillover investigations linked to Nipah virus and Ebola virus epidemic in West Africa. Iterative releases added integration with databases maintained by Los Alamos National Laboratory, analytic toolkits influenced by work at Broad Institute, and visualization paradigms from Nextstrain and UCSC Genome Browser projects.

Data Content and Curation

Content includes curated nucleotide and protein sequences, antigenic cartography, experimental metadata, host and geographic metadata linked to standards from ISO 3166, and literature summaries drawing on journals such as Nature, Science, The Lancet, New England Journal of Medicine, and PLOS Pathogens. Sequence curation references taxonomic frameworks from International Committee on Taxonomy of Viruses and functional annotations from UniProtKB curators. The database ingests submissions from surveillance labs including Public Health Agency of Canada, CDC Influenza Division, WHO Collaborating Centres for Reference and Research on Influenza, and veterinary networks like World Organisation for Animal Health. It applies quality control routines aligned with pipelines at NCBI and aligns with metadata policies advocated by Global Alliance for Genomics and Health.

Tools and Analytical Features

The platform provides alignment, phylogenetics, and mutation scanning toolsets interoperable with MAFFT, MUSCLE, RAxML, BEAST, IQ-TREE, and visualization modules inspired by Nextstrain and Microreact. It offers antigenic cartography interfaces similar to those used by World Health Organization influenza vaccine strain selection meetings and integrates epitope mapping leveraging structures in the Protein Data Bank with homology modeling approaches used at Modeller and SWISS-MODEL. Analytical workflows support selection pressure analysis using methods from HyPhy and recombination detection tools related to RDP4. The resource enables batch queries and programmatic access consistent with APIs from NCBI Entrez and data formats used by GISAID and EMBL-EBI.

Access, Licensing, and Data Submission

Data access policies reflect contributions from public repositories such as GenBank and curated inputs from partners including Global Initiative on Sharing All Influenza Data participants and WHO Collaborating Centres. Submission pipelines accept metadata and sequences from laboratories at institutes like University of California, San Francisco, Centers for Disease Control and Prevention, and veterinary diagnostic centers affiliated with USDA networks. Licensing often follows public-domain or permissive terms used by National Institutes of Health funded resources, while respecting contributor agreements modeled after GISAID data access expectations. User authentication and data-use compliance procedures parallel practices at European Genome-phenome Archive for sensitive datasets.

Collaborations and Funding

Major collaborations span federal agencies including National Institute of Allergy and Infectious Diseases, academic partners at University of Maryland School of Medicine, Columbia University, Yale University, and international agencies such as World Health Organization and Food and Agriculture Organization. Funding sources include grants from National Institutes of Health, cooperative agreements with Centers for Disease Control and Prevention, and support from research programs at Department of Health and Human Services. Collaborative network partners include surveillance consortia like WHO Global Influenza Surveillance, modeling groups at Los Alamos National Laboratory, and vaccine research centers linked to BARDA and academic vaccine hubs at Oxford Vaccine Group.

Impact and Applications

The database has supported influenza vaccine strain selection deliberations at World Health Organization meetings, genomic epidemiology studies published in Nature Medicine and PNAS, outbreak investigations involving H5N1 and H7N9 detections, and antiviral resistance monitoring referenced by European Centre for Disease Prevention and Control. It underpins research by investigators at Johns Hopkins Bloomberg School of Public Health, London School of Hygiene and Tropical Medicine, and Imperial College London modelers, and informs One Health initiatives coordinated with World Organisation for Animal Health. Applications include comparative genomics in studies from Cold Spring Harbor Laboratory, diagnostics development at industry partners like Roche Diagnostics, and policy guidance used by national public health institutes such as Public Health England and Australian Department of Health.

Category:Biological databases