Generated by GPT-5-mini| CyVerse | |
|---|---|
| Name | CyVerse |
| Type | Non-profit research organization |
| Founded | 2008 |
| Headquarters | Tucson, Arizona |
| Region served | International |
| Leader title | Executive Director |
| Parent organization | University of Arizona |
CyVerse CyVerse is a national-scale cyberinfrastructure organization that provides data management, computational, and collaboration services to life sciences and allied research communities. It offers cloud-based platforms, high-performance computing gateways, and data commons designed to support large-scale projects in genomics, ecology, agriculture, and bioinformatics. CyVerse integrates resources from universities, national laboratories, and research consortia to enable reproducible analysis, data sharing, and training across diverse scientific domains.
CyVerse provides modular cyberinfrastructure including data storage, workflow engines, and identity management for scientists working in fields such as genomics, ecology, plant science, microbiology, and environmental science. It interoperates with resources like XSEDE, Open Science Grid, Amazon Web Services, Google Cloud Platform, and national data repositories to support projects initiated at institutions such as the University of Arizona, University of California, Berkeley, Arizona State University, Cold Spring Harbor Laboratory, and Lawrence Berkeley National Laboratory. Users leverage tools inspired by platforms developed at institutions like Broad Institute, European Bioinformatics Institute, National Center for Biotechnology Information, and Joint Genome Institute. CyVerse supports collaborations with initiatives such as the Human Cell Atlas, Earth BioGenome Project, iPlant Collaborative, Genomics England, and the Global Biodiversity Information Facility.
Originally funded as a national-scale initiative, CyVerse evolved from community-driven projects and federal programs supported by agencies including the National Science Foundation, Department of Energy, National Institutes of Health, and private foundations. Early roots include partnerships with the iPlant Collaborative, the National Center for Supercomputing Applications, the TeraGrid project, and campus-based research computing centers at institutions like Cornell University, University of Texas, and University of Florida. Funding models included cooperative agreements with research universities, grants involving agencies such as NSF Directorate for Biological Sciences, DOE Office of Science, and philanthropic support from organizations allied with museums like Smithsonian Institution and botanical gardens such as Missouri Botanical Garden. Major awardees and collaborators have included researchers affiliated with Harvard University, Massachusetts Institute of Technology, Stanford University, Yale University, and Princeton University.
CyVerse architecture combines distributed storage, containerized applications, and workflow orchestration to support end-to-end research pipelines. Core components align with technologies developed by Docker, Kubernetes, Galaxy Project, Apache Hadoop, and PostgreSQL, while enabling integration with national compute resources such as Oak Ridge National Laboratory, Los Alamos National Laboratory, Argonne National Laboratory, and National Energy Research Scientific Computing Center. Services include data commons, Discovery Environment, Atmosphere virtual machines, Data Store, and Apps that facilitate analyses used in projects like 1000 Genomes Project, ENCODE, and The Cancer Genome Atlas. Authentication and authorization integrate with identity providers and standards used by ORCID, InCommon, eduGAIN, and ResearcherID to enable federated access across university clusters and government facilities.
Researchers use CyVerse for workflows in comparative genomics, transcriptomics, metabolomics, remote sensing, and ecological modeling, with pipelines referencing tools from BLAST, BWA, GATK, Trinity, and QIIME. Educational programs incorporate CyVerse into curricula at institutions such as University of Arizona, Arizona State University, University of Illinois, University of Washington, and University of California systems to teach reproducible research methods used in courses associated with Howard Hughes Medical Institute, National Science Teachers Association, and Carnegie Institution for Science. Citizen science and outreach efforts interface with projects like Zooniverse, iNaturalist, eBird, and the Smithsonian Transcription Center to broaden datasets for researchers affiliated with Botanical Research institutes, agricultural experiment stations, and conservation organizations like The Nature Conservancy.
Governance structures bring together academic partners, national laboratories, and community advisory boards including scientists from institutions such as Princeton University, Johns Hopkins University, University of Colorado, Colorado State University, and University of Minnesota. Community engagement leverages workshops and conferences held with professional societies like American Society of Plant Biologists, Society for Conservation Biology, American Society for Microbiology, Genetics Society of America, and the Ecological Society of America. Advisory roles often interact with funding agency review panels and collaborative networks including Research Data Alliance, Global Organization for Bioinformatics Learning, Education & Training, and the Open Bioinformatics Foundation.
CyVerse-enabled projects have supported large consortia and notable studies in plant genomics, microbial ecology, and environmental monitoring, complementing efforts such as the Plant Genome Project, Crop Trust initiatives, Human Microbiome Project, and Long Term Ecological Research Network. Noteworthy deployments include data management for the Earth Microbiome Project, analysis workflows for ENCODE contributors, and cloud-based training for participants in workshops associated with Cold Spring Harbor Laboratory, Gordon Research Conferences, and Federation of American Societies for Experimental Biology. The platform has facilitated publications with authors from institutions such as Harvard Medical School, Salk Institute, Max Planck Institute, and University of Cambridge, and fostered collaborations with industry partners like Illumina, Thermo Fisher Scientific, Roche, and Agilent Technologies to accelerate translational research.
Category:Research infrastructure