Generated by DeepSeek V3.2| STRIDES Initiative | |
|---|---|
| Name | STRIDES Initiative |
| Established | 2018 |
| Focus | Biomedical data science, cloud computing, artificial intelligence |
| Key people | Joshua C. Denny, Andrea T. Norris |
| Parent organization | National Institutes of Health |
| Collaborators | Google Cloud, Amazon Web Services, Microsoft Azure, Broad Institute |
STRIDES Initiative. The Science and Technology Research Infrastructure for Discovery, Experimentation, and Sustainability (STRIDES) Initiative is a strategic partnership program launched by the National Institutes of Health to provide NIH-funded researchers with cost-effective access to commercial cloud computing environments and advanced data science tools. Established to accelerate biomedical discovery, the initiative negotiates contracts with leading industry providers to create a scalable, secure, and collaborative ecosystem for analyzing vast and complex biomedical datasets, such as those from the All of Us Research Program and the Cancer Genome Atlas. By lowering technological and financial barriers, it empowers scientists across the United States to leverage cutting-edge computational resources, including artificial intelligence and machine learning, to advance public health research.
The core mission of the initiative is to modernize the National Institutes of Health's data infrastructure by integrating commercial cloud solutions directly into the research workflow. It operates under the auspices of the NIH Office of Data Science Strategy and the NIH Center for Information Technology, providing a centralized framework for data access and analysis. This framework supports a wide array of NIH institutes and centers, including the National Cancer Institute, the National Human Genome Research Institute, and the National Institute of Allergy and Infectious Diseases. The environment facilitates work on massive datasets from flagship projects like the Human Microbiome Project and the Brain Research Through Advancing Innovative Neurotechnologies Initiative, enabling researchers to perform analyses that were previously computationally prohibitive.
The initiative was formally launched in 2018 following a period of strategic planning within the National Institutes of Health to address the growing data deluge in fields like genomics and precision medicine. Its creation was influenced by the success of earlier pilot projects and the pressing need to manage data from endeavors such as the Precision Medicine Initiative. The first major partnership was established with Google Cloud, followed swiftly by agreements with Amazon Web Services and later Microsoft Azure. Key leadership from figures like Joshua C. Denny and Andrea T. Norris helped shape its governance model and expand its reach. The development phase involved close collaboration with NIH-funded extramural researchers at institutions like the University of California, San Francisco and the Mayo Clinic to ensure the tools met practical scientific needs.
A primary objective is to enable large-scale integrative analysis of diverse biomedical data types, including whole genome sequencing, electronic health records, and medical imaging data. Researchers utilize the cloud platforms to apply sophisticated machine learning algorithms for tasks such as drug discovery, variant calling in genomic studies, and predictive modeling for disease outcomes. The infrastructure specifically supports the analysis of controlled-access datasets from repositories like the Database of Genotypes and Phenotypes and the Cancer Imaging Archive. By providing standardized, cloud-optimized workflows and tools, the initiative aims to enhance reproducibility, foster open science, and accelerate the translation of research findings into clinical applications, such as those pursued by the National Center for Advancing Translational Sciences.
The initiative is fundamentally built upon public-private partnerships with major cloud service providers, including Google Cloud, Amazon Web Services, and Microsoft Azure. These contracts provide NIH researchers with discounted rates and dedicated technical support. Beyond industry, it collaborates extensively with academic and research institutions, such as the Broad Institute of MIT and Harvard, Stanford University, and the Jackson Laboratory. It also works closely with other federal agencies, including the Department of Energy and the National Science Foundation, on cross-cutting computational challenges. These collaborations are often formalized through joint working groups and co-funded research projects that address specific scientific questions in areas like COVID-19 research and Alzheimer's disease.
The initiative has significantly democratized access to high-performance computing for the biomedical research community, supporting thousands of researchers across all 50 states. Notable achievements include enabling the rapid data analysis for the National COVID Cohort Collaborative during the COVID-19 pandemic and reducing the cost and time required for large-scale genomic studies. It has facilitated groundbreaking research published in journals like Nature and Science, contributing to discoveries in cancer genomics, rare diseases, and population health. By creating a more efficient and collaborative data ecosystem, the initiative has strengthened the United States' research infrastructure and positioned the National Institutes of Health at the forefront of data-driven biomedical science.
Category:National Institutes of Health Category:Biomedical research Category:Cloud computing Category:Research initiatives