LLMpediaThe first transparent, open encyclopedia generated by LLMs

DNAnexus

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Expansion Funnel Raw 70 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted70
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
DNAnexus
NameDNAnexus
TypePrivate
Founded2009
HeadquartersMountain View, California
IndustryBiotechnology, Cloud computing
ProductsCloud bioinformatics platform

DNAnexus is a cloud-based bioinformatics platform that provides genomic data analysis, management, and collaboration tools for research institutions, pharmaceutical companies, and clinical laboratories. Founded in 2009, the company offers scalable infrastructure and software to process DNA sequencing data, support regulatory workflows, and integrate with biomedical research pipelines. DNAnexus serves customers in translational research, precision medicine, population genomics, and clinical trials.

History

DNAnexus was founded in 2009 amid rapid advances in next-generation sequencing and the rise of cloud computing, paralleling developments involving Illumina, Life Technologies, Amazon Web Services, Google Cloud Platform, and Microsoft Azure. Early deployments targeted academic centers such as Stanford University, Broad Institute, University of California, San Francisco, and Harvard Medical School to address the computational bottlenecks faced by projects like the 1000 Genomes Project and the Cancer Genome Atlas. The company grew through collaborations with industry partners including Pfizer, Novartis, Genentech, and technology providers such as Intel and NVIDIA. Over time DNAnexus expanded its remit from research-grade pipelines to regulated environments supporting initiatives like the All of Us Research Program and clinical sequencing pilot programs associated with Mayo Clinic and Cleveland Clinic.

Platform and Services

The DNAnexus platform provides a suite of services for processing and managing genomics data, integrating tools from ecosystem players like GATK, BWA, SAMtools, Picard, and DeepVariant. It offers workflow execution compatible with standards promulgated by organizations such as the Global Alliance for Genomics and Health and supports container technologies from Docker, Singularity, and orchestration patterns influenced by Kubernetes. Customers can run scalable variant-calling, alignment, annotation, and secondary-analysis pipelines used by projects tied to NIH, Wellcome Trust, and national biobanks including UK Biobank and All of Us. Professional services include custom pipeline development for partners like AstraZeneca, Bristol-Myers Squibb, and academic consortia such as ENCODE and GTEx.

Technology and Architecture

DNAnexus leverages cloud infrastructure from providers such as Amazon Web Services, Google Cloud Platform, and Microsoft Azure while integrating tools and libraries developed by research groups at Broad Institute, Wellcome Sanger Institute, and European Bioinformatics Institute. The architecture emphasizes reproducibility and provenance tracking, adopting metadata standards from FAIR principles initiatives and interoperability recommendations advocated by Global Alliance for Genomics and Health. Compute workloads use high-performance instances often accelerated by processors from Intel or GPUs from NVIDIA for machine-learning methods inspired by work from DeepMind and academic groups at MIT and Stanford University. Data management incorporates formats championed by GA4GH, such as CRAM and BAM, and annotation resources like dbSNP, ClinVar, Ensembl, and RefSeq.

Applications and Use Cases

DNAnexus is used in cancer genomics programs similar to the Cancer Genome Atlas and precision oncology initiatives at institutions like Memorial Sloan Kettering Cancer Center and Dana-Farber Cancer Institute, enabling somatic variant detection, tumor-normal comparisons, and neoantigen prediction. In rare-disease diagnostics it supports pipelines employed by referral centers modeled on GeneDx, Ambry Genetics, and research consortia like Deciphering Developmental Disorders. Population-scale projects—analogous to efforts at UK Biobank, Estonian Genome Project, and Iceland's deCODE genetics—utilize the platform for cohort analytics, GWAS workflows, and phenotype integration with electronic health record systems from vendors such as Epic Systems and Cerner. Pharmaceutical research groups apply DNAnexus for biomarker discovery, pharmacogenomics, and clinical-trial genomics used by companies like Roche, GlaxoSmithKline, and Merck & Co..

Partnerships and Collaborations

DNAnexus has partnered with cloud providers Amazon Web Services, Google Cloud Platform, and Microsoft Azure to deliver validated environments for genomics. Collaborations with sequencing technology vendors including Illumina, PacBio, and Oxford Nanopore Technologies support native ingestion of instrument outputs. The company engages with research organizations and consortia such as Broad Institute, Wellcome Trust Sanger Institute, Global Alliance for Genomics and Health, All of Us Research Program, and national health agencies like NIH to support large-scale genomics initiatives. Industry alliances include working with pharmaceutical companies like Pfizer and Novartis as well as diagnostic firms such as Guardant Health and academic medical centers including Mayo Clinic.

Data Security and Compliance

DNAnexus emphasizes compliance with regulatory frameworks including HIPAA, CLIA-related requirements commonly addressed in clinical genomics, and standards relevant to FDA submissions. The platform implements encryption, access controls, audit logging, and identity management practices aligned with guidance from National Institute of Standards and Technology and privacy considerations driven by legislation such as GDPR in the European Union. Security certifications and compliance attestations are designed to support translational research and clinical-grade deployments in settings like hospital systems exemplified by Cleveland Clinic and research networks funded by NIH.

Category:Bioinformatics Category:Cloud computing companies Category:Genomics