Generated by GPT-5-mini| cBioPortal | |
|---|---|
| Name | cBioPortal |
| Developer | Memorial Sloan Kettering Cancer Center; Dana-Farber Cancer Institute; Broad Institute |
| Released | 2011 |
| Programming language | JavaScript; Java; Python; R |
| Operating system | Cross-platform |
| License | Open-source |
cBioPortal is an open-source web platform for interactive exploration of cancer genomics datasets developed to support translational research and clinical interpretation. It integrates somatic mutation, copy-number alteration, mRNA expression, DNA methylation, protein expression, clinical annotations, and pathway-level summaries to facilitate hypothesis generation and data-driven discovery. The project has been maintained by collaborations among major research institutions and is widely used by cancer centers, consortia, and pharmaceutical organizations.
cBioPortal originated from a collaboration among research groups at Memorial Sloan Kettering Cancer Center, Dana-Farber Cancer Institute, and the Broad Institute of MIT and Harvard to provide a user-friendly interface for complex oncology datasets. Its design philosophy emphasizes rapid visualization inspired by projects such as The Cancer Genome Atlas and International Cancer Genome Consortium while adopting principles from software efforts like Galaxy (computational biology project), IGV (Integrative Genomics Viewer), and UCSC Genome Browser. The portal supports cohort-level and case-level queries, enabling comparisons across studies like Pan-Cancer Atlas, MSK-IMPACT, and datasets produced by Genomics England and European Genome-phenome Archive contributors. It has influenced downstream platforms developed at institutions such as Fred Hutchinson Cancer Research Center, Stanford University, and Johns Hopkins University.
The portal aggregates public and controlled-access datasets from initiatives including The Cancer Genome Atlas (TCGA), International Cancer Genome Consortium (ICGC), Clinical Proteomic Tumor Analysis Consortium (CPTAC), and institutional sequencing programs like MSK-IMPACT. Data modalities incorporated comprise somatic mutation calls (often from pipelines developed at Broad Institute and Sanger Institute), copy-number segments (with methods from GISTIC authors), RNA-seq expression quantified with tools such as RSEM and Kallisto, and proteomics from mass spectrometry cohorts including CPTAC studies. Clinical annotations draw on curated data standards influenced by organizations like AACR (American Association for Cancer Research), NCI (National Cancer Institute), and registries such as SEER Program. Integration of pathway and network resources references repositories like Reactome, KEGG, and Gene Ontology contributors.
The platform uses a multi-tier architecture with a backend data service, a relational or NoSQL store, and a JavaScript-based frontend built on libraries used in projects like D3.js and Bootstrap (front-end framework). Server-side components are implemented in languages and frameworks associated with teams at Broad Institute and Memorial Sloan Kettering Cancer Center, employing RESTful APIs similar to designs in Ensembl and UCSC Genome Browser. Data loading and harmonization pipelines adopt file formats and tools from initiatives such as Variant Call Format, BED (file format), and workflow engines inspired by Cromwell (workflow management system) and Nextflow. Deployment patterns include containerization using technologies promulgated by Docker, Inc. and orchestration practices from Kubernetes adopters among academic computing centers.
Key interactive visualizations parallel capabilities seen in genomic tools like Integrative Genomics Viewer, offering mutation lollipop plots informed by annotations from resources such as COSMIC and ClinVar. The portal supports OncoPrint visual summaries, survival analysis with Kaplan–Meier plots analogous to outputs in studies from SEER Program-linked research, mutual exclusivity and co-occurrence statistics like tests used in publications from groups at Memorial Sloan Kettering Cancer Center and Broad Institute, and pathway-level enrichment informed by Reactome and KEGG. Programmatic access via API endpoints encourages integration with analysis environments such as R (programming language), Bioconductor, and Python (programming language) ecosystems. Export and interoperability align with standards propagated by GA4GH and data sharing practices endorsed by NIH policy.
Researchers at institutions including Memorial Sloan Kettering Cancer Center, Dana-Farber Cancer Institute, Broad Institute, Stanford University School of Medicine, University of California, San Francisco, and Harvard Medical School use the portal for exploratory analyses that inform publications, clinical trial design, and translational pipelines. Pharmaceutical and biotech groups such as Genentech, Pfizer, and Novartis have used portal-derived insights for target nomination and biomarker discovery. Case studies published in journals from publishers like Nature Publishing Group, Cell Press, and The New England Journal of Medicine illustrate applications in precision oncology, including identification of actionable alterations referenced against clinical knowledgebases such as OncoKB and CIViC. The portal supports educational use in courses at institutions like Johns Hopkins University and Massachusetts Institute of Technology.
The software is distributed under an open-source license and has an active developer community with contributions from academic centers, consortia, and industry partners including teams from Memorial Sloan Kettering Cancer Center, Broad Institute, Dana-Farber Cancer Institute, and contributors associated with ELIXIR nodes. Development workflows follow practices common to projects hosted on platforms like GitHub and GitLab, with issue tracking, continuous integration practices influenced by adopters such as Travis CI and Jenkins (software), and containerized deployment examples promoting reproducibility advocated by Open Science initiatives. Training, documentation, and community support are provided through workshops at conferences such as American Association for Cancer Research Annual Meeting, ASCO Annual Meeting, and Genome Informatics symposia.
Category:Bioinformatics software