LLMpediaThe first transparent, open encyclopedia generated by LLMs

UCSC Genome Browser

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Sanger Institute Hop 3
Expansion Funnel Raw 93 → Dedup 32 → NER 28 → Enqueued 19
1. Extracted93
2. After dedup32 (None)
3. After NER28 (None)
Rejected: 4 (not NE: 4)
4. Enqueued19 (None)
UCSC Genome Browser
NameUCSC Genome Browser
DeveloperUniversity of California, Santa Cruz
Initial release2000
Programming languageC, Perl, JavaScript
Operating systemCross-platform
LicenseVarious academic licenses

UCSC Genome Browser The UCSC Genome Browser is an integrated web-based platform developed at the University of California, Santa Cruz for exploration of genomic sequence and annotation. It provides sequence visualization, comparative genomics, and annotation integration used by researchers at institutions such as the National Institutes of Health, Broad Institute, Wellcome Sanger Institute, and European Bioinformatics Institute. The Browser has been cited in publications from journals including Nature, Science, Genome Research, PNAS, and Cell.

History

The project originated in 2000 at the University of California, Santa Cruz under the leadership of Jim Kent and David Haussler’s group, contemporaneous with the Human Genome Project and efforts at the International Human Genome Sequencing Consortium, Celera Genomics, and the National Human Genome Research Institute. Early milestones included hosting the draft human assembly alongside resources from Ensembl and annotations from the RefSeq and GENCODE consortia. Subsequent expansions incorporated comparative assemblies from model organism centers like WormBase, FlyBase, and Mouse Genome Informatics and collaborative datasets from projects such as the 1000 Genomes Project, ENCODE Project, and the Cancer Genome Atlas.

The Browser evolved through integration of data standards endorsed by the Global Alliance for Genomics and Health and interoperability with tools developed at institutions including the Broad Institute, European Molecular Biology Laboratory, and the Wellcome Sanger Institute. Releases frequently accompanied new assemblies from sequencing centers like Baylor College of Medicine Human Genome Sequencing Center and updates from consortia including GIAB and Human Cell Atlas collaborations.

Features and tools

The Browser offers features comparable to platforms from Ensembl, IGV (Integrative Genomics Viewer), and UCSC Table Browser-adjacent utilities. Key tools include the Table Browser for querying annotation tables, the LiftOver utility for coordinate conversion between assemblies, and the Genome Browser graphical viewer for track stacking and navigation. Additional utilities support BLAST integration with NCBI BLAST, sequence alignment with tools from MAFFT and BLAT, and gene model curation interoperable with GENCODE, RefSeq, and UniProt entries.

Specialized toolsets enable comparative genomics (pairwise and multiple alignments using algorithms from groups such as Paten Lab and implementations related to Lastz), conservation scoring pipelines leveraging methods cited in literature from PhastCons authors, and variant effect inspection in contexts used by ClinVar, dbSNP, and gnomAD. The platform supports session saving, custom track hubs used broadly by researchers at Stanford University, MIT, Harvard Medical School, and biotechnology companies like Illumina and 10x Genomics.

Data and annotation tracks

Annotation tracks aggregate resources from major databases and projects, including RefSeq, GENCODE, Ensembl, UniProt, dbSNP, ClinVar, 1000 Genomes Project, gnomAD, ENCODE Project, and the Roadmap Epigenomics Project. Comparative genomics tracks contain alignments with genomes from Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Danio rerio, and non-model organisms sequenced by centers like JGI and Sanger Institute. Epigenomic and regulatory tracks incorporate data from labs associated with Friedman Lab, Lander Lab, and datasets produced by consortia such as GTEx and BLUEPRINT.

Variant annotation and clinical tracks reference curated resources from ClinGen, OMIM, HGMD (commercial collaborations), and population variant catalogs from ExAC and 1000 Genomes Project. Expression and RNA-seq tracks link to data produced by groups at ENCODE Project Consortium, GTEx Consortium, and investigator-led projects at institutions such as Cold Spring Harbor Laboratory and Broad Institute.

User interface and visualization

The Genome Browser GUI provides an interactive genomic canvas used by scientists at institutions including Stanford University, Harvard University, MIT, UC Berkeley, and industrial partners such as Illumina and Thermo Fisher Scientific. Visualization options include dense, squish, pack, and full display modes familiar to users of IGV and JBrowse. Track hubs and composite tracks enable workflows shared by groups at EMBL-EBI, Sanger Institute, and clinical bioinformatics teams at Mayo Clinic and Massachusetts General Hospital.

Navigation supports keyboard shortcuts and URL-driven sessions used in workshops at Cold Spring Harbor Laboratory and courses at EMBL. The browser’s graphical outputs are widely used in figures for journals like Nature Genetics and Genome Biology and in presentations at conferences including ASHG and Gordon Research Conferences.

Bioinformatics integration and APIs

Programmatic access includes utilities and RESTful endpoints analogous to services offered by NCBI, EBI APIs, and integration with workflow systems such as Galaxy, Cromwell, and Nextflow. Command-line tools like LiftOver and bigBed/bigWig converters are used in pipelines from groups at Broad Institute and in cloud deployments on Amazon Web Services and Google Cloud Platform aligned with best practices from the Global Alliance for Genomics and Health.

The Browser supports data hubs and UCSC Genome Browser API calls used by software projects at Illumina, 10x Genomics, and open-source projects hosted by GitHub repositories maintained by university and consortium developers. Interoperability with Bioconductor packages and APIs enables analyses performed in environments at Stanford Medicine, Dana-Farber Cancer Institute, and academic core facilities.

Community, licensing, and data access policies

The project operates under academic licensing and data-use policies influenced by funders such as the National Institutes of Health, the Wellcome Trust, and the European Commission. Community engagement includes outreach with groups like ENCODE Project Consortium, GTEx Consortium, 1000 Genomes Project, and training collaborations with Cold Spring Harbor Laboratory and EMBL. Data access follows contributor policies similar to those from dbGaP, EGA, and other controlled-access repositories when handling human-subject datasets, while many public annotation tracks remain open for academic use in accordance with provider licenses from RefSeq, GENCODE, and UniProt.

The Browser’s user community includes researchers at Harvard Medical School, Broad Institute, Stanford University, clinical groups at Mayo Clinic and Massachusetts General Hospital, and industry users at companies such as Illumina and Thermo Fisher Scientific, all contributing to shared track hubs, tutorials, and collaborative publications in venues like Genome Research and Nature Methods.

Category:Genome browsers