LLMpediaThe first transparent, open encyclopedia generated by LLMs

cytosine

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Chargaff's rules Hop 4
Expansion Funnel Raw 68 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted68
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()

cytosine Cytosine is a pyrimidine nucleobase found in nucleic acids, pairing with guanine in DNA and RNA. It contributes to genetic information storage in organisms from Escherichia coli to Homo sapiens and is central to processes studied by researchers at institutions like the National Institutes of Health and the European Molecular Biology Laboratory. Chemically related to compounds used in the work of Robert Burns Woodward, Dorothy Hodgkin, and structural studies at the Brookhaven National Laboratory, cytosine's properties influence fields from molecular biology to medicinal chemistry.

Structure and chemical properties

Cytosine is a heterocyclic aromatic organic compound in the pyrimidine family first characterized in studies linked to researchers at University of Cambridge and University of Oxford. Its molecular formula and ring structure were elucidated in the context of early 20th-century chemistry alongside advances by Linus Pauling and Erwin Chargaff. The base exhibits keto–enol tautomerism and has functional groups that determine hydrogen-bonding patterns essential for pairing with guanine in the Watson–Crick model developed by James Watson and Francis Crick. Cytosine's protonation and deprotonation states are influenced by pH, a property investigated in work at Max Planck Society laboratories and relevant to enzymatic recognition by polymerases such as those studied at Cold Spring Harbor Laboratory.

Biological role and occurrence

In cellular contexts studied in organisms from Saccharomyces cerevisiae to Arabidopsis thaliana, cytosine resides in DNA polymerase substrates and RNA polymerase transcripts, contributing to replication and transcription mechanisms explored by scientists at Rockefeller University and Stanford University. Cytosine residues in genomes are targets for epigenetic modification by enzymes like DNA methyltransferases, a process central to research programs at Harvard Medical School and implicated in development and disease in Mus musculus models. Viral genomes such as those of Human immunodeficiency virus and SARS-CoV-2 also contain cytosine, affecting host interactions investigated by teams at Johns Hopkins University and Centers for Disease Control and Prevention.

Biosynthesis and metabolism

De novo synthesis of pyrimidines that produce cytosine derivatives involves multi-enzyme complexes studied in bacteria like Escherichia coli and in mammalian systems characterized at Massachusetts Institute of Technology. Pathways include carbamoyl phosphate synthesis linked to enzymes described in research from University of California, Berkeley and Imperial College London. Salvage pathways reutilizing nucleosides operate in protozoan parasites such as Plasmodium falciparum and are drug targets investigated by groups at London School of Hygiene and Tropical Medicine and University of Toronto. Cytosine deamination to uracil is catalyzed by deaminases related to proteins studied at Max Delbrück Center and has implications for immune functions mediated by cells analyzed at Mayo Clinic.

Mutagenesis and DNA damage

Spontaneous and enzymatic deamination of cytosine produces mutagenic events that have been characterized in model systems including Drosophila melanogaster and Caenorhabditis elegans; mutational signatures linked to cytosine alterations are cataloged by consortia such as the International Cancer Genome Consortium and The Cancer Genome Atlas. Cytosine methylation and subsequent deamination at CpG sites contribute to mutation hotspots studied in clinical genomics at Memorial Sloan Kettering Cancer Center and Dana–Farber Cancer Institute. DNA repair pathways addressing cytosine-derived lesions, including base excision repair mechanisms involving glycosylases, have been dissected by researchers at University of Chicago and Karolinska Institutet and are relevant to mutational processes observed in BRCA1 and TP53 deficient backgrounds.

Detection and analytical methods

Analytical techniques for detecting cytosine and modified cytosines include high-performance liquid chromatography methods developed at chemical research centers such as Scripps Research and mass spectrometry workflows refined at Lawrence Berkeley National Laboratory. Bisulfite sequencing approaches for mapping 5-methylcytosine at single-base resolution were advanced in laboratories like UCSC and European Bioinformatics Institute and are widely used in epigenomics projects at Wellcome Trust–funded centers. Next-generation sequencing platforms from companies such as Illumina and long-read technologies by Oxford Nanopore Technologies enable detection of cytosine variants; bioinformatics analyses drawing on tools from EMBL-EBI and Broad Institute support interpretation.

Applications and synthetic derivatives

Synthetic cytosine analogs serve as antiviral and chemotherapeutic agents, with development histories linked to pharmaceutical companies like Roche and GlaxoSmithKline and to academic translational research at University College London. Modified cytosines such as 5-fluorocytosine and 5-azacytidine have clinical applications against fungal infections and hematologic malignancies studied in trials at Mayo Clinic and MD Anderson Cancer Center. In synthetic biology, engineered bases and cytosine derivatives are components of expanded genetic alphabets investigated in laboratories including Synthetic Genomics and Harvard Wyss Institute. Chemical synthesis and structural modification techniques trace back to methodologies used by groups at ETH Zurich and California Institute of Technology.

Category:Nucleobases