LLMpediaThe first transparent, open encyclopedia generated by LLMs

A Common Word

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Expansion Funnel Raw 95 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted95
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
A Common Word
NameA Common Word
Partofspeechword phrase
LanguageEnglish
Etymologysee text
Relatedsee text

A Common Word is an ordinary lexical item that appears frequently across texts and speech, notable for its ubiquity in corpora such as the Oxford English Dictionary, Corpus of Contemporary American English, and British National Corpus. It functions across registers from informal dialogues in BBC Radio broadcasts and The New York Times articles to academic prose in journals like Nature (journal) and The Lancet, and appears in canonical literature including works by William Shakespeare, Jane Austen, Charles Dickens, and James Joyce.

Etymology

The etymology of many high-frequency lexemes traces through language contact events involving Old English, Middle English, Norman conquest of England, Latin, Old Norse, and Franciscan Order borrowings. Historical records in manuscripts housed at institutions such as the British Library, Bodleian Library, and Vatican Library show shifts similar to those documented for terms in Beowulf, Domesday Book, and the writings of Geoffrey Chaucer. Comparative philology by scholars like Jacob Grimm, Friedrich Diez, and Noam Chomsky situates common lexemes within broader Indo-European developments connected to roots reconstructed in works by August Schleicher.

Definitions and Usage

Dictionaries including Merriam-Webster, Collins English Dictionary, and Cambridge Dictionary provide prescriptive and descriptive entries that distinguish senses used by authors from The Guardian and The Washington Post versus specialized senses in texts from Royal Society proceedings or reports by World Health Organization. Lexicographers such as Samuel Johnson and modern teams at Oxford University Press annotate polysemy observed in corpora compiled by projects like Google Books Ngram Viewer and databases maintained by Leipzig Corpora Collection.

Grammar and Morphology

Morphological patterns for common lexemes are analyzed in works by A. A. Milne and in transformational grammar frameworks developed by Noam Chomsky, Zellig Harris, and Ray Jackendoff. Inflectional paradigms appear in grammars of varieties documented by University of Cambridge and Harvard University linguists; syntax treatments in textbooks from MIT Press and Routledge illustrate argument structure and distribution across constructions found in HarperCollins editions and corpora indexed by Linguistic Data Consortium.

Semantic Variations and Synonyms

Synonymy and sense variation are charted in thesauri such as Roget's Thesaurus and in semantic networks like WordNet and projects funded by institutions including NSF and European Research Council. Cross-linguistic equivalents in French Academy, Real Academia Española, and Duden highlight shifts analogous to patterns in translations by HarperCollins Publishers of works by Leo Tolstoy, Fyodor Dostoevsky, and Gabriel García Márquez.

Cultural and Historical Significance

A high-frequency lexeme figures in the rhetoric of political figures from Abraham Lincoln and Franklin D. Roosevelt to Margaret Thatcher and Nelson Mandela, appears in manifestos such as The Communist Manifesto and speeches at events like the Nuremberg Trials and Paris Peace Conference (1919), and recurs in liturgical texts used by institutions like the Church of England, Vatican, and Al-Azhar University. Its presence in cultural artifacts spans cinema from Alfred Hitchcock to Akira Kurosawa, music by The Beatles and Bob Dylan, and visual art displayed at Tate Modern and Louvre.

Frequency and Corpus Studies

Quantitative studies in corpus linguistics by researchers at University of Oxford, Stanford University, Princeton University, and Max Planck Institute report Zipfian distributions for lexemes across corpora such as Google Books and the Penn Treebank. Statistical treatments using tools from R (programming language), Python (programming language), and packages developed at Carnegie Mellon University analyze frequency trajectories, collocation strengths, and changes tracked in databases curated by Library of Congress and National Archives.

Idioms, Phrases, and Collocations

Common lexemes anchor idioms recorded in collections like those by Oxford University Press and phrasebooks used by travelers to United States Department of State destinations and in subtitles for films distributed by Warner Bros. and Paramount Pictures. Collocational patterns feature in concordances produced by AntConc and in studies of multiword expressions published in journals such as Journal of Pragmatics and Language (journal), with examples drawn from plays performed at Royal National Theatre, novels published by Penguin Books, and speeches delivered at United Nations assemblies.

Category:Lexicography