LLMpediaThe first transparent, open encyclopedia generated by LLMs

ISO 639-2

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Inuit languages Hop 4
Expansion Funnel Raw 53 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted53
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
ISO 639-2
Number639-2
TitleCodes for the representation of names of languages — Part 2: Alpha-3 code
Version1998
OrganizationInternational Organization for Standardization
CommitteeISO/TC 37
RelatedISO 639-1, ISO 639-3, ISO 639-5

ISO 639-2. It is the second part of the International Organization for Standardization's comprehensive ISO 639 series, which provides standardized codes for identifying the world's languages. Published in 1998, this standard introduced three-letter alpha-3 codes, significantly expanding the coverage beyond the earlier two-letter system. It is maintained by the ISO/TC 37 technical committee and serves as a foundational resource for library cataloging, linguistics, and information technology systems worldwide.

Overview

The development of this standard was driven by the needs of major institutions like the Library of Congress and international bibliographic agencies to manage materials in a vast array of languages. It was created to address the limitations of its predecessor, ISO 639-1, which could not accommodate the thousands of languages used globally. The standard includes codes for individual languages, macrolanguages, and collective groups, such as those for Germanic languages or Bantu languages. Its publication represented a critical advancement for information science and digital library initiatives, enabling more precise data exchange across systems like the MARC standards.

Code structure

The standard defines three-letter, lowercase alphabetic codes, which provide a much larger namespace than the two-letter system. Codes are assigned based on the native name of a language, often derived from its name in English or French. A key feature is the inclusion of "bibliographic codes" (B codes) and "terminology codes" (T codes) for a select group of languages; for instance, the language of Gerhard Richter and Immanuel Kant has different codes for cataloging versus terminology purposes. This distinction primarily affects languages like those covered in the Duden or the Académie Française, ensuring compatibility between library systems and linguistic terminology databases.

Relationship to other ISO 639 parts

This standard exists within a larger family. It was the direct successor to ISO 639-1, which it encompasses and extends. Later, ISO 639-3 was developed to provide codes for all known individual languages, drawing heavily from its codes as a base. Another related standard, ISO 639-5, was created for language families and groups, organizing entities like the Slavic languages or Indo-Aryan languages. The entire suite is managed under the auspices of ISO/TC 37, with registration authority functions handled by the Library of Congress in collaboration with the International Information Centre for Terminology.

Usage and applications

Its codes are ubiquitously used in global systems for metadata tagging and resource discovery. Major applications include library catalog systems worldwide, such as those operated by the British Library and the Bibliothèque nationale de France. In the realm of information technology, the codes are implemented in internet standards like RFC 5646 for language tags and are supported by software from entities like Microsoft and the World Wide Web Consortium. They are also crucial for the work of organizations such as UNESCO and SIL International in language documentation and preservation projects, including those for endangered languages in regions like the Amazon basin and Siberia.

Changes and maintenance

The standard is a living document, with updates managed by the ISO 639-2/RA Registration Authority, which is based at the Library of Congress. Changes, such as the addition of new codes or the deprecation of old ones, are proposed through a formal process involving experts from bodies like the International Federation of Library Associations and Institutions. For instance, the evolution of codes for languages in the Balkans or for creole languages in the Caribbean reflects ongoing linguistic research. All amendments are published in official registers and disseminated to ensure consistency across systems used by the European Union and national archives like the National Archives and Records Administration.

Category:ISO standards Category:ISO 639 Category:Technical communication