Generated by GPT-5-mini| IETF BCP 47 | |
|---|---|
| Name | BCP 47 |
| Othernames | Best Current Practice 47 |
| Issued | 1997 |
| Status | Internet Standard |
| Authors | IETF |
| Related | RFC 5646, RFC 4646, RFC 3066 |
IETF BCP 47
IETF BCP 47 is an Internet standards document that specifies the construction and usage of language tags for identifying human languages and related orthographic, regional, and script variants. It serves as a normative framework used across protocols and systems developed by organizations such as Internet Engineering Task Force, World Wide Web Consortium, Unicode Consortium, European Commission, and International Organization for Standardization. Implementations span software developed by companies like Apple Inc., Google LLC, Microsoft, Mozilla Foundation, and services operated by Amazon (company), Facebook, Twitter.
BCP 47 provides a common registry and formal syntax that interoperates with standards maintained by ISO 639, ISO 15924, ISO 3166-1, and IANA, enabling consistent language identification in protocols like HTTP, SMTP, LDAP, XMPP, and formats such as HTML, XML, JSON. The specification complements character encoding and collation work from Unicode Consortium and locale conventions reflected in projects like POSIX and Common Locale Data Repository. It is referenced in technical efforts by European Union institutions, internationalization projects at W3C Internationalization (i18n) Activity, and multilingual service deployments at United Nations agencies.
The syntax defines hierarchical subtags including primary language, extlang, script, region, variant, extensions, and private-use, aligning with registries from ISO 639-3, ISO 15924, and ISO 3166-1 alpha-2. Primary language subtags map to entries in registries used by bodies such as Library of Congress, Ethnologue, and SIL International, while script subtags reference listings maintained by Unicode Consortium and ISO. Region subtags correspond to codes created by International Organization for Standardization and geopolitical delineations used by organizations like United Nations Statistical Commission and European Commission for regional policy. Extension subtags allow interoperability with tags used in specifications by IETF Working Group outputs and application profiles produced by bodies like OASIS and IETF Applications Area.
Registration of subtags is managed via IANA registries, with contributions and change control processes involving stakeholders such as IETF Working Group, IANA, W3C, Unicode Consortium, and national bodies like British Standards Institution, American National Standards Institute, and Deutsche Institut für Normung. The registry records preferred values, deprecated entries, and grandfathered tags similar to archival practices at institutions like Library of Congress and National Archives and Records Administration. Requesters for new subtags may include software vendors like Red Hat, academic groups at Massachusetts Institute of Technology, and NGOs such as Mozilla Foundation and Creative Commons.
BCP 47 tags are embedded in web technologies including HTML5, CSS, SVG, and server protocols like HTTP/1.1 and HTTP/2, and are used by platforms such as YouTube, Wikipedia, WordPress, and Drupal for content negotiation and localization. They inform localization workflows in tools from Adobe Systems, Atlassian, and SAP SE, and are critical in machine translation pipelines developed by Google Translate, DeepL, and research groups at Stanford University, Massachusetts Institute of Technology, and Carnegie Mellon University. Accessibility frameworks from World Health Organization and standards bodies like W3C Web Accessibility Initiative incorporate language tags for text-to-speech, captioning, and internationalized metadata handling.
Implementations appear in operating systems such as Android (operating system), iOS, Windows NT, and macOS, and in libraries like ICU (software), libxml2, glibc, and Boost. Interoperability testing and conformance rely on test suites and profiles shared by consortia including W3C, IETF, and academic consortia at European Research Consortium for Informatics and Mathematics. Vendors coordinate via forums like IETF Meetings, W3C Technical Architecture Group, and standards workshops at institutions like MITRE Corporation and Internet Society.
BCP 47 evolved from earlier RFCs such as RFC 3066 and RFC 4646 and was consolidated in later documents like RFC 5646 and RFC 4647, with editorial and procedural input from the IETF Applications Area, IETF Language Tagging Working Group, and contributors from Microsoft Research, Google Research, Apple Inc., and Mozilla Foundation. The revision history reflects interactions with international standards like ISO 639 and registration practices coordinated through IANA and community discussions at IETF Hackathons, W3C Conferences, and academic symposiums at ACL (Association for Computational Linguistics) and LREC.
Category:Internet standards