LLMpediaThe first transparent, open encyclopedia generated by LLMs

OpenDocument

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Microsoft Word Hop 4
Expansion Funnel Raw 43 → Dedup 17 → NER 11 → Enqueued 9
1. Extracted43
2. After dedup17 (None)
3. After NER11 (None)
Rejected: 6 (not NE: 6)
4. Enqueued9 (None)
Similarity rejected: 1
OpenDocument
NameOpenDocument
Extension.odt, .ods, .odp, .odg, .odf, .odb
Mimeapplication/vnd.oasis.opendocument.*
DeveloperOASIS
Released01 May 2005
Latest release version1.3
Latest release date17 December 2020
GenreDocument file format
Container forXML, ZIP (file format)
Extended fromXML
StandardISO/IEC 26300, OASIS Standard

OpenDocument. It is an XML-based, open file format for office applications, standardized by OASIS and later by the ISO and the IEC. The format is designed to enable interoperability and long-term access to documents, independent of specific software or vendors, and is widely implemented in free and open-source software and proprietary applications. Its development was heavily influenced by the XML format used in OpenOffice.org and its descendants like Apache OpenOffice and LibreOffice.

Overview

The primary goal was to create a vendor-neutral alternative to proprietary formats like those from Microsoft Office, ensuring documents remain accessible over time. It encompasses a family of formats for different document types, including text, spreadsheets, presentations, and graphics, all using a common ZIP (file format) container to bundle XML content and resources. Key principles include separation of content, styles, metadata, and application settings, promoting reuse and processing by different tools. The format's architecture was significantly shaped by the existing implementation within the OpenOffice.org project, which provided the initial technical foundation.

Technical specifications

A file is essentially a ZIP (file format) archive containing several XML files and a manifest. The core `content.xml` file holds the actual text and data, while `styles.xml` defines formatting, and `meta.xml` contains information like the author and creation date, following the Dublin Core metadata standards. It supports advanced features like mathematical formulas using MathML, digital signatures via XML Signature, and complex formatting through styles. The specification defines RDF-based metadata embedding for extended semantics, and later versions, such as the one approved by the Joint Technical Committee 1 of ISO and IEC, added improvements for accessibility and encryption.

Adoption and support

It is the native format for major free and open-source software suites including LibreOffice, Apache OpenOffice, and Calligra Suite. Several governments and institutions, such as the European Union and the Government of Belgium, have mandated its use for document exchange to ensure long-term preservation. While Microsoft added support to Microsoft Office starting with Office 2007, implementation has sometimes been inconsistent. Other supporting applications include Google Docs, Apple's Pages (word processor), and the KDE-based Okular document viewer, broadening its ecosystem.

Standardization and development

The technical committee at OASIS, with members from Sun Microsystems, IBM, and Google, among others, developed the initial standard. It was subsequently submitted to the ISO and the IEC under their fast-track process, becoming ISO/IEC 26300 in 2006. Further development has continued within OASIS, with version 1.3 achieving OASIS Standard status and also being published as ISO/IEC 26300-1:2020. The Document Foundation, stewards of LibreOffice, plays a significant role in implementing new versions and promoting the format.

Comparison with other formats

Compared to the binary DOC (computing) format from Microsoft, it offers greater transparency and recoverability due to its XML structure. Its main competitor, the Office Open XML format standardized as ISO/IEC 29500, is also XML-based and ZIP (file format)-packaged but was developed primarily by Microsoft and has a different technical architecture. While PDF is superior for fixed-layout distribution, it is less suitable for editing. The Uniform Office Format, promoted in China, represents another XML-based alternative, though with less international adoption outside specific regions.