LLMpediaThe first transparent, open encyclopedia generated by LLMs

RAR

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: SumatraPDF Hop 5
Expansion Funnel Raw 71 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted71
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
RAR
NameRAR
DeveloperEugene Roshal
Introduced1993
Filename extensions.rar, .rev
Mime typeapplication/x-rar-compressed
OpenPartial
Latest releaseWinRAR 6.x (proprietary)

RAR is a proprietary archive file format developed to provide high-compression, multi-volume archiving and robust recovery features. Created by Eugene Roshal in the early 1990s, it became closely associated with the WinRAR application while influencing compression research and utilities across platforms. The format supports advanced features such as solid compression, recovery records, and optional encryption, and has been implemented in various programs and libraries for Microsoft Windows, Linux, macOS, FreeBSD, and other environments.

History

Eugene Roshal designed RAR in 1993 during a period of rapid development in archive formats alongside contemporaries such as ZIP (file format), ARJ, LHA (file format), and tar. Early adoption in Microsoft Windows was accelerated by WinRAR, which competed with tools like PKZIP, 7-Zip, and WinZip. Throughout the 1990s and 2000s RAR grew in popularity for distributing software and large datasets on BBSes, peer-to-peer networks such as BitTorrent, and file-hosting services like RapidShare and Mega. The format evolved alongside standards and controversies involving patent and licensing disputes in the period that also saw legal cases such as Sony BMG copy protection scandal and debates over proprietary codecs represented by RealNetworks litigation. Notable milestones include introduction of RAR5 in 2013, which brought structural changes comparable to generational shifts seen in formats like PNG and JPEG 2000.

File Format and Compression Features

RAR archives use a proprietary container structure with file headers, compressed data blocks, optional dictionaries, and recovery records, conceptually similar to multipart container designs found in MPEG-4 and Matroska (container format). The format supports solid compression, which concatenates multiple files into a single compression context for improved ratios—an approach comparable to techniques in bzip2 and xz (file format). RAR implements LZ-based and PPM-style compression variants across revisions; RAR5 expanded dictionary sizes and improved metadata handling akin to enhancements in zstd and Brotli. Multi-volume archives enable spanning across removable media such as CD-ROM and DVD, and support for split archives mirrors features in archive managers like RARLab WinRAR and 7-Zip. Archive metadata stores timestamps, attributes, and pathname encodings interoperable with filesystems like NTFS, exFAT, ext4, and HFS+.

Encryption and Security

RAR offers optional header and file body encryption. Earlier RAR revisions used a proprietary cipher and CRC-based integrity checks, while RAR5 introduced more robust cryptographic primitives including AES-256 in CBC mode and improved key derivation functions that parallel practices in OpenSSL and GnuTLS. Recovery records and parity volumes provide resilience against corruption, conceptually related to error correction work in Reed–Solomon implementations used by CD-ROM and RAID. Security concerns have centered on closed-source implementations, prompting third-party projects and audits by communities familiar with GNU Privacy Guard and OpenPGP practices. Vulnerabilities historically exploited in decompression routines led to advisories similar to those published by CERT Coordination Center and US-CERT for other archive utilities.

Implementations and Software

WinRAR by RARLab is the canonical Windows implementation; related command-line tools and ports exist for Linux (unrar, rar), macOS (RAR for macOS), and numerous BSDs. Open-source implementations include unrar (limited due to licensing), and third-party libraries appear in projects such as p7zip, libarchive, and bindings for languages like Python (programming language), Java (programming language), and C#. Integration into file managers mirrors that of Total Commander and GNOME Files, while server-side automation is supported by CI/CD tools including Jenkins and GitLab runners that handle packaged artifacts. Commercial software ecosystems such as Microsoft Azure, Amazon S3, and Google Cloud Storage may host RAR archives but typically rely on client-side tooling for extraction.

Usage and Integration

RAR is widely used for distributing software, game assets, multimedia collections, and datasets—roles also served by ISO image distribution and container formats like Docker (software). Its multi-volume and recovery features made it a favorite for physical-media transfers (e.g., across DVD‑R batches) and for segmented uploads to services such as MediaFire and Dropbox. Workflow integration commonly appears in backup scripts using PowerShell or Bash (Unix shell), in archival policies for institutions like The Library of Congress or National Archives and Records Administration when proprietary formats are permitted, and in content delivery for independent developers on platforms like Steam (service). Interoperability considerations often require conversion to open formats (for example, extracting RAR and re-archiving to ZIP (file format) or tar.gz) for long-term preservation.

The RAR algorithm and WinRAR software are proprietary; Eugene Roshal and RARLab control licensing for the official compressors and some binary libraries. A limited, source-available unrar source was once distributed under a restrictive license, leading to incompatibilities with permissive free-software definitions advocated by organizations such as the Free Software Foundation and Open Source Initiative. Legal disputes in software distribution echo broader cases involving Microsoft and Apple Inc. over proprietary formats, while antitrust and interoperability debates—similar to those involving Microsoft antitrust case and European Commission inquiries—highlight tensions between proprietary compression formats and open archival standards. Users and institutions often balance functional features against long-term access, favoring open formats for preservation when required by mandates like those from UNESCO or national archival policies.

Category:Archive formats