LLMpediaThe first transparent, open encyclopedia generated by LLMs

Amazon S3 Glacier

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Expansion Funnel Raw 59 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted59
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Amazon S3 Glacier
NameAmazon S3 Glacier
DeveloperAmazon Web Services
Released2012
Operating systemCross-platform
LicenseProprietary

Amazon S3 Glacier is a cloud storage service for long-term data archiving and backup designed by Amazon Web Services. It provides low-cost, durable storage optimized for infrequently accessed data and integrates with a broad ecosystem of applications and services for data lifecycle management. Used by enterprises, research institutions, and media companies, the service emphasizes cost efficiency, durability, and integration with other cloud-native offerings.

Overview

Amazon S3 Glacier is positioned as an archival tier within the Amazon Web Services portfolio alongside Amazon Simple Storage Service, Amazon Elastic Compute Cloud, Amazon RDS, AWS Backup. It targets workloads such as compliance archives, digital preservation for Library of Congress, media asset archives for Netflix, and scientific datasets for NASA projects. The offering competes with archival services from Google Cloud Storage, Microsoft Azure Blob Storage, and vendors like Backblaze and Iron Mountain for long-term retention and regulatory archiving.

Features and Storage Classes

The service exposes multiple storage classes and capabilities including retrieval options, data lifecycle policies, and vaults for organizational isolation. It integrates with Amazon S3 lifecycle rules to transition objects from frequent-access tiers to archival tiers such as S3 Glacier flexible retrieval and S3 Glacier Deep Archive. Key features connect with AWS Identity and Access Management for access control, AWS Key Management Service for encryption, Amazon CloudWatch for monitoring, and AWS CloudTrail for audit logging. Use cases often involve media preservation for BBC, genomic datasets for Broad Institute, or financial records for institutions like Goldman Sachs.

Pricing and Retrieval Tiers

Pricing is structured around storage capacity, retrieval rates, and per-request fees, with different tiers for expedited, standard, and bulk retrievals. Typical customers compare cost-performance trade-offs against offerings from Google Cloud, Microsoft Azure, and traditional tape libraries from IBM and HPE. Retrieval tiers influence latency and throughput targets relevant to organizations such as CERN and European Space Agency that manage large scientific archives. Cost management commonly leverages AWS Cost Explorer and tagging strategies used by enterprises like Airbnb and Spotify.

Security, Compliance, and Durability

Security features include server-side encryption integrated with AWS Key Management Service and client-side encryption patterns used by providers like Veeam and Commvault. Compliance certifications and attestations align with frameworks used by HIPAA-covered entities, SOC 2 examinations, and ISO/IEC 27001-certified environments supporting customers such as Pfizer and Johnson & Johnson. Durability claims are benchmarked against archival expectations from institutions like National Archives and Records Administration and align with redundancy models similar to practices at Dropbox and Box.

Integration and Tools

Ecosystem integrations enable ingestion, management, and analytics via tools and services including AWS Lambda for event-driven workflows, Amazon S3 Glacier Select for in-place querying, and third-party solutions from Veritas, Commvault, Rubrik, and Druva. Backup orchestration platforms from Veeam and orchestration partners like ServiceNow and Splunk often integrate Glacier storage classes into retention policies. Data transfer options span AWS Snowball, AWS DataSync, and network upload strategies employed by media customers such as Walt Disney Studios and broadcasters like CBS.

Performance and Limitations

Performance characteristics emphasize high durability over low latency; retrieval times vary by tier and are influenced by object size and service quotas. Limitations include retrieval latency for deep archive tiers compared with hot storage used by Twitter or LinkedIn, potential egress costs relevant to global businesses like Uber, and lifecycle transition delays when automating archival from active buckets. Architectural patterns from companies like Netflix and Spotify illustrate combining object storage with CDN layers such as Amazon CloudFront for performance-sensitive delivery.

History and Evolution

Launched in 2012 by Amazon Web Services, the archival offering evolved with subsequent additions such as Glacier retrieval tiers, integration with Amazon S3 lifecycle management, and the introduction of S3 Glacier Deep Archive to address ultra-low-cost retention needs. Over time, the service expanded alongside AWS features like AWS Lambda, Amazon S3 Select, and enhanced compliance attestations to meet demands from research bodies like NOAA and media archives at The New York Times. Ongoing enhancements mirror trends in cloud archiving adopted by enterprises including General Electric and Siemens.

Category:Cloud storage