LLMpedia: The first transparent, open encyclopedia generated by LLMs

Cloud Storage

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.

Cloud storage is a model of computer data storage in which digital data is stored in logical pools spread across multiple servers, typically owned and managed by a hosting company. These providers operate large data center facilities and are responsible for keeping the data available and accessible over a computer network, primarily the Internet. Users and organizations buy or lease storage capacity from them to store user, application, or enterprise data.

Overview

The fundamental concept is abstraction: providers such as Amazon Web Services or Microsoft Azure manage the physical hardware and present it to customers as a virtualized storage resource. The model evolved from earlier utility computing and time-sharing ideas pioneered by companies like IBM. It relies heavily on technologies such as server virtualization and is a core component of modern cloud computing, usually classified as infrastructure as a service, alongside software as a service and platform as a service. The National Institute of Standards and Technology (NIST) definition of cloud computing, published as Special Publication 800-145, helps standardize understanding of these service models.

Types of cloud storage

Services are broadly categorized by their access model and primary use case. Object storage, such as Amazon S3, manages data as discrete units called objects, each stored under a key together with its metadata, and is well suited to unstructured data like multimedia. File storage organizes data in a hierarchical file system, often accessed via protocols like Server Message Block or Network File System, and is offered by services like Google Cloud Filestore. Block storage, provided by platforms like Microsoft Azure Disk Storage, exposes storage as raw blocks that behave much like a direct-attached hard disk drive, and is commonly used for databases. Specialized forms include cloud storage gateway devices and services tailored for backup and disaster recovery.
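
The difference between the object and block access models can be made concrete with a small in-memory sketch. It is an illustration only, not any provider's API: the class names ObjectStore and BlockDevice and their methods are hypothetical, and file storage is omitted for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class ObjectStore:
    """Toy object store: a flat namespace of keys mapping to whole objects."""
    objects: dict = field(default_factory=dict)

    def put(self, key: str, data: bytes, **metadata: str) -> None:
        # An object is written and replaced as a whole, together with its metadata.
        self.objects[key] = (bytes(data), dict(metadata))

    def get(self, key: str) -> bytes:
        return self.objects[key][0]

    def metadata(self, key: str) -> dict:
        return self.objects[key][1]


class BlockDevice:
    """Toy block device: fixed-size blocks addressed by index, no metadata."""

    def __init__(self, block_count: int, block_size: int = 512) -> None:
        self.block_size = block_size
        self.blocks = [bytes(block_size) for _ in range(block_count)]

    def write_block(self, index: int, data: bytes) -> None:
        if len(data) > self.block_size:
            raise ValueError("data does not fit in one block")
        # Pad with zero bytes so every block keeps the same size.
        self.blocks[index] = data.ljust(self.block_size, b"\x00")

    def read_block(self, index: int) -> bytes:
        return self.blocks[index]


# Object access: the caller names a key and gets the whole object back.
store = ObjectStore()
store.put("videos/intro.mp4", b"...video bytes...", content_type="video/mp4")

# Block access: the caller addresses a numbered block; any structure
# (file system, database pages) is layered on top by the client.
disk = BlockDevice(block_count=8, block_size=16)
disk.write_block(3, b"db page")
```

In this sketch the object store knows what each item is (a key plus metadata), while the block device only knows positions, which is why block storage is usually paired with a file system or database engine on the client side.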

Architecture and services

Infrastructure is built on massively scalable data centers that combine commodity hardware with distributed storage software and, in some deployments, storage area network technologies. Redundancy is achieved through techniques like data replication across multiple availability zones and, for some services, across geographic regions. Core services often include content delivery network integration for performance, and application programming interfaces, such as RESTful APIs, for programmatic control. Management features include versioning, data lifecycle management policies, and integration with identity and access management systems, often using authorization frameworks like OAuth.
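
Replication across zones can be sketched as follows. This is a deliberately simplified model under stated assumptions: each zone is an in-memory dictionary, writes go synchronously to every reachable zone, and the names ReplicatedStore, fail_zone, and the zone labels are hypothetical. Real services add consistency protocols, repair of stale replicas, and often erasure coding, none of which is shown here.

```python
class ReplicatedStore:
    """Toy store that keeps a full copy of every object in each zone."""

    def __init__(self, zones):
        # One in-memory dict per zone stands in for a separate facility.
        self.zones = {zone: {} for zone in zones}
        self.offline = set()

    def put(self, key, data):
        # Synchronous replication: write to every zone that is reachable.
        written = 0
        for zone, contents in self.zones.items():
            if zone not in self.offline:
                contents[key] = data
                written += 1
        if written == 0:
            raise RuntimeError("no zone available for write")
        return written

    def get(self, key):
        # Read from the first reachable zone that holds the object.
        for zone, contents in self.zones.items():
            if zone not in self.offline and key in contents:
                return contents[key]
        raise KeyError(key)

    def fail_zone(self, zone):
        # Simulate losing a whole zone.
        self.offline.add(zone)


store = ReplicatedStore(["zone-a", "zone-b", "zone-c"])
copies = store.put("reports/q1.csv", b"id,total\n1,42\n")

# The object stays readable after one zone is lost.
store.fail_zone("zone-a")
data = store.get("reports/q1.csv")
```

With three copies, the object in this sketch survives the loss of any two zones; it becomes unreadable only when every zone holding a copy is offline.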

Security and privacy

Providers implement multiple security layers, including encryption of data at rest, typically with symmetric ciphers such as the Advanced Encryption Standard, and encryption of data in transit using protocols like Transport Layer Security. Key management can be handled by the provider or the customer through services like AWS Key Management Service. Compliance with regulations such as the General Data Protection Regulation and the Health Insurance Portability and Accountability Act, and with standards such as ISO/IEC 27001, is critical. Challenges include data sovereignty laws, potential government access requests, and securing account credentials and API keys against compromise, as highlighted by incidents involving Dropbox or iCloud.
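
One common way to combine encryption at rest with separate key management is envelope encryption: each object is encrypted with its own data key, and only a wrapped (encrypted) copy of that data key is stored next to the object. The sketch below shows the structure of that pattern, which is an assumption added for illustration rather than something the article attributes to a specific provider. The cipher is a toy built from SHA-256 so that the example runs with the standard library alone; it is not secure, and all function names are hypothetical.

```python
import hashlib
import secrets


def keystream_xor(key: bytes, nonce: bytes, data: bytes) -> bytes:
    """TOY cipher for illustration only: XOR data with a SHA-256 keystream.

    Not authenticated and not reviewed; real systems use a vetted
    AEAD cipher such as AES-GCM from a maintained library.
    """
    out = bytearray()
    for offset in range(0, len(data), 32):
        counter = (offset // 32).to_bytes(8, "big")
        block = hashlib.sha256(key + nonce + counter).digest()
        chunk = data[offset:offset + 32]
        out.extend(b ^ k for b, k in zip(chunk, block))
    return bytes(out)


def encrypt_object(master_key: bytes, plaintext: bytes) -> dict:
    # 1. Generate a fresh data key for this object only.
    data_key = secrets.token_bytes(32)
    data_nonce = secrets.token_bytes(16)
    key_nonce = secrets.token_bytes(16)
    return {
        # 2. Encrypt the object with the data key.
        "ciphertext": keystream_xor(data_key, data_nonce, plaintext),
        "data_nonce": data_nonce,
        # 3. Store the data key only in wrapped (encrypted) form.
        "wrapped_key": keystream_xor(master_key, key_nonce, data_key),
        "key_nonce": key_nonce,
    }


def decrypt_object(master_key: bytes, record: dict) -> bytes:
    # Unwrap the data key with the master key, then decrypt the object.
    data_key = keystream_xor(master_key, record["key_nonce"], record["wrapped_key"])
    return keystream_xor(data_key, record["data_nonce"], record["ciphertext"])


master_key = secrets.token_bytes(32)  # held by the key-management side
record = encrypt_object(master_key, b"patient record 1234")
restored = decrypt_object(master_key, record)
```

The design point is that the stored record never contains the master key: whoever holds it, provider or customer, controls access to every object, and it can be rotated by re-wrapping the small data keys instead of re-encrypting the objects themselves.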

Advantages and disadvantages

Primary benefits include elasticity: capacity can grow or shrink on demand without upfront capital expenditure, which converts storage costs into operational expenditure. Cloud storage also enhances business continuity through built-in redundancy and supports remote work by underpinning collaboration software like Slack or Microsoft Teams. Drawbacks include vendor lock-in, since migrating data between providers such as Google Cloud Platform and IBM Cloud can be complex; the potential for data breaches; recurring costs that accumulate over time; and dependence on Internet bandwidth and network latency, which can limit performance.
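
The dependence on bandwidth and latency can be estimated with simple arithmetic. The function below is a rough sketch that assumes sequential requests and ignores protocol overhead, parallel connections, and throttling; the function name, the link speed, and the round-trip time are illustrative inputs, not measurements.

```python
def transfer_time_seconds(size_bytes: float, bandwidth_bits_per_s: float,
                          round_trip_s: float = 0.0, requests: int = 1) -> float:
    """Estimate transfer time as serialization time plus per-request latency."""
    if bandwidth_bits_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    serialization = size_bytes * 8 / bandwidth_bits_per_s
    latency = round_trip_s * requests
    return serialization + latency


# 1 TB (10**12 bytes) over a 100 Mbit/s link, ignoring latency.
bulk = transfer_time_seconds(10**12, 100e6)
print(f"{bulk / 3600:.1f} hours")  # 22.2 hours

# 10,000 objects of 10 kB each, fetched one at a time with a 50 ms round trip.
small = transfer_time_seconds(10_000 * 10_000, 100e6,
                              round_trip_s=0.05, requests=10_000)
print(f"{small:.0f} seconds")  # 508 seconds
```

The two cases show different bottlenecks: the bulk transfer is limited by bandwidth, while the many small requests spend about 500 of their 508 seconds waiting on round trips, which is why batching or parallel requests matter for small objects.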

Market and providers

The market is dominated by hyperscale providers, with Amazon Web Services holding a significant share through Amazon S3. Other major competitors include Microsoft Azure, Google Cloud Platform, and Alibaba Cloud. Specialized or legacy players include IBM Cloud, Oracle Cloud, and Dropbox. The market is shaped by price wars, continuous feature innovation, and strategic relationships with large customers such as Netflix or NASA. Regional providers also exist to address specific data sovereignty requirements.

Category:Cloud computing Category:Computer storage