LLMpediaThe first transparent, open encyclopedia generated by LLMs

Google Persistent Disk

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Docker Hop 4
Expansion Funnel Raw 71 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted71
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Google Persistent Disk
NameGoogle Persistent Disk
DeveloperGoogle LLC
Released2013
Operating systemLinux, Windows
PlatformGoogle Cloud Platform
LicenseProprietary

Google Persistent Disk is a block storage service provided by Google LLC for virtual machines and containerized workloads on Google Cloud Platform. It offers durable, network-attached volumes that can be attached to Google Compute Engine instances, Google Kubernetes Engine clusters, and other Google Cloud Platform services. Persistent Disk integrates with Google networking, identity, and management products to support production, backup, and analytics workloads.

Overview

Persistent Disk presents network-attached block devices to compute instances and integrates with Google Compute Engine, Google Kubernetes Engine, Anthos, Cloud Run, and Cloud Functions ecosystems. It supports multiple disk types—standard persistent disks, solid-state persistent disks, and regional persistent disks—aligned with Google Cloud Storage, BigQuery, Cloud Spanner, and Cloud SQL for data workflows. Persistent Disk is used alongside Google services such as Cloud IAM, Cloud Monitoring, Cloud Logging, Cloud Pub/Sub, and Cloud Storage for operations, observability, and data movement.

Architecture and features

Persistent Disk uses a distributed storage architecture built on software-defined storage and replication primitives similar to those used by Google File System and other internal systems. It provides features like snapshotting, cloning, encryption-by-default, and live resizing that interoperate with Compute Engine Images, Instance Templates, Managed Instance Groups, and Cloud Deployment Manager. Snapshots integrate with Cloud Storage for cross-region copying and lifecycle management and work with Cloud Build and Container Registry for image pipelines. Persistent Disk supports both persistent and local SSDs; local SSDs are ephemeral and complement Persistent Disk in high IOPS scenarios, interoperating with Locality-Aware Scheduling techniques used in Kubernetes on Google Kubernetes Engine.

Performance and scaling

Performance scales by disk type, size, and attachment mode; larger PD-SSD or PD-HDD volumes provide higher IOPS and throughput, and zonal or regional disks influence latency and redundancy similar to replication patterns in Spanner and Bigtable. For horizontal scaling, Persistent Disk can be read-shared across multiple Compute Engine instances in read-only mode and used with multi-attach PDs for select workloads, analogous to shared-disk designs in VMware vSphere, Microsoft Azure Disk Storage, and Amazon EBS. Benchmarks often compare Persistent Disk performance against alternatives like Amazon EBS, Azure Managed Disks, and on-premises SANs used by enterprises such as Netflix, Spotify, and Airbnb for varying workload profiles. Autoscaling of attached instances via Managed Instance Groups leverages Persistent Disk snapshots for rapid instance provisioning.

Security and durability

Persistent Disk encrypts customer data at rest by default using Google-managed keys and integrates with Cloud Key Management Service for customer-supplied or customer-managed encryption keys and with Cloud HSM for hardware-backed key storage. Access control ties into Cloud IAM roles and service accounts used by Compute Engine and Google Kubernetes Engine. Durability is achieved through replication across storage servers and zones, reflecting design principles seen in Colossus and Google File System; regional persistent disks replicate synchronously across zones similar to cross-zone replication in Amazon RDS and Azure Site Recovery. Compliance programs and attestations for Persistent Disk align with certifications held by Google Cloud, which include standards like ISO/IEC 27001 and SOC 2.

Management and administration

Administrators manage Persistent Disk via the Google Cloud Console, gcloud CLI, and RESTful APIs exposed through Cloud Resource Manager and Cloud Billing. Lifecycle operations include creating, formatting, attaching, detaching, snapshotting, cloning, resizing, and deleting disks; these operations can be automated with Cloud Deployment Manager, Terraform, Ansible, and Puppet. Monitoring and alerting integrate with Cloud Monitoring (formerly Stackdriver) and log export to Cloud Logging for audit trails consumed by SIEMs used by organizations such as Splunk, IBM QRadar, and Elastic Stack.

Use cases and integration

Persistent Disk supports databases, analytics, CI/CD runners, and stateful services: common pairings include Cloud SQL instances backing transactional systems, PostgreSQL and MySQL deployments on Compute Engine, and NoSQL systems like MongoDB and Cassandra on VMs. For containerized stateful workloads, Persistent Disk integrates with Kubernetes through PersistentVolumes and PersistentVolumeClaims, enabling StatefulSets and operators used by projects such as Prometheus and EFK stacks. Backup and DR strategies use snapshots and replication in conjunction with Cloud Storage and orchestration tools like Velero. Big data workflows often combine Persistent Disk with Dataproc, Dataflow, and BigQuery for ETL and analytics.

Pricing and limits

Pricing varies by disk type (standard, SSD, regional SSD), provisioned size, and snapshot storage; it aligns competitively with offerings from Amazon Web Services and Microsoft Azure. Performance limits include per-disk IOPS and throughput caps and per-instance aggregate limits; quotas and regional availability are managed through Cloud Console quota pages and can be increased by contacting Google Cloud Support. Administrative limits, such as maximum snapshot counts and API request rates, are documented in Google Cloud product guides and affect planning for enterprises such as Twitter, Pinterest, and Shopify that run large-scale services.

Category:Google Cloud Platform