| Borg (cluster manager) | |
|---|---|
| Name | Borg |
| Developer | Google |
| Released | 2003 |
| Programming language | C++ |
| Platform | Linux |
| License | Proprietary |
| Website | |
Borg is a distributed cluster management system developed by Google to coordinate large-scale compute resources across its data centers. It schedules workloads, manages resources, and enforces isolation for services and batch jobs running on clusters composed of thousands of machines. Borg inspired subsequent orchestration systems, most notably Kubernetes, and influenced research and products across industry and academia.
Borg originated inside Google in the early 2000s as part of efforts to run services such as Gmail, Google Search, Google Maps, YouTube, and AdSense at scale across geographically distributed data centers. Its designers drew on earlier cluster systems such as Condor and on Google's batch infrastructure around MapReduce; Borg in turn predates open-source cluster managers such as Apache Mesos and Kubernetes. Borg provides multi-tenant isolation, shared cluster pooling, and automated placement of containerized workloads on machines running Linux. Google teams used Borg to run workloads for systems such as Bigtable, Spanner, and Dremel, and its development intersected with cluster-scheduling research at institutions such as Stanford University, MIT, and UC Berkeley.
Borg's architecture separates responsibilities across two main components: a logically centralized controller, the Borgmaster, whose processes are replicated for availability, and a per-machine agent called the Borglet. Borgmaster replicas elect a leader and persist state through a Paxos-based store, an approach also used by Google's Chubby lock service; ZooKeeper fills a similar role in open-source stacks. The Borgmaster maintains global state about machines, jobs, and tasks, while Borglets start, stop, and restart tasks and enforce isolation using Linux kernel mechanisms such as cgroups and namespaces. Cluster networking builds on Google systems such as the B4 wide-area network and the Jupiter data-center fabric, together with software-defined networking concepts explored by Nicira and in OpenFlow. Persistent scheduling state and metadata interact with storage systems including the Colossus file system, Spanner, and LevelDB-inspired designs.
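The division of labor between a central Borgmaster and per-machine Borglets can be illustrated with a toy model. The sketch below is purely illustrative: the class and method names (`Borgmaster`, `Borglet`, `submit`, `report`) are assumptions for exposition, not Borg's real interfaces.

```python
class Borglet:
    """Per-machine agent: starts tasks and reports local state (toy model)."""

    def __init__(self, machine):
        self.machine = machine
        self.running = {}          # task_id -> resource spec

    def start_task(self, task_id, spec):
        self.running[task_id] = spec

    def report(self):
        # A real Borglet reports machine health and task state to the master.
        return {"machine": self.machine, "tasks": sorted(self.running)}


class Borgmaster:
    """Central controller: owns desired state and gathers Borglet reports."""

    def __init__(self, borglets):
        self.borglets = {b.machine: b for b in borglets}
        self.assignments = {}      # task_id -> machine

    def submit(self, task_id, spec, machine):
        # Real Borg chooses the machine via its scheduler; here it is given.
        self.borglets[machine].start_task(task_id, spec)
        self.assignments[task_id] = machine

    def cluster_state(self):
        # The real Borgmaster polls each Borglet and reconciles divergence;
        # the toy version just collects reports.
        return [b.report() for b in self.borglets.values()]


master = Borgmaster([Borglet("m1"), Borglet("m2")])
master.submit("web.0", {"cpu": 0.5}, "m1")
print(master.cluster_state())
```

The key design point the sketch preserves is that global state lives in the master while per-machine enforcement lives in the agent, so a machine can keep running its tasks even if the master is briefly unavailable.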
Borg implements scheduling policies that combine bin-packing, priority, quota, and preemption. Workloads are expressed as jobs composed of tasks, each with resource requests (CPU, memory, disk, and accelerators) and placement constraints, a model later echoed by Kubernetes and Docker Swarm. Admission control enforces organization-level quotas and fair-share policies across teams, including those behind Chrome, Android, Google Ads, and Google Cloud Platform services. The scheduler handles heterogeneous hardware, accelerators such as TPUs and GPUs, and affinity and anti-affinity rules; comparable mechanisms appear in cluster managers operated by companies such as Facebook, Amazon Web Services, and Microsoft Azure.
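The bin-packing and priority ideas above can be sketched as a minimal best-fit scheduler. This is a deliberate simplification, with hypothetical names throughout: it packs on a single dimension (CPU), admits tasks in priority order, and omits preemption, quotas, and constraints.

```python
from dataclasses import dataclass, field

@dataclass
class Machine:
    name: str
    cpu: float                     # free CPU cores
    ram: int                       # free RAM, MiB
    tasks: list = field(default_factory=list)

@dataclass
class Task:
    job: str
    priority: int                  # higher wins; e.g. production > batch
    cpu: float
    ram: int

def schedule(task, machines):
    """Best-fit bin packing: among feasible machines, pick the one that
    leaves the least free CPU, keeping the cluster densely packed."""
    feasible = [m for m in machines if m.cpu >= task.cpu and m.ram >= task.ram]
    if not feasible:
        return None                # real Borg might preempt lower priority
    best = min(feasible, key=lambda m: m.cpu - task.cpu)
    best.cpu -= task.cpu
    best.ram -= task.ram
    best.tasks.append(task)
    return best

machines = [Machine("m1", cpu=4.0, ram=8192), Machine("m2", cpu=2.0, ram=4096)]
pending = sorted(
    [Task("web", 200, 1.0, 1024), Task("batch", 100, 2.0, 2048)],
    key=lambda t: -t.priority,     # admit high-priority tasks first
)
for t in pending:
    schedule(t, machines)
```

Running the sketch places "web" on the smaller machine (best fit) and "batch" on the larger one; a production scheduler scores many more dimensions and may evict lower-priority tasks instead of rejecting the new one.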
Borg emphasizes resilience through replication, automatic task restart, and rescheduling strategies influenced by prior work in fault-tolerant computing published in venues such as SOSP. Health checking uses periodic probes, a pattern later formalized as Kubernetes liveness and readiness checks, while load balancing ties into systems such as Maglev and, in later ecosystems, Envoy. Priority and preemption policies draw on scheduling theory and on operational lessons from production outages, including incidents at Google and Amazon. Borg's treatment of failures influenced control-plane designs in Google Cloud Platform and fault-containment patterns popularized by companies such as Netflix.
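The probe-and-restart pattern described above reduces to counting consecutive failed health checks and restarting once a threshold is crossed. The function name and default threshold below are illustrative assumptions, not Borg parameters.

```python
def supervise(probe_results, failure_threshold=3):
    """Count consecutive probe failures; signal a restart once the
    threshold is crossed, then reset (a toy liveness-probe supervisor)."""
    consecutive, restarts = 0, 0
    for healthy in probe_results:
        if healthy:
            consecutive = 0        # any success clears the failure streak
        else:
            consecutive += 1
            if consecutive >= failure_threshold:
                restarts += 1
                consecutive = 0    # the restarted task starts fresh
    return restarts

# Two separate runs of three failed probes each -> two restarts.
print(supervise([True, False, False, False, True, False, False, False]))
```

Requiring several consecutive failures before restarting is the usual guard against flapping: a single dropped probe on a loaded machine should not kill a healthy task.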
Borg runs a mix of long-running services, batch analytics, machine learning training, and ephemeral test workloads for products such as Google Photos, Google Play, and Google Assistant. It supports deployment pipelines tied to continuous integration, analogous to practices built around Travis CI and Jenkins, and integrates with monitoring via Borgmon, which later inspired the open-source Prometheus. Machine learning workloads use accelerators in clusters built for efforts such as AlphaGo and TPU research, while data-processing jobs interface with frameworks such as TensorFlow, MapReduce, Apache Spark, and Dremel.
Evaluations of Borg in academic and industrial publications report metrics such as utilization, job throughput, scheduling latency, and time to repair. Studies comparing Borg with systems such as Apache Mesos, Kubernetes, and Hadoop YARN attribute Borg's higher utilization and denser packing to its centralized scheduling heuristics and aggressive resource reclamation. Performance optimizations build on work from the Google File System (GFS), Bigtable, and Google's interconnect research. Empirical analyses published in venues such as OSDI, SOSP, and NSDI quantify improvements in resource efficiency, energy consumption, and cost amortization across Google's fleet.
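Utilization, the first metric named above, is in its simplest form the ratio of allocated resources to cell capacity, tracked per resource dimension. The numbers below are invented for illustration; real evaluations also distinguish allocation from actual usage to measure reclaimable slack.

```python
def utilization(allocated, capacity):
    """Per-resource fraction of capacity that is allocated (toy metric)."""
    return {r: allocated[r] / capacity[r] for r in capacity}

# Hypothetical cell: 1000 cores and 4096 GiB of RAM, mostly committed.
cell_capacity = {"cpu": 1000.0, "ram_gib": 4096.0}
cell_allocated = {"cpu": 780.0, "ram_gib": 3072.0}
print(utilization(cell_allocated, cell_capacity))
```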
Borg's practices and architecture directly inspired Kubernetes, an open-source orchestration system started by Google engineers and developed with contributors from Red Hat, VMware, and the wider CNCF community. Concepts from Borg informed cluster management in Google Cloud and shaped infrastructure at companies including Apple, Netflix, Twitter, LinkedIn, and Airbnb. Research groups at Stanford University, UC Berkeley, and MIT built on Borg's ideas in papers and courses, while vendors such as Amazon Web Services, Microsoft Azure, and IBM incorporated similar orchestration principles into their offerings. Borg's influence extends to standards and projects in the Cloud Native Computing Foundation ecosystem and to tools such as Helm, Istio, and Envoy.
Category:Google software