| PVM (Parallel Virtual Machine) | |
|---|---|
| Name | PVM (Parallel Virtual Machine) |
| Developer | Oak Ridge National Laboratory; University of Tennessee; Emory University |
| Released | 1989 |
| Operating system | Unix; Linux; Microsoft Windows; macOS |
| Platform | x86; Alpha; SPARC; PowerPC |
| Genre | Message passing; Distributed computing |
| License | BSD-like; open source |
PVM (Parallel Virtual Machine) is a software system that enables a heterogeneous collection of computers to be used as a coherent and flexible distributed parallel processing resource. Originally developed at Oak Ridge National Laboratory, University of Tennessee, and Emory University in the late 1980s, the system provided a message-passing environment that supported research in high-performance computing, numerical simulation, and distributed applications. PVM influenced later standards and projects in parallel computing and remains historically significant for cluster computing, scientific computing, and computational science.
PVM was created so that researchers at institutions such as Oak Ridge National Laboratory, Lawrence Berkeley National Laboratory, and Los Alamos National Laboratory could aggregate heterogeneous workstations into a unified computational resource, serving projects funded through National Science Foundation initiatives, collaborations with NASA, and university consortia. The system exposed a virtual-machine abstraction that let developers run tasks across hardware from vendors such as Sun Microsystems, Intel, IBM, DEC, and HP, and across operating systems including Unix, Linux, and Microsoft Windows NT, in multi-institutional environments. PVM’s model was contemporaneous with, and complementary to, the Message Passing Interface effort, and it influenced distributed computing research associated with laboratories such as Argonne National Laboratory.
The PVM architecture combined a daemon-based runtime, a console for starting and monitoring tasks, and a library API to form a virtual machine spanning nodes that could include workstations, servers, and supercomputers from companies like Cray Research and SGI. Core components included the pvmd daemon, the pvm console, and a message-passing library implemented in C with bindings for Fortran and other languages; these components communicated over TCP/IP stacks derived from the Berkeley Software Distribution on systems from Sun Microsystems and DEC. Administrative tools and utilities allowed integration with cluster-management approaches developed at Lawrence Livermore National Laboratory and with practices from the Beowulf cluster movement. The design emphasized modularity, enabling interaction with batch systems developed by centers such as the National Center for Supercomputing Applications.
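In practice, the virtual machine described above was assembled through the pvmd daemons and the interactive `pvm` console. A typical session (host names and the task name `worker` are illustrative) might look like:

```text
$ pvm                   # start the console; launches a local pvmd if none is running
pvm> add node1 node2    # start pvmd daemons on two additional hosts
pvm> conf               # list the hosts that now form the virtual machine
pvm> spawn -4 worker    # start four copies of a task somewhere on the machine
pvm> halt               # shut down every pvmd and dissolve the virtual machine
```

Each `add` contacts the named host and starts a remote pvmd, which then joins the daemon-to-daemon network over which tasks and messages are routed.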
PVM provided a task- and message-based programming model in which a master process could spawn tasks, send and receive messages, and perform group operations through an API shaped by high-performance-computing conventions at institutions like Oak Ridge National Laboratory and universities such as the University of California, Berkeley and the Massachusetts Institute of Technology. The API offered functions for task creation, intertask communication, data marshalling, and synchronization, comparable in purpose to the MPI interfaces coordinated by groups at Argonne National Laboratory and the University of Tennessee. Language bindings for C and Fortran enabled scientific codes developed at centers such as Los Alamos National Laboratory, Sandia National Laboratories, and Princeton University to be ported to PVM with relative ease, and auxiliary tools supported profiling and debugging in the manner of utilities from the GNU Project ecosystem.
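A minimal master-side sketch of this model in C, using the standard PVM 3 library calls; it assumes PVM is installed and that a worker executable named `worker` (an illustrative name) has been built and placed where the virtual machine can find it:

```c
#include <stdio.h>
#include <pvm3.h>

#define MSGTAG 1  /* illustrative message tag */

int main(void)
{
    int tids[4];
    int n = 42, result;

    pvm_mytid();  /* enroll this process in the virtual machine */

    /* Spawn four copies of the (hypothetical) "worker" executable
       anywhere on the virtual machine. */
    int started = pvm_spawn("worker", NULL, PvmTaskDefault, "", 4, tids);
    if (started < 4)
        fprintf(stderr, "only %d of 4 workers started\n", started);

    /* Pack an integer into the default (XDR-encoded) send buffer and
       send it to each worker; XDR handles heterogeneous byte orders. */
    for (int i = 0; i < started; i++) {
        pvm_initsend(PvmDataDefault);
        pvm_pkint(&n, 1, 1);
        pvm_send(tids[i], MSGTAG);
    }

    /* Collect one integer reply from each worker; -1 accepts
       a message from any task. */
    for (int i = 0; i < started; i++) {
        pvm_recv(-1, MSGTAG);
        pvm_upkint(&result, 1, 1);
        printf("got %d\n", result);
    }

    pvm_exit();  /* leave the virtual machine */
    return 0;
}
```

The explicit pack/unpack steps (`pvm_pkint`/`pvm_upkint`) are the data-marshalling layer mentioned above: they convert values to a machine-independent encoding so that, for example, a SPARC master can exchange integers with an x86 worker.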
Performance studies of PVM were carried out by researchers at the University of Tennessee, Oak Ridge National Laboratory, and European centers including CERN to assess communication overhead, latency, and bandwidth on heterogeneous networks comprising hardware from Intel and IBM. Scalability experiments compared PVM clusters against dedicated parallel systems such as machines from Cray Research and architectures promoted by DARPA research programs, revealing trade-offs between flexibility and raw performance. Optimizations involving network protocols, node scheduling, and data encoding drew on research traditions from high-performance computing centers and informed later work in middleware such as Condor and in runtime systems used by Google and Amazon Web Services for distributed workloads.
PVM implementations were ported to many UNIX variants and to Microsoft Windows, running on processor families including x86, SPARC, PowerPC, and DEC Alpha used in research at Lawrence Berkeley National Laboratory and in corporate environments at Hewlett-Packard. Community distributions were maintained by academic groups and laboratories including Oak Ridge National Laboratory and the University of Tennessee, and were integrated with cluster toolchains developed at the National Center for Supercomputing Applications and at university computing centers such as those at Princeton University and Yale University. Alternative parallel environments, including MPI libraries and proprietary message-passing systems from IBM and Intel, coexisted with PVM in heterogeneous computing deployments.
PVM’s development during the late 1980s and early 1990s occurred alongside influential projects at Oak Ridge National Laboratory, the University of Tennessee, and Emory University, with funding from agencies such as the National Science Foundation and the Department of Energy. The project shaped pedagogy and research in distributed computing at institutions like the Massachusetts Institute of Technology, Stanford University, and the University of California, Berkeley, and contributed concepts later adopted by standards bodies and initiatives such as the Message Passing Interface working groups, the Beowulf cluster movement, and middleware projects at Argonne National Laboratory. PVM influenced software used in scientific collaborations at CERN and in simulation work at Los Alamos National Laboratory and Sandia National Laboratories.
PVM’s original design targeted trusted research environments typical of university clusters and national laboratories, with authentication and access-control practices influenced by campus networking efforts at the University of Tennessee and by federated collaborations among institutions such as Oak Ridge National Laboratory and Lawrence Berkeley National Laboratory. Fault-tolerance and recovery mechanisms in PVM addressed node failure, task restart, and message loss using techniques explored in distributed-systems research at Carnegie Mellon University and the Massachusetts Institute of Technology. Later operational deployments considered integration with secure cluster-management tools developed by commercial vendors like IBM and Hewlett-Packard, and advances in secure middleware, public-key infrastructures used by NASA projects, and redundancy strategies from supercomputing centers informed subsequent improvements in resilience for distributed parallel environments.