Generated by GPT-5-mini| NumPy ecosystem | |
|---|---|
| Name | NumPy ecosystem |
| Caption | Core array programming libraries and tools |
| Developer | NumPy Developers, Python Software Foundation, community |
| Released | 2006 |
| Programming language | Python, C, Fortran |
| License | BSD |
NumPy ecosystem The NumPy ecosystem is a collection of interoperable Python (programming language), Scientific computing-oriented libraries and tools centered on the NumPy ndarray. It underpins research and production work across institutions such as Massachusetts Institute of Technology, European Organization for Nuclear Research, and Lawrence Berkeley National Laboratory, and influences projects at companies like Google, Facebook, Microsoft, and IBM. The ecosystem intersects with standards and events including PEP 8, SciPy Conference, and initiatives by the Python Software Foundation.
The ecosystem grew from projects including Numeric (software), Numarray, and the founding contributions of developers tied to places like Trenton Systems and academic labs at University of Chicago and University of California, Berkeley. It emphasizes array broadcasting, memory views, and C/Fortran interoperability used in workflows across NASA, European Space Agency, Los Alamos National Laboratory, Argonne National Laboratory, and corporate research groups at Intel, NVIDIA, and Amazon (company). Standards and conferences shaping the landscape include PEP 3118, Python Enhancement Proposal, SciPy Conference, and outputs from organizations like the OpenStack Foundation where high-performance computing patterns cross-pollinated.
Core components center on the ndarray implementation and foundational libraries maintained by the NumPy Developers group and influenced by contributors from Travis Oliphant-led projects and teams at Continuum Analytics (now Anaconda, Inc.). Key languages and tools integrated with core components include C (programming language), Fortran, Cython, and interoperability layers used by projects at Microsoft Research, IBM Research, and the National Institute of Standards and Technology. Tooling ecosystem items adopted in core development workflows include GitHub, Travis CI, GitLab, CircleCI, and package managers like pip and conda from Anaconda, Inc..
A broad array of scientific and machine learning libraries builds on core arrays: SciPy (software), pandas, scikit-learn, TensorFlow, PyTorch, Theano, JAX (library), and domain-specific packages such as Astropy, Biopython, MDAnalysis, nibabel, xarray, dask, mpi4py, numba, cuPy, tweepy, statsmodels, sympy, networkx, matplotlib, seaborn, yellowbrick, lightgbm, xgboost, catboost, gensim, spaCy, NLTK, OpenCV, scikit-image, pandas-profiling, imbalanced-learn, prophet (software), fbprophet, pymc3, pyro (probabilistic programming), Edward (software). Many of these projects have collaborative ties to research centers such as Broad Institute, MIT-IBM Watson AI Lab, DeepMind, and funding from agencies including National Science Foundation and DARPA.
Visualization and I/O tooling interoperable with arrays include Matplotlib, Bokeh (library), Plotly, Seaborn, Altair, Mayavi, ParaView, VTK, Holoviews, Datashader, and connectors for storage systems like HDF5, NetCDF, Zarr, and databases used at enterprises like Oracle Corporation and MongoDB. File and data formats commonly handled by the ecosystem are used in projects affiliated with European Space Agency, NOAA, USGS, and research labs like CERN and include CSV, JSON, Feather (binary columnar storage), Apache Arrow, and ORC (file format). Integration with cloud and big-data platforms occurs via adapters for Apache Spark, Dask (project), Kubernetes, and services offered by Amazon Web Services, Google Cloud Platform, and Microsoft Azure.
Development and governance are influenced by bodies and events such as the NumPy Developers collective, the Python Software Foundation, and the annual SciPy Conference, with significant contributions from individuals and organizations like Travis Oliphant, Stuart Feldman, Pauli Virtanen, Ralf Gommers, Charles Harris, Anaconda, Inc., Enthought, and academic groups at University of Oxford, Princeton University, Harvard University, and University of Cambridge. Community practices follow workflows established on GitHub, legal guidance related to the BSD license, and outreach through venues such as PyCon, EuroPython, Grace Hopper Celebration, and regional meetups supported by institutions like IEEE and ACM. Funding and sustainability conversations engage funders including the Moore Foundation, Gordon and Betty Moore Foundation, and corporate sponsors like Intel and NVIDIA.
The ecosystem underpins research and products across sectors represented by institutions like Pfizer, Moderna, Roche, GlaxoSmithKline, Siemens, Boeing, Lockheed Martin, Goldman Sachs, and academic consortia such as Human Genome Project collaborators. Its impact appears in landmark collaborations with projects like Large Hadron Collider, Human Connectome Project, and climate modeling efforts at NOAA and IPCC. The architecture and practices influenced downstream standards and frameworks at companies including Netflix, Airbnb, Uber, and research groups at Facebook AI Research and OpenAI.
Category:Scientific software