Generated by GPT-5-mini| PyData Global | |
|---|---|
| Name | PyData Global |
| Status | Active |
| Genre | Technical conference |
| Frequency | Annual |
| First | 2014 |
| Organizer | NumFOCUS |
| Location | International / Online |
PyData Global PyData Global is an annual international conference focused on the Python data ecosystem, machine learning, and open-source tools. The event gathers practitioners from projects, companies, and institutions to discuss software like NumPy, pandas (software), SciPy, scikit-learn, and frameworks such as TensorFlow and PyTorch. Attendees include contributors from organizations like Anaconda, Inc., Google, Facebook, Microsoft, and research groups from MIT, Stanford University, University of California, Berkeley, and Harvard University.
PyData Global showcases talks, tutorials, and workshops about libraries and platforms including Dask, XGBoost, LightGBM, cuPy, Jupyter Notebook, JupyterLab, and Apache Arrow. The conference highlights open-source governance from foundations such as NumFOCUS, Apache Software Foundation, and Linux Foundation. Industry adoption is visible via companies like Netflix, Uber, Airbnb, Spotify, LinkedIn, IBM, NVIDIA, and Intel. Research collaborations feature projects from DeepMind, OpenAI, Allen Institute for AI, and Microsoft Research.
The PyData Global lineage traces influences from early Python community gatherings like PyCon US, SciPy Conference, EuroPython, PyData New York, and PyCon UK. Key historical moments reference contributors and projects such as Travis Oliphant, Wes McKinney, Fernando Pérez, Matthew Rocklin, and institutions like Los Alamos National Laboratory and Lawrence Berkeley National Laboratory. Conference evolution parallels milestones in machine learning and data science involving ImageNet Large Scale Visual Recognition Challenge, AlexNet, ResNet, and frameworks such as Theano and Keras.
Typical formats include keynote addresses from leaders at Google Research, Facebook AI Research, and DeepMind, alongside tutorials led by teams from OpenAI, NVIDIA, Intel AI Lab, and universities such as Carnegie Mellon University, University of Oxford, University of Cambridge, ETH Zurich, and Imperial College London. Community-driven content often mirrors governance models from NumFOCUS, Python Software Foundation, Open Source Initiative, and Creative Commons. Events include sponsor booths from Red Hat, Canonical Ltd., AWS, Google Cloud Platform, and Microsoft Azure.
Tracks cover applied machine learning with libraries like scikit-image, statsmodels, spaCy, NLTK, and Gensim; data engineering featuring Airflow, Prefect, Apache Spark, Flink, and Kafka; visualization with matplotlib, seaborn, Bokeh, Plotly, and Altair; and reproducible research using Docker, Singularity, Conda, GitHub, and GitLab. Specialized tracks address ethics and policy in AI and cite work from Partnership on AI, AI Now Institute, IEEE Standards Association, European Commission, and UNESCO.
Organizing roles are frequently filled by volunteers from NumFOCUS, Python Software Foundation, local meetup chapters like PyData London, PyData New York City, PyData Amsterdam, and academic labs including Berkeley AI Research (BAIR), Stanford AI Lab, MIT CSAIL, and Oxford Machine Learning Research Group. Sponsors and partners have included Anaconda, Continuum Analytics, DataCamp, O’Reilly Media, Springer Nature, ACM, IEEE, and The Alan Turing Institute.
Attendees comprise data scientists, engineers, researchers, and educators from corporations such as Goldman Sachs, JPMorgan Chase, Capital One, Waymo, Cruise LLC, Siemens, Boeing, and healthcare organizations like Mayo Clinic, Cleveland Clinic, and Johns Hopkins University. Student engagement draws undergraduates and graduates from Caltech, Princeton University, Yale University, Columbia University, University of Toronto, McGill University, and University of British Columbia. Recruitment and career fairs partner with firms like Palantir Technologies, Databricks, Snowflake, and ThoughtWorks.
PyData Global has influenced adoption and development across open-source ecosystems including NumPy Developers, pandas Community, Jupyter community, and ecosystem projects like scikit-learn-contrib. Outreach extends to workshops with nonprofits such as DataKind, International Rescue Committee, Medic Mobile, and educational initiatives with Code.org, Girls Who Code, Black Girls CODE, and Mozilla Foundation. The conference has supported datasets and reproducible benchmarks akin to OpenAI Gym, UCI Machine Learning Repository, Kaggle, ImageNet, and COCO.
Category:Python (programming language) conferences