This article was accepted into the corpus but its outbound wikilinks were never NER-processed — typical at the deepest BFS hop or when the run's entity cap was reached. No expansion funnel to show.
| DataFest | |
|---|---|
| Name | DataFest |
| Type | Competition and Festival |
| Founded | 2008 |
| Founders | Institute of Analytics |
| Headquarters | Chicago |
| Region served | International |
DataFest DataFest is an annual multi-day student and professional competition and festival centered on applied data analysis, statistical inference, machine learning, and data visualization. It convenes teams from universities, research institutes, corporate labs, and nonprofit organizations to engage with real-world datasets supplied by corporate partners such as Facebook, Google, Microsoft, IBM, and Amazon. The event combines elements of hackathons like MHacks, challenges such as Kaggle competitions, and showcase formats used by conferences including NeurIPS, SIGGRAPH, CHI, and Strata Data Conference.
DataFest functions as a focal point for collaboration among participants from institutions such as Massachusetts Institute of Technology, Stanford University, University of California, Berkeley, University of Chicago, Harvard University, Columbia University, Princeton University, University of Oxford, University of Cambridge, National University of Singapore, Tsinghua University, University of Toronto, ETH Zurich, Imperial College London, University of Melbourne, University of Tokyo, Seoul National University, Peking University, University of Michigan, Carnegie Mellon University, Yale University, Brown University, Duke University, Cornell University, University of Washington, University of Illinois Urbana–Champaign, Georgia Institute of Technology, University of Pennsylvania, Johns Hopkins University, Northwestern University, University of British Columbia, McGill University, Université Paris-Saclay, École Polytechnique, University of Amsterdam, Utrecht University, Leiden University, University of Copenhagen, KU Leuven, Catholic University of Leuven, University of Hong Kong, Hong Kong University of Science and Technology, National Taiwan University, Seoul National University Hospital, Pohang University of Science and Technology, Monash University, University of Sydney, University of New South Wales, Auckland University of Technology to network with practitioners from Goldman Sachs, JP Morgan Chase, Bloomberg L.P., Uber, Airbnb, Spotify, LinkedIn, Tesla, Inc., Siemens, Procter & Gamble, Unilever, Johnson & Johnson, Pfizer, Roche, Novartis, Shell plc, BP plc, ExxonMobil.
DataFest traces its roots to student-led analytics gatherings inspired by events like Data Science Bowl and HackMIT; early iterations involved partnerships with organizations such as The New York Times, The Guardian, ProPublica, Wikimedia Foundation, OpenAI, DeepMind, DARPA, National Science Foundation, European Research Council, Bill & Melinda Gates Foundation, Wellcome Trust, Ford Foundation, Rockefeller Foundation, Mozilla Foundation, and Linux Foundation. Influenced by academic competitions including Putnam Competition and Mathematical Contest in Modeling, the festival grew from regional meetups to international editions in cities like Chicago, New York City, San Francisco, London, Berlin, Paris, Singapore, Sydney, Toronto, Vancouver, Zurich, Geneva, Tokyo, Seoul, Beijing, Shanghai, Mumbai, Bangalore, and Tel Aviv. High-profile guest lecturers have included speakers from Harvard Medical School, Stanford School of Engineering, Princeton School of Public and International Affairs, MIT Media Lab, Oxford Internet Institute, Cambridge Judge Business School, Columbia Journalism School, Wharton School, Sloan School of Management, and INSEAD.
Host institutions range from university departments such as Department of Statistics at various universities to professional organizations like Association for Computing Machinery, Institute of Electrical and Electronics Engineers, American Statistical Association, Royal Statistical Society, IEEE DataPort, and Data Science Nigeria. Formats borrow from competitions such as ACM ICPC, IEEE Big Data, Imagine Cup, Microsoft Build, and Google I/O workshops. Sponsor tiers have included firms like Accenture, Deloitte, McKinsey & Company, Boston Consulting Group, EY, KPMG, Capgemini, Atos, SAP SE, Oracle Corporation, Salesforce, Tableau Software, Snowflake Inc., Databricks, Cloudera, Palantir Technologies, SAS Institute, Alteryx, H2O.ai.
Participants typically include undergraduate students, graduate students, postdoctoral researchers, faculty, industry researchers, data engineers, and data scientists from organizations such as NASA, European Space Agency, CERN, Siemens Healthineers, Mayo Clinic, Cleveland Clinic, Kaiser Permanente, General Electric, Boeing, Lockheed Martin, Northrop Grumman, Raytheon Technologies, BAE Systems, Thales Group, Renault, Volkswagen Group, Toyota Motor Corporation, BMW Group, Hyundai Motor Company, Nissan Motor Co., Honda Motor Co.. Registration processes are managed via portals modeled after Eventbrite, Meetup, Cvent, TryHackMe, and HackerRank; eligibility rules often reference institutional affiliation verification similar to Common App functions and code-of-conduct policies akin to TechCrunch Disrupt and SXSW.
The competition structure integrates problem tracks inspired by challenges at ImageNet Large Scale Visual Recognition Challenge, KDD Cup, ICDM, SIGKDD Explorations, AAAI Competitions, IJCAI, ICML Competitions, and NeurIPS competitions. Scoring systems borrow concepts from ROC curve evaluations used in medical diagnostics (e.g., Youden's J statistic), information-theoretic metrics referenced in Claude Shannon's work, and reproducibility practices from Reproducibility Project. Judges and mentors have affiliations with institutions like Bell Labs, AT&T Labs, Google Research, Facebook AI Research, Microsoft Research, IBM Research, Amazon Web Services Research, Adobe Research, Intel Labs, NVIDIA Research, OpenAI Research.
Notable outcomes include student projects that spun out as startups competing in incubators such as Y Combinator, Techstars, 500 Startups, Plug and Play Tech Center, and acceleration programs at MassChallenge and StartX. DataFest presentations have been cited at conferences including AAAI, ICML, NeurIPS, KDD, SIGMOD, VLDB, ICLR, WWW Conference, AAAS Annual Meeting, ESWC, PODS, EMNLP, ACL (conference), ICWSM. Alumni have received fellowships and awards from Rhodes Scholarship, Marshall Scholarship, Fulbright Program, National Institutes of Health, Howard Hughes Medical Institute, MacArthur Fellows Program, Simons Foundation, Alfred P. Sloan Foundation, John D. and Catherine T. MacArthur Foundation, and grants from Horizon Europe.
Educational initiatives include workshops modeled on curricula from DataCamp, Coursera, edX, Udacity, Khan Academy, Codecademy, fast.ai, OpenCourseWare, and summer schools like Robin Hanson Summer School (example format). Collaborations have extended to nonprofit education partners such as Code.org, Girls Who Code, Black Girls CODE, Teach For America, FIRST Robotics Competition, Society of Women Engineers, National Center for Supercomputing Applications, American Mathematical Society, Association for Women in Mathematics, Mathematical Association of America, Computing Research Association, National Academy of Sciences, Royal Society.
Criticism has centered on issues similar to debates around Kaggle and OpenAI: data privacy concerns analogous to controversies at Facebook–Cambridge Analytica data scandal, algorithmic bias discussed in contexts like COMPAS (software), reproducibility problems highlighted by Reproducibility Project: Psychology, and equity concerns parallel to debates at SXSW and TechCrunch Disrupt. Logistical challenges have mirrored those faced by large events such as Olympic Games and World Expo including venue capacity in cities like London, New York City, San Francisco, Chicago, Berlin, Paris, transportation coordination with agencies like Transport for London, MTA (New York City Transit), Bay Area Rapid Transit, and accommodations similar to issues addressed by UN Habitat.
Category:Competitions