LLMpediaThe first transparent, open encyclopedia generated by LLMs

Strata Conference

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: 10gen Hop 4
Expansion Funnel Raw 114 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted114
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Strata Conference
NameStrata Conference
StatusActive
GenreTechnology conference
First2010
FrequencyAnnual
VenueVaries
CountryUnited States
OrganizerO'Reilly Media

Strata Conference Strata Conference is an annual technology event focused on data science, big data, machine learning, artificial intelligence, data engineering and related cloud computing ecosystems. Founded and produced by O'Reilly Media, the conference attracts practitioners from Amazon Web Services, Google Cloud Platform, Microsoft Azure, IBM Watson, and academic institutions such as Massachusetts Institute of Technology, Stanford University, and University of California, Berkeley. Strata functions as a convergence point for professionals affiliated with companies like Facebook, Twitter, LinkedIn, Airbnb, Uber, Netflix, and research groups from Google Research, DeepMind, and OpenAI.

History

The event originated in 2010 as a response to rising interest in Hadoop, MapReduce, NoSQL databases, and the Apache Hadoop ecosystem, drawing speakers from Yahoo! Research, Cloudera, Hortonworks, MapR Technologies, and Facebook Research. Early conferences featured practitioners from Yahoo!, eBay, LinkedIn Engineering, Twitter Engineering, and academics from UC Berkeley AMP Lab and Carnegie Mellon University. Over time Strata shifted to include topics covered by teams at Google, Microsoft Research, IBM Research, Intel Labs, and startups like Cloudera and Databricks. Major milestones included sessions responding to breakthroughs from ImageNet, AlexNet, AlphaGo, and publications from NeurIPS, ICML, KDD, and SIGMOD communities.

Topics and Tracks

Strata's programming covers applied themes such as machine learning productionization, deep learning frameworks like TensorFlow, PyTorch, and Keras, feature engineering used by teams at Airbnb and Spotify, and scalable storage patterns employed by Amazon S3, Google BigQuery, Snowflake (company), and Apache Cassandra. Tracks often address pipeline orchestration with tools from Apache Airflow, Kubeflow, and Argo Workflows and analytics patterns involving Spark (software), Flink, Hadoop Distributed File System, and Presto (SQL query engine). Security and governance sessions reference standards and practices from GDPR, California Consumer Privacy Act, and case studies from PayPal, Mastercard, Goldman Sachs, and JPMorgan Chase. Emerging topics have included reinforcement learning advances from DeepMind, OpenAI, fairness work from ACM Conference on Fairness, Accountability, and Transparency, and production challenges highlighted by Netflix and Uber Engineering.

Speakers and Keynotes

Keynotes historically featured leaders and researchers such as executives from Google, Amazon, Microsoft, and principal scientists from IBM Watson, DeepMind, OpenAI, and Facebook AI Research. Notable presenters have included alumni of MIT Media Lab, faculty from Stanford University Department of Computer Science, and researchers with affiliations to Carnegie Mellon University School of Computer Science, Harvard John A. Paulson School of Engineering and Applied Sciences, and Princeton University. Industry speakers represent teams at Airbnb Engineering, Spotify Technology S.A., LinkedIn Data Science, and Netflix TechBlog, alongside open source maintainers from projects like Apache Software Foundation and foundations such as Linux Foundation and Apache Software Foundation projects.

Format and Events

The conference format blends conference sessions, tutorial workshops, hands-on training, and vendor expo halls featuring sponsors such as IBM, Google, Amazon, Microsoft, Snowflake (company), Databricks, Cloudera, and Confluent. Events include multi-day training courses like those offered by O'Reilly Media instructors, lightning talks, poster sessions aligned with practices seen at NeurIPS and ICML, and hackathons comparable to those hosted by GitHub and Kaggle. Strata has incorporated panels on ethics and policy with contributors connected to Electronic Frontier Foundation, OpenAI Policy, and regulatory perspectives influenced by European Commission discussions on AI.

Attendance and Community

Attendees typically include data scientists, data engineers, machine learning engineers, CTOs, product managers, and researchers from companies like Facebook, Google, Amazon, Microsoft, Netflix, Uber, Airbnb, LinkedIn, and startups incubated in Y Combinator and Techstars. The community extends to academic collaborators from MIT, Stanford University, UC Berkeley, Carnegie Mellon University, and professional groups such as ACM, IEEE, and Data Science Association. Local meetups and satellite events intersect with communities hosted by PyData, Machine Learning Meetup, Women in Machine Learning, and veteran conferences like Strata Data Conference and Hadoop World attendees.

Sponsorship and Organization

Organization and sponsorship are led by O'Reilly Media with partnerships from cloud providers Amazon Web Services, Google Cloud Platform, Microsoft Azure, and enterprise vendors such as IBM, Snowflake (company), Databricks, Cloudera, Confluent, and MongoDB. The conference collaborates with academic institutions like MIT, Stanford University, and UC Berkeley for curriculum development and with professional societies including ACM and IEEE for outreach. Financial and logistical support has come from corporate innovation teams at Salesforce, Oracle Corporation, SAP, and consulting firms such as McKinsey & Company and Accenture.

Category:Technology conferences