NoSQL Matters — LLMpedia

NoSQL Matters
Name	NoSQL Matters
Type	Conceptual overview
First appeared	2000s
Influenced by	NoSQL movement, Big Data, cloud computing
Related	Distributed databases, NewSQL, ACID, BASE

Contents

NoSQL Matters

NoSQL Matters is an umbrella term describing discussions, conferences, publications, and communities centered on non-relational database technologies. It synthesizes the evolution of database systems linked to pioneers and organizations such as Amazon Web Services, Google, Facebook, Twitter, LinkedIn and academic labs at University of California, Berkeley, Massachusetts Institute of Technology, Stanford University and Carnegie Mellon University. The topic intersects with events and projects like Hadoop, MapReduce, Cassandra (database), MongoDB, Redis, CouchDB and standards debates involving institutions such as Internet Engineering Task Force and Apache Software Foundation.

Overview

NoSQL Matters surveys the landscape shaped by practitioners from Eric Brewer-influenced communities, companies like Amazon, Google, Facebook and research groups at Princeton University, University of Washington, University of Toronto and Imperial College London. Influential conferences include Strata Data Conference, QCon, Kafka Summit, KubeCon and Velocity Conference, while trade press and publishers such as O'Reilly Media, ACM, IEEE and Wired (magazine) provide coverage. Key projects credited with catalyzing the movement include Bigtable, Dynamo, HBase, Riak and ElasticSearch, with corporate contributors like Oracle Corporation, Microsoft, IBM, SAP SE and Cloudera.

Discussions cover document stores exemplified by MongoDB, Couchbase, CouchDB, and RavenDB; column-family stores exemplified by Cassandra (database), HBase, and ScyllaDB; key-value stores like Redis, Amazon DynamoDB, Berkeley DB and Etcd; and graph databases such as Neo4j, JanusGraph, TigerGraph and Amazon Neptune. Influences include academic systems like Google Bigtable and Dynamo (storage system), and vendor products from Oracle Corporation and Microsoft such as Azure Cosmos DB. Discussions often reference schema-less design promoted by contributors at MongoDB, Inc. and query patterns studied in papers from University of California, Berkeley and MIT CSAIL.

NoSQL Matters emphasizes applications in web-scale services operated by Facebook, Twitter, LinkedIn, Netflix and Pinterest; real-time analytics at Uber Technologies, Airbnb, Stripe (company) and PayPal; content management for The New York Times, BBC, The Guardian and Wikimedia Foundation; recommendation engines used by Amazon (company), Spotify, YouTube and SoundCloud; and geospatial services by HERE Technologies, Esri, TomTom and Google Maps. It also covers scientific data platforms at NASA, CERN, European Space Agency and National Institutes of Health, and IoT deployments by Siemens, Bosch, General Electric and Schneider Electric.

Coverage spans tradeoffs highlighted by the CAP theorem and work by Eric Brewer and Seth Gilbert and Nancy Lynch; comparisons with ACID properties and BASE approaches debated at ACM SIGMOD and VLDB (conference). Scalability stories reference systems like Cassandra (database), HBase, Dynamo (storage system), Elasticsearch, CockroachDB and Spanner (Google) and corporate infrastructures at Amazon Web Services, Google Cloud Platform, Microsoft Azure and IBM Cloud. Performance tuning and benchmarks discussed at forums like TPC (Transaction Processing Performance Council) and in papers from SIGMOD illustrate concerns for latency in services run by Netflix, Twitter, Instagram and Snap Inc..

Topics include query languages and APIs developed by MongoDB, Inc., Apache Lucene, Elastic (company), and projects such as SPARQL and Gremlin from Apache TinkerPop. Indexing strategies reference implementations in Elasticsearch, Solr, PostgreSQL extensions, and graph query engines like Neo4j and TigerGraph. Integration patterns with Apache Kafka, Apache Flink, Apache Spark and Beam (software) are common, as are migration stories involving MySQL, PostgreSQL, Oracle Database and Microsoft SQL Server.

Security discussions cite breach incidents and responses involving organizations like Equifax, Target Corporation, Yahoo!, and compliance frameworks from HIPAA, GDPR, PCI DSS, and guidance from National Institute of Standards and Technology. Operational topics include backup and recovery practices used by Dropbox, Box (company), and GitHub, deployment automation with Kubernetes, Docker, Ansible, Terraform, monitoring with Prometheus and Grafana, and incident response modeled by teams at Google, Facebook and Netflix.

Adoption narratives reference vendors and adopters such as MongoDB, Inc., DataStax, Couchbase, Redis Labs, Elastic (company), Cloudera, Confluent, Google, Amazon, Microsoft and IBM. Criticisms from academic venues like SIGMOD, VLDB, USENIX and commentators at The Register, InfoQ and TechCrunch focus on consistency guarantees, operational complexity, and vendor lock-in. Emerging trends discussed in forums like KubeCon, Re:Invent, Google Cloud Next and Microsoft Ignite include multi-model databases, convergence with NewSQL systems such as Cockroach Labs and VoltDB, serverless databases like Amazon Aurora Serverless and Azure Cosmos DB, and research from MIT, Stanford University and UC Berkeley on distributed transactions and programmability.

Category:Databases