General:
- Brewer’s CAP theorem – Fundamental limit on guarantees of distributed systems
- Google BigTable paper – Google’s large-scale column-oriented store
- Amazon Dynamo paper – Amazon’s eventually consistent key/value store
- No Relation : The mixed blessing of non-relational databases – Master Thesis, Ian Thomas Varley
- http://nosql-database.org – Regularly updated list of nosql projects/articles
Paradigms
MapReduce / Hadoop
- Apache Hadoop – Distributed Filesystem &
- Hadoop Streaming
- MapReduce in academic papers
Distributed Caching
Distributed Consensus Protocols
- Byzantine fault tolerance
- Paxos algorithm – Lamport’s classical distributed concensus protocol
Concurrency Patterns
Data Partitioning
Data Replication
Open Source Projects
- Non-relational data stores
- Apache HBase – BigTable-based column-oriented store
- Project Voldemort – Eventual-consistency key/value store
- Apache Cassandra – Dynamo/BigTable hybrid-approach database
- Tokyo Cabinet – Ordered key-value store
- Message queueing
- ActiveMQ
- Apache QPid – Open Source AMQP implementation
- Distributed coordination
- Apache Zookeeper – Paxos-based distributed synchronization
- Distributed Filesystems
- Analytics Platforms
Selected Articles
- GFS Evolution – ACM Queue article
- HDFS Scalability - Yahoo Blog article
Selected Papers
- High Performance Scalable Data Stores – Overview by Rick Catell
Selected Presentations
- Scalability, Availability & Stability Patterns – High-level overview on number of scalability topics, by Jonas Boner