NOSQL Talk and References


This page contains the links I used as references in preparing my NOSQL talks at OGF29 and at OSCON '10.

[Update 7/16] Latest OSCON Slideset (Slideshare has a bug – cannot update!)
OSCON Tutorial Handout & AMI details (Link to my next blog that has the handout & other info)
My talk is now available as a video from O’Reilly and at Safari Books!

Links:

In no particular order and with no judgment, except that I found these particularly useful and have incorporated their materials and insights into my presentation:

[I-1] http://blog.mongodb.org/post/660037122/holy-large-hadron-collider-batman
[I-2] * http://www.sciencedirect.com/science?_ob=ArticleURL&_udi=B9865-506HM1Y-63&_user=10&_coverDate=05%2F31%2F2010&_alid=1357020903&_rdoc=1&_fmt=high&_orig=search&_cdi=59117&_sort=r&_docanchor=&view=c&_ct=1&_acct=C000050221&_version=1&_urlVersion=0&_userid=10&md5=095f3839793bb37ca66502f36f81f0c8
[I-3] * http://www.slideshare.net/vkuznet/das-iccs-2010
[I-4] http://cms.web.cern.ch/cms/Education/ComicBook/index.html
[I-5] http://www.royans.net/arch/mongodb/
[I-6] http://www.royans.net/arch/mongodb-migration-from-mysql-at-wordnik/
[I-7] http://www.slideshare.net/netik/billions-of-hits-scaling-twitter
[I-8] http://about.digg.com/blog/looking-future-cassandra
[I-8a] http://highscalability.com/how-i-learned-stop-worrying-and-love-using-lot-disk-space-scale
[I-9] http://nosql.mypopescu.com/post/407159447/cassandra-twitter-an-interview-with-ryan-king
[I-10] * http://hstack.org/why-were-using-hbase-part-1/
[I-11] * http://hstack.org/why-were-using-hbase-part-2/
[I-12] https://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/
[I-13] http://www.couch.io/case-studies
[I-14] http://blogs.the451group.com/information_management/2010/04/21/user-perspectives-on-nosql/
[I-15] http://nosql.mypopescu.com/post/518121097/hbase-adobe-an-interview-with-adobe-saas
[I-16] http://techcrunch.com/2010/03/16/big-data-freedom/
[1] [Good practical tips and answers to questions, HBase 0.19-0.21] http://www.techcast.com/events/apacheconus09/mstack/ JGray, MStack: Practical HBase, ApacheCon 2009
[2] Examples of HBase usage: http://wiki.apache.org/hadoop/Hbase/PoweredBy
[3] http://www.slideshare.net/guestdfd1ec/design-patterns-for-distributed-nonrelational-databases <- title says it all. Patterns of NOSQL datastores
[4] http://staff.ustc.edu.cn/~jpq/paper/flash/1996-The%20Log-Structured%20Merge-Tree%20(LSM-Tree).pdf <- Log-Structured Merge-Tree paper for distributed writing algorithms
[5] http://labs.google.com/papers/bigtable-osdi06.pdf <- Bigtable paper from Google, which started it all
[6] http://ianvarley.com/UT/MR/Varley_MastersReport_Full_2009-08-07.pdf <- masters thesis. No Relation: The Mixed Blessings of Non-Relational Databases
[7] DeCandia, … “Dynamo: Amazon’s Highly Available Key-Value Store”,
[8] Lakshman, A. Cassandra Project Presentation at June 2009 NoSQL Meetup, San Francisco. http://static.last.fm/johan/nosql-20090611/cassandra_nosql.pdf
[9] Helland, P. “Life beyond Distributed Transactions: an Apostate’s Opinion”. 3rd Biennial Conference on Innovative Data Systems Research (CIDR), 2007.
[10] Taylor, B. “How FriendFeed uses MySQL to store schema-less data.” February, 2009. http://bret.appspot.com/entry/how-friendfeed-uses-mysql

[11] How to Make Life Suck Less! (When building scalable systems) – Bradford Stephens http://www.youtube.com/watch?v=N0iAT47ScWg
[12] http://about.digg.com/node/564 Digg Saying Yes to NoSQL; Going Steady with Cassandra
[13] http://about.digg.com/blog/database-capabilities-high-volume-environment What is a database?
[14**] CAP Theorem – Must Read Homework http://www.julianbrowne.com/article/viewer/brewers-cap-theorem
[15**] Must Read Homework: http://about.digg.com/blog/looking-future-cassandra
[16] Scalable Internet Architectures – Theo Schlossnagle YouTube Video http://www.youtube.com/watch?v=2WuT2rdLK5A&NR=1
[17] http://blog.marcua.net/post/442594842/notes-from-nosql-live-boston-2010 Notes from NOSQL Boston March 11/2010. Good read
[18] http://www.slideshare.net/bscofield/nosql-codemash-2010 < good taxonomy
[19] NOSQL Eco System – http://www.rackspacecloud.com/blog/2009/11/09/nosql-ecosystem/# <- Reqd Reading – quick comparison of 10 NOSQL DBs on scalability, data and query model, and persistence design.
[20] http://blog.sym-link.com/2009/10/30/nosql_whats_in_a_name.html < origin of NOSQL
[21] http://www.slideshare.net/emileifrem/neo4j-the-benefits-of-graph-databases-oscon-2009 – neo4j presentation at OSCON 2009
[22] NOSQL OAKLAND, Steve Yen’s talk – NOSQL is a horseless carriage http://dl.getdropbox.com/u/2075876/nosql-steve-yen.pdf – taxonomy of NOSQL
[23] http://perspectives.mvdirona.com/2009/11/03/OneSizeDoesNotFitAll.aspx James Hamilton talks about the NOSQL categorization. Good read on the dilemma
[24] http://blogs.nyu.edu/blogs/gc69/stdin/2008/09/
[25] http://daimlerpics.blogspot.com/2009/10/mercedes-benz-daimler-motor-carriage.html
[26] http://www.horselesscarriage.com/
[27] http://invisiblecollege.weblog.leidenuniv.nl/ Blog
[28] http://gigaom.com/2009/10/25/the-future-is-big-data-in-the-cloud/ – quick overview of big data. Required reading
[29] http://gigaom.com/2009/09/20/getting-closer-to-real-time-with-hadoop/
[30] http://blog.oskarsson.nu/2009/06/nosql-debrief.html – videos and presentation from NOSQL at SFO/June’09
[31] http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html – Reqd read
[32] http://nosql.mypopescu.com/post/405547409/8-reasons-you-should-like-couchdb-and-not-only
[33] http://blog.endpoint.com/2010/03/nosql-live-dynamo-derivatives-cassandra.html Notes from NOSQL Boston March 11, 2010
[34] http://www.jaxzin.com/2010/03/first-impressions-from-nosql-live.html Another view of Boston NOSQL March 11, 2010
[35] http://portal.acm.org/ft_gateway.cfm?id=1394128&type=pdf <- BASE Paper
[36] * http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.20.1495&rep=rep1&type=pdf <- Gilbert & Lynch Paper
[37] * http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.24.3690&rep=rep1&type=pdf <- Fox and Brewer paper
[38] http://www.allthingsdistributed.com/2008/12/eventually_consistent.html
[39] http://www.cs.berkeley.edu/~brewer/cs262b-2004/PODC-keynote.pdf <- Brewer’s keynote
[40] http://www.infoq.com/presentations/Project-Voldemort-at-Gilt-Groupe
[41] http://www.infoq.com/presentations/availability-consistency
[42] http://cs.gmu.edu/~setia/cs707/slides/consistency.pdf
[43] http://cs-www.cs.yale.edu/homes/dna/papers/vldb07hstore.pdf <- The End of an Architectural Era
[44] http://blog.mongodb.org/post/475279604/on-distributed-consistency-part-1
[45] http://dbmsmusings.blogspot.com/2010/04/problems-with-cap-and-yahoos-little.html
[46] http://cacm.acm.org/blogs/blog-cacm/83396-errors-in-database-systems-eventual-consistency-and-the-cap-theorem/fulltext
[47] http://perspectives.mvdirona.com/2010/04/07/StonebrakerOnCAPTheoremAndDatabases.aspx
[48] http://www.cidrdb.org/cidr2007/papers/cidr07p15.pdf
[49] http://www.allthingsdistributed.com/2010/02/strong_consistency_simpledb.html
[50] http://developer.amazonwebservices.com/connect/entry.jspa?externalID=3572
[51] http://blog.knuthaugen.no/2010/03/the-nosql-landscape.html
[52] http://blog.nahurst.com/visual-guide-to-nosql-systems
[53] http://www.slideshare.net/benjaminblack/introduction-to-cassandra-replication-and-consistency
[60] http://www.mongodb.org/display/DOCS/Tutorial
[61] http://www.mongodb.org/display/DOCS/Comparing+Mongo+DB+and+Couch+DB
[62] http://try.mongodb.org/ <- Interactive Shell!
[63] http://blog.boxedice.com/2010/05/05/mongodb-webinar-approaching-1-billion-documents-slides-and-audio/
[64] http://i.imgur.com/hxxYn.jpg
[65] http://techzinglive.com/?p=192 <- Justin and Jason speak with Michael Dirolf, a lead developer at 10gen
[66] http://blog.xebia.com/2010/03/28/mongodb-geospatial-index/
[67] http://dirolf.com/2010/05/27/inside-mongodb.html
[68] http://en.wikipedia.org/wiki/Geohash
[69] http://www.10gen.com/event_schemadesign_10apr27
[70] http://nosql.mypopescu.com/post/394779847/mongodb-tutorial-mapreduce
[71] http://kylebanker.com/blog/2009/12/mongodb-map-reduce-basics/
[72] http://www.paperplanes.de/2010/2/25/notes_on_mongodb.html
[73] http://cookbook.mongodb.org/
[74] http://www.slideshare.net/gabriele.lana/couchdb-vs-mongodb-2982288
[75] http://www.slideshare.net/gdusbabek/cassandra-presentation-for-san-antonio-jug
[76] http://wiki.apache.org/cassandra/GettingStarted
[77] http://wiki.apache.org/cassandra/Operations
[78] http://arin.me/blog/wtf-is-a-supercolumn-cassandra-data-model
[78a] http://mail-archives.apache.org/mod_mbox/incubator-cassandra-dev/200908.mbox/%3Cb6f68fc60908111037ofdc0d6csa39543857e3583a2@mail.gmail.com%3E
[79] http://blog.evanweaver.com/articles/2009/07/06/up-and-running-with-cassandra/
[80] http://stackoverflow.com/questions/1502735/whats-the-best-practice-in-designing-a-cassandra-data-model
[81] http://stackoverflow.com/questions/tagged?tagnames=cassandra&sort=hot&pagesize=50 <- Stack Overflow questions
[82] http://blog.octo.com/en/nosql-lets-play-with-cassandra-part-13/
[83] http://dustyreagan.com/installing-cassandra-on-ubuntu-linux/
[84] http://www.sodeso.nl/?p=251
[85] http://prettyprint.me/2010/01/20/introduction-to-nosql-and-cassandra-part-2/
[86] Efficient Reconciliation and Flow Control for Anti-Entropy Protocols, Robbert van Renesse et al, http://www.cs.cornell.edu/home/rvr/papers/flowgossip.pdf
[87] Consistent Hashing & Random Trees http://www.akamai.com/dl/technical_publications/ConsistenHashingandRandomTreesDistributedCachingprotocolsforrelievingHotSpotsontheworldwideweb.pdf
[88] Unreliable Failure Detectors for Reliable Distributed Systems http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.18.1159
[89] The Weakest Failure Detector for Solving Consensus http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.55.8585
[90] A Weakly coupled adaptive gossip protocol http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.109.297
[91] http://wiki.apache.org/cassandra/ArchitectureAntiEntropy
[92] http://wiki.apache.org/cassandra/AntiEntropy
[93] http://wiki.apache.org/cassandra/ArchitectureGossip
[94] The ϕ Accrual Failure Detector, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.106.3350
[95] http://www.slideshare.net/mwynholds/cassandra-and-rails-at-la-nosql-meetup
[96] http://www.slideshare.net/kbanker/mongodb-schema-design-mongony
[97] http://en.wikipedia.org/wiki/List_of_cities_by_latitude
[98] http://www.slideshare.net/slidarko/an-overview-of-data-management-paradigms-relational-document-and-graph-3880059
[99] https://issues.apache.org/jira/browse/HBASE-1200
[100] http://www.sodeso.nl/?p=108
[101] http://maxgrinev.com/2010/07/09/a-quick-introduction-to-the-cassandra-data-model/
[102] http://maxgrinev.com/2010/07/12/do-you-really-need-sql-to-do-it-all-in-cassandra/
[103] http://wiki.apache.org/cassandra/DataModel
[104] http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.145.1796 Principles for Inconsistency
[998] NOSQL Summer Read papers : http://doubleclix.wordpress.com/2010/06/12/a-path-throug-nosql-summer-reading/
[999] Poems from http://www.bartleby.com
http://nosql-database.org/ <- exhaustive list of all NOSQL datastores
http://internetmindmap.com/database_software <- list of links for NOSQL databases, DHT implementations et al

http://www.zemanta.com/fruitblog/nosql-meetup-reporton/

