Reliable, scalable, distributed computing: Processing large datasets using Hadoop(tm) CapBUG Talk (September 2012) Jason Crawford