Stephen McAteer

Father of two, husband of one. PhD in mathematical physics. Lead data scientist at the Victorian Auditor-General's Office. Long-suffering Essendon supporter.

LinkedIn - GitHub - email

Image source: https://upload.wikimedia.org/wikipedia/commons/thumb/2/27/Cubieboard_HADOOP_cluster.JPG/640px-Cubieboard_HADOOP_cluster.JPG
9 April 2015

Why 100 duck-sized horses are scarier than 1 horse-sized duck (and how we set up our Hadoop cluster) (presentation)

In this presentation we give a brief overview of the benefits and modern techniques of distributed parallel computation (MapReduce) and distributed storage (Hadoop Distributed File System). That is, we teach you a healthy fear of duck-sized horses. We then go through the steps involved in setting up a Hadoop cluster from pre-loved desktops. Finally, we provide a tour of an actual, real-life cluster.
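For readers who have not met MapReduce before, here is a minimal single-machine sketch of the idea, using the classic word-count example. It is not taken from the presentation: the function names (map_phase, shuffle_phase, reduce_phase, word_count) are made up for illustration, and everything runs in one Python process. On a real Hadoop cluster the map and reduce steps would run on many nodes at once, and the grouping step would be handled by the framework.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of input."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group emitted values by key (done by the framework on a real cluster)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all the values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence on one machine (illustrative only)."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["duck sized horses", "horse sized duck"]
    print(word_count(sample))
```

Each call to map_phase looks at only one line and each call to reduce_phase looks at only one key, so the work can be split across many small machines. That independence is the whole point of the duck-sized horses.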

(Based on work with Ross Ashman, Justin Beck and Timothy Surendonk, presented at the DST Group, MCA MSTC seminar.)

tags: analytics - Hadoop - presentation - defence