Tom White

Hadoop expert, author.

Areas of Expertise:

  • Apache Hadoop
  • distributed computing
  • big data
  • programming
  • writing
Tom White has been an Apache Hadoop committer since February 2007 and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly and IBM's developerWorks, and has spoken about Hadoop at several conferences, including ApacheCon 2008. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.

Hadoop: The Definitive Guide
by Tom White
Third Edition May 2012
Print: $49.99
Ebook: $39.99

Hadoop: The Definitive Guide
by Tom White
Second Edition October 2010
Ebook: $39.99

Hadoop: The Definitive Guide
by Tom White
June 2009
Ebook: $35.99

Tom blogs at:


December 16 2012

IFTTT, pronounced "ift" and short for "if this then that", is a great service for wiring bits of the internet together. The idea is that you create rules for performing actions based on triggers: if this [trigger] occurs, then perform that [action]. There are lots of triggers and actions, provided by channels.…


December 09 2012

[I wrote this in July, but never got round to posting it.] Last weekend I visited the U.S. Capitol in Washington, D.C., with my family, and I learned that the House of Representatives has 435 seats, which are apportioned so that each state has a number of seats that is proportional…


December 03 2012

I wrote a visualization of the populations of the largest cities in the US over the years. I got the idea when I was in Detroit in the summer, and read about the huge decline in Detroit's city population since the 1950s when the automotive industry was at its…

Webcast: The State of Hadoop
September 15, 2010
Duration: Approximately 60 minutes. Cost: Free.
Apache Hadoop is a part of a growing ecosystem of projects for large-scale data analysis which is being used to solve problems for organizations in a wide range of disciplines. This talk will touch on...

Webcast: An Introduction to Hadoop
July 16, 2009
Duration: Approximately 60 minutes. Cost: Free.
In this webcast, Cloudera founder Christophe Bisciglia and O'Reilly author Tom White will provide an introduction to Hadoop/MapReduce, the open source project that allows organizations to process, store...
