
By: H2O.ai
Last Thursday of the month, April 25, 2013, is here! It’s BigDataWeek.
Join us at our monthly open house and meet the artists and hackers behind H2O.
This time we are hacking the airline dataset!
“Have you ever been stuck in an airport because your flight was delayed or cancelled and wondered if you could have predicted it if you’d had more data? This is your chance to find out.
Datasets: Flight arrival and departure details for all commercial flights within the USA, from October 1987 to April 2008. This is a large dataset: there are nearly 120 million records in total, and it takes up 1.6 gigabytes of space compressed and 12 gigabytes when uncompressed.
http://stat-computing.org/dataexpo/2009/
Also available: Airlines all years 1987-2008:
https://s3.amazonaws.com/h2o-airlines-unpacked/allyears.csv (12GB)”
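If you want a head start before the open house, here is a minimal sketch of one way to take a first pass over a file this size without loading all 12 GB into memory. It streams rows with Python's standard `csv` module and computes the share of late arrivals per carrier. The column names `UniqueCarrier` and `ArrDelay` are assumed from the Data Expo 2009 schema and may not match the header of `allyears.csv` exactly; the function name, the 15-minute threshold, and the tiny in-memory sample are purely illustrative.

```python
import csv
import io
from collections import defaultdict


def delay_rate_by_carrier(lines, threshold_minutes=15):
    """Stream CSV rows and return {carrier: fraction of flights arriving late}.

    `lines` is any iterable of text lines (an open file works), so only one
    row is held in memory at a time.
    """
    totals = defaultdict(int)
    late = defaultdict(int)
    for row in csv.DictReader(lines):
        try:
            minutes = float(row.get("ArrDelay"))
        except (TypeError, ValueError):
            # Skip rows with a missing or non-numeric delay such as "NA".
            continue
        carrier = row["UniqueCarrier"]  # assumed column name
        totals[carrier] += 1
        if minutes > threshold_minutes:
            late[carrier] += 1
    return {carrier: late[carrier] / totals[carrier] for carrier in totals}


# Tiny made-up sample standing in for the real file.
sample = io.StringIO(
    "UniqueCarrier,ArrDelay\n"
    "AA,20\n"
    "AA,-3\n"
    "UA,NA\n"
    "UA,45\n"
)
rates = delay_rate_by_carrier(sample)
print(rates)
```

To try it on the real data, something like `delay_rate_by_carrier(open("allyears.csv", newline=""))` should work once the file is downloaded, assuming the header row uses those column names.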