Big Data Science in H2O with R
by H20.ai August 21, 2013 Uncategorized [EN]

Big Data Science with H2O in R, from Anqi Fu. We had a great turnout at our Meetup last night! We took a look at the H2O/R API, then dove right into a hands-on demo, where we imported, cleaned, and ran GLM on the airlines data set in H2O using R commands. Here are […]

Public Data Sets
by H20.ai August 16, 2013 Uncategorized [EN]

For your data analysis pleasure, I give you a giant list of super cool publicly available data. If you’re looking at the data sets and wondering “now what?” – you can find this list AND tutorials on how to use H2O for analysis at the H2O docs page (here: http://docs.0xdata.com). You can also get a detailed […]

TCP Is Not Reliable
by H20.ai August 16, 2013 Uncategorized [EN]

Been too long between blogs… “TCP Is Not Reliable” – what's THAT mean? Means: I can cause TCP to reliably fail in under 5 mins, on at least 2 different modern Linux variants and on modern hardware, both in our datacenter (no hypervisor) and on EC2. What does “fail” mean? Means the client will open […]

Run H2O From Within R
by H20.ai August 13, 2013 Uncategorized [EN]

With the REST API, it’s simple to run H2O operations from within R using similar syntax to all your favorite R functions. In this post, we’ll walk through a simple demo of its capabilities. First, get H2O installed and running by following the tutorial here. Once you have the R package loaded, you can take […]

Use R to run Better Algorithms on Big Data
by H20.ai August 12, 2013 Uncategorized [EN]

Our resident R users will demonstrate how to use the R package and invoke big data modeling entirely from R. In this session, our resident R & Math hacker, Anqi Fu, will demonstrate the R API for H2O. Early users, community and customers of H2O have been invoking GLM, Random Forest and K-means from an […]

Random Forest Measurements for the MNIST Dataset
by H20.ai August 8, 2013 Uncategorized [EN]

This post discusses the performance of H2O’s Random Forest [5] algorithm. We compare different versions of H2O as well as the RF implementation by wise.io. We use wall-clock time to measure workflows that match the user experience. A link to the scripts used is available here [1].

We the people: Our meetup member introductions
by H20.ai August 5, 2013 Uncategorized [EN]

You may have noticed that we have a ton of stuff going on at 0xdata, including several upcoming meetups that I expect will be very well attended. I was feeling a little curious about who exactly would be attending. What are the common areas of interest? Are our members mostly software people or data scientists? […]
