August 28th, 2013

Hands on Workshop: Hack Data With Math

RSS icon RSS Category: Uncategorized
Fallback Featured Image

Thursday night (August 29) at 7, resident math hacker Spencer A. is leading a hands on workshop on using H2O to analyze real-world data. For those of you who are new to the math side of H2O, we have notes below to help you get prepared.

H2O is a distributed math platform featuring a set of analytical tools that can be accessed through an HTML based UI or through R. It’s built to handle really big data sets by analyzing subsets of big data in parallel. If you would like to read more about our performance, I highly recommend looking at Spence’s blog post on performance: http://0xdata.com/blog/2013/08/rf_on_mnist/
You can access instructions and resources for running H2O from your laptop here: http://docs.0xdata.com/quickstart/quickstart_jar.html, and you can download the .jar file here: http://0xdata.com/h2O/. There are tutorials on the documents page as well, if you are inclined to take a deeper look at some of the algorithms we will be working with.
Specifically, Spence will be walking through the famous MNIST data set and Allstate data similar to that presented in a recent Kaggle competition. If you would like to download those data sets before you get here, you can find all of the information you need at: http://www.meetup.com/H2Omeetup/events/124513412/
For the MNIST data, we will walk through the data to develop a better intuitive understand of the Random Forest (RF) algorithm. Generalized linear modeling (GLM) and it’s parallelization will be discussed through a hands on analysis of the Allstate data. By the end of the workshop you will be prepared to analyze and interpret data of your own.

Leave a Reply

+
Developing and Retaining Data Science Talent

It’s been almost a decade since the Harvard Business Review proclaimed that “Data Scientist” is

May 12, 2022 - by Jon Farland
+
The H2O.ai Wildfire Challenge Winners Blog Series – Team Too Hot Encoder

Note: this is a community blog post by Team Too Hot Encoder - one of

May 10, 2022 - by H2O.ai Team
+
The H2O.ai Wildfire Challenge Winners Blog Series – Team HTB

Note: this is a community blog post by Team HTB - one of the H2O.ai

May 10, 2022 - by H2O.ai Team
+
Bias and Debiasing

An important aspect of practicing machine learning in a responsible manner is understanding how models

April 15, 2022 - by Kim Montgomery
+
Comprehensive Guide to Image Classification using H2O Hydrogen Torch

In this article, we will learn how to build state-of-the-art models in computer vision and

March 29, 2022 - by H2O.ai Team
+
H2O Wave Snippet Plugin for PyCharm

Note: this blog post by Shamil Dilshan Prematunga was first published on Medium. What is PyCham? PyCharm

March 24, 2022 - by Shamil Prematunga

Start Your Free Trial