June 26th, 2014

Learn to manage, munge, and model big data with H2O on the Hortonworks Sandbox


Working with big data might seem like a daunting task if, like me, you’ve spent the majority of your college years doing pencil-and-paper proofs. Big data for me was anything that took longer than 30 minutes to ingest into single-threaded R.
For mathematicians and statisticians looking to understand widely used data platforms like Hadoop for data storage and data management, Hortonworks Sandbox is an awesome all-in-one self-teaching tool. Getting a standalone Hadoop environment on your personal computer is as easy as launching a VM.

To start doing predictive analytics, launch H2O on the server either as a simple standalone JVM or as a mapper task that uses all the nodes in the cluster. When it comes time to move from a test and research setting to a production one, the same installation and launch steps hold for however many nodes you add to the cluster.
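
As a rough illustration of those two launch modes, here is a minimal Python sketch that only builds the command lines and does not run them. The jar names (`h2o.jar`, `h2odriver.jar`), the heap sizes, and the output directory name are assumptions for illustration; the exact driver jar depends on your H2O release and Hadoop distribution, so check the tutorial linked below for the ones that match your Sandbox.

```python
import shlex


def standalone_command(jar="h2o.jar", heap="1g"):
    """Command for starting H2O as a single plain JVM.

    The jar name and heap size are assumed defaults, not values from the post.
    """
    return ["java", f"-Xmx{heap}", "-jar", jar]


def hadoop_command(nodes, mapper_heap="1g", output_dir="h2o_output",
                   driver_jar="h2odriver.jar"):
    """Command for starting H2O as mapper tasks on a Hadoop cluster.

    `nodes` is the number of H2O nodes (mappers) to request. The driver jar
    name and output directory are hypothetical placeholders.
    """
    return [
        "hadoop", "jar", driver_jar,
        "-nodes", str(nodes),
        "-mapperXmx", mapper_heap,
        "-output", output_dir,
    ]


if __name__ == "__main__":
    # Single JVM on the Sandbox VM.
    print(shlex.join(standalone_command()))
    # One-node Hadoop launch for testing, then the same call scaled out.
    print(shlex.join(hadoop_command(nodes=1)))
    print(shlex.join(hadoop_command(nodes=4)))
```

Only the `nodes` argument changes between the one-node and four-node calls, which is the point of the paragraph above: the launch looks the same as the cluster grows.
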
Together, H2O and Hortonworks Sandbox give the uninitiated a gently sloping learning curve toward becoming data scientists. Both H2O and Hortonworks are open source big data powerhouses that you can learn from and perhaps eventually contribute to.
It’s free to try, so get started now with the following tutorial: Predictive Analytics on H2O and Hortonworks Data Platform

For more information about how H2O operates on Hadoop, check out: H2O on Hadoop
