June 26th, 2014

Learn to manage, munge, and model big data with H2O on the Hortonworks Sandbox

RSS icon RSS Category: Uncategorized [EN]

Working with big data might seem like a daunting task if like me, you’ve spent the majority of your college years doing pencil and paper proofs. Big data for me was anything that took longer than 30 minutes to ingest into single threaded R.
For mathematicians and statisticians looking to understand widely used data platforms like Hadoop for data storage and data management, Hortonworks Sandbox is an awesome all-in-one self-teaching tool. Getting a standalone Hadoop environment on your personal computer is as easy as launching a VM.

To actually start doing predictive analytics, launch H2O on the server either as a simple JVM or a mapper task that’ll utilize all the nodes in the cluster. When it comes time to actually move from a test and research setting to a production one, the same installation and launch holds for however many nodes you add to the cluster.
H2O and Hortonworks Sandbox will turn the uninitiated into data scientists with a gentle sloping learning curve. Both H2O and Hortonworks are open source big data powerhouses that you can learn from and perhaps eventually contribute to.
It’s free to try so get started now with the following tutorial : Predictive Analytics on H2O and Hortonworks Data Platform

For more information about how H2O operates on Hadoop check out : H2O on Hadoop

Leave a Reply

10 Consejos para Convertirte en un Científico de Datos Exitoso

En este mundo que no deja de cambiar y sorprendernos, como científicos de datos debemos

January 19, 2023 - by Favio Vázquez
Explaining models built in H2O-3 — Part 1

Machine Learning explainability refers to understanding and interpreting the decisions and predictions made by a

December 22, 2022 - by Parul Pandey
H2O.ai at NeurIPS 2022

H2O.ai is proud to participate in the 36th Conference on Neural Information Processing Systems (NeurIPS)

December 6, 2022 - by Marcos V. Conde
A Brief Overview of AI Governance for Responsible Machine Learning Systems

Our paper “A Brief Overview of AI Governance for Responsible Machine Learning Systems” was recently

November 30, 2022 - by Navdeep Gill, Abhishek Mathur and Marcos V. Conde
H2O World Dallas Customer Talks

After three long years of not having an #H2OWorld, we finally held our first one

November 24, 2022 - by Vinod Iyengar
New in Wave 0.24.0

Another Wave release has arrived with quite a few exciting new features. Let's quickly go

November 21, 2022 - by Martin Turoci

Request a Demo

Explore how to Make, Operate and Innovate with the H2O AI Cloud today

Learn More