H2O Driverless AI: The Workbench for Data Science
by Bruna Smith September 26, 2019 Community Data Science H2O Driverless AI Technical Tutorials

This blog was written by Rohan Gupta and originally published here. 1. Introduction In today’s world, being a Data Scientist is not limited to those with technical knowledge. While it is recommended and sometimes important to know a little bit of code, you can get by with just intuitive knowledge. Especially if you’re on H2O’s […]

Regression Metrics’ Guide
by Bruna Smith September 9, 2019 H2O Driverless AI Machine Learning Technical Tutorials

Introduction As part of my role within the automated machine learning space with H2O.AI and Driverless AI, I have seen that many times people struggle to find the right optimization metric for their data science problems. This process is even more challenging in regression problems where the errors are often not bounded like you normally have with probabilistic modeling. […]

Driverless AI can help you choose what you consume next
by Bruna Smith September 6, 2019 H2O Driverless AI Machine Learning Recipes Recommendations Technical Tutorials

Last updated: 09/06/19 Steve Jobs once said, “A lot of times, people don’t know what they want until you show it to them.” This makes sense, especially in this era of constant choice overload. Consumers today have access to a plethora of products just at the click of their mouse. These innumerable choices can sometimes […]

Detecting Sarcasm is difficult, but AI may have an answer
by Bruna Smith August 5, 2019 H2O Driverless AI NLP Recipes Technical Tutorials

Recently, while shopping for a laptop bag, I stumbled upon a pretty amusing customer review: “This is the best laptop bag ever. It is so good that within two months of use, it is worthy of being used as a grocery bag.” The innate sarcasm in the review is evident as the user isn’t happy […]

Building AI/ML models on Lending Club Data, with — Part 1
by h2oai March 28, 2019 Beginners Community Data Journalism Data Science Technical Posts Tutorials

Lending Club publishes its basic loan databases to the public and a full version to its customers — anonymized of course. You can find the download page from this link (screenshot below): The publicly downloadable loan data has various attributes — roughly 150+ columns that have categorical, numeric, text and date fields. It also has a ‘loan_status’ text column […]

Finally, You Can Plot H2O Decision Trees in R
by h2oai January 15, 2019 Data Science Machine Learning R Technical Technical Posts Tutorials

Creating and plotting decision trees (like one below) for the models created in H2O will be the main objective of this post: Figure 1. Decision Tree Visualization in R Decision Trees with H2O With release H2O-3 (a.k.a. open source H2O or simply H2O) added to its family of tree-based algorithms (which already included DRF, […]

H2O’s AutoML in Spark
by Jakub Hava July 23, 2018 AutoML Sparkling Water Technical Tutorials

This blog post demonstrates how H2O’s powerful automatic machine learning can be used together with Spark in Sparkling Water. We show the benefits of Spark & H2O integration, use Spark for data munging tasks and H2O for the modelling phase, where all these steps are wrapped inside a Spark Pipeline. The integration between Spark […]

From Kaggle Grand Masters’ Recipes to Production Ready in a Few Clicks
by Jo-fai Chow May 9, 2018 H2O Driverless AI Tutorials

Introducing Accelerated Automatic Pipelines in H2O Driverless AI At H2O, we work really hard to make machine learning fast, accurate, and accessible to everyone. With H2O Driverless AI, users can leverage years of world-class Kaggle Grand Masters’ experience and our GPU-accelerated algorithms (H2O4GPU) to produce top quality predictive models in a fully automatic and timely […]

Use on Azure HDInsight
by h2oai April 18, 2017 Cloud Sparkling Water Technical Tutorials

This is a repost from this article on MSDN. We’re hosting an upcoming webinar to show you how to use H2O on HDInsight and to answer your questions. Sign up for our upcoming webinar on combining H2O and Azure HDInsight. We recently announced that H2O and Microsoft Azure HDInsight have integrated to provide Data Scientists […]

Indexing 1 Billion Time Series with H2O and ISax
by h2oai November 11, 2016 Technical Tutorials Use Cases

At H2O, we have recently debuted a new feature called ISax that works on time series data in an H2O Dataframe. ISax stands for Indexable Symbolic Aggregate ApproXimation, which means it can represent complex time series patterns using a symbolic notation, thereby reducing the dimensionality of your data. From there you can run H2O’s […]
