H2O Driverless AI: The Workbench for Data Science
September 26, 2019 Community Data Science H2O Driverless AI Technical TutorialsThis blog was written by Rohan Gupta and originally published here. 1. Introduction In today’s world, being a Data Scientist is not limited to those without technical knowledge. While it is recommended and sometimes important to know a little bit of code, you can get by with just intuitive knowledge. Especially if you’re on H2O’s […]
Regression Metrics’ Guide
September 9, 2019 H2O Driverless AI Machine Learning Technical TutorialsIntroduction As part of my role within the automated machine learning space with H2O.AI and Driverless AI, I have seen that many times people struggle to find the right optimization metric for their data science problems. This process is even more challenging in regression problems where the errors are often not bounded like you normally have with probabilistic modeling. […]
Driverless AI can help you choose what you consume next
September 6, 2019 H2O Driverless AI Machine Learning Recipes Recommendations Technical TutorialsLast updated: 09/06/19 Steve Jobs once said, “A lot of times, people don’t know what they want until you show it to them’. This makes sense, especially in this era of constant choice overload. Consumers today have access to a plethora of products just at the click of their mouse. These innumerable choices can sometimes […]
Detecting Sarcasm is difficult, but AI may have an answer
August 5, 2019 H2O Driverless AI NLP Recipes Technical TutorialsRecently, while shopping for a laptop bag, I stumbled upon a pretty amusing customer review: “This is the best laptop bag ever. It is so good that within two months of use, it is worthy of being used as a grocery bag.” The innate sarcasm in the review is evident as the user isn’t happy […]
Building AI/ML models on Lending Club Data, with H2O.ai — Part 1
March 28, 2019 Beginners Community Data Journalism Data Science Technical Posts TutorialsLending Club publishes its basic loan databases to the public and a full version to its customers — anonymized of course. You can find the download page from this link (screenshot below): The publicly downloadable loan data has various attributes — roughly 150+ columns that have categorical, numeric, text and date fields. It also has a ‘loan_status’ text column […]
Finally, You Can Plot H2O Decision Trees in R
January 15, 2019 Data Science Machine Learning R Technical Technical Posts TutorialsCreating and plotting decision trees (like one below) for the models created in H2O will be the main objective of this post: Figure 1. Decision Tree Visualization in R Decision Trees with H2O With release 3.22.0.1 H2O-3 (a.k.a. open source H2O or simply H2O) added to its family of tree-based algorithms (which already included DRF, […]
H2O’s AutoML in Spark
July 23, 2018 AutoML Sparkling Water Technical TutorialsThis blog post demonstrates how H2O’s powerful automatic machine learning can be used together with the Spark in Sparkling Water. We show the benefits of Spark & H2O integration, use Spark for data munging tasks and H2O for the modelling phase, where all these steps are wrapped inside a Spark Pipeline. The integration between Spark […]
From Kaggle Grand Masters’ Recipes to Production Ready in a Few Clicks
May 9, 2018 H2O Driverless AI TutorialsIntroducing Accelerated Automatic Pipelines in H2O Driverless AI At H2O, we work really hard to make machine learning fast, accurate, and accessible to everyone. With H2O Driverless AI, users can leverage years of world-class, Kaggle Grand Masters experience and our GPU-accelerated algorithms (H2O4GPU) to produce top quality predictive models in a fully automatic and timely […]
Use H2O.ai on Azure HDInsight
April 18, 2017 Cloud Sparkling Water Technical TutorialsThis is a repost from this article on MSDN. We’re hosting an upcoming webinar to present you how to use H2O on HDInsight and to answer your questions. Sign up for our upcoming webinar on combining H2O and Azure HDInsight. We recently announced that H2O and Microsoft Azure HDInsight have integrated to provide Data Scientists […]
Indexing 1 Billion Time Series with H2O and ISax
November 11, 2016 Technical Tutorials Use CasesAt H2O, we have recently debuted a new feature called ISax that works on time series data in an H2O Dataframe. ISax stands for Indexable Symbolic Aggregate ApproXimation, which means it can represent complex time series patterns using a symbolic notation and thereby reducing the dimensionality of your data. From there you can run H2O’s […]