
By: Venkatesh Yadav
TensorFlow on AWS GPU instance
In this tutorial, we show how to set up TensorFlow on an AWS GPU instance and run the H2O TensorFlow deep learning demo.
Pre-requisites:
To get started, request an AWS EC2 instance with GPU support. We used a single g2.2xlarge instance running Ubuntu 14.04. To set up TensorFlow with GPU support, the following software should be installed:
- Java 1.8
- Python pip
- Unzip utility
- CUDA Toolkit (>= v7.0)
- cuDNN (v4.0)
- Bazel (>= v0.2)
- TensorFlow (v0.9)
To run the H2O TensorFlow deep learning demo, the following software should be installed:
- IPython notebook
- Scala
- Spark
- Sparkling water
Software Installation:
Java:
#To install Java, run the commands below. Type 'Y' at the installation prompt:
sudo add-apt-repository ppa:webupd8team/java
sudo apt-get update
sudo apt-get install oracle-java8-installer
#Update JAVA_HOME in ~/.bashrc
#Add JAVA_HOME to PATH:
export PATH=$PATH:$JAVA_HOME/bin
# Execute following command to update current session:
source ~/.bashrc
#Verify version and path:
java -version
echo $JAVA_HOME
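The JAVA_HOME update above can be sketched as the following lines appended to ~/.bashrc; the path is the default used by the oracle-java8-installer package and may differ on other systems:

```shell
# Append to ~/.bashrc. /usr/lib/jvm/java-8-oracle is the default
# install location of the oracle-java8-installer package.
export JAVA_HOME=/usr/lib/jvm/java-8-oracle
export PATH="$PATH:$JAVA_HOME/bin"
# Confirm the variable is set in the current session:
echo "JAVA_HOME=$JAVA_HOME"
```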
Python:
#The Ubuntu AMI on an AWS EC2 instance has Python installed by default. Verify that Python 2.7 is available:
python -V
#Install pip
sudo apt-get install python-pip
#Install IPython notebook
sudo pip install "ipython[notebook]"
#To run the H2O example notebooks, install these additional packages:
sudo pip install requests
sudo pip install tabulate
Unzip utility:
#Execute the following command to install unzip:
sudo apt-get install unzip
Scala:
#Follow the steps below. Type 'Y' at the installation prompt:
sudo apt-get install scala
#Update SCALA_HOME in ~/.bashrc and execute the following command to update the current session:
source ~/.bashrc
#Verify version and path:
scala -version
echo $SCALA_HOME
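The SCALA_HOME update can be sketched similarly; /usr/share/java is where the Ubuntu scala package keeps its jars, so adjust the path if you installed Scala elsewhere:

```shell
# Append to ~/.bashrc. /usr/share/java holds the jars installed by
# the Ubuntu scala package; other installs may use a different path.
export SCALA_HOME=/usr/share/java
echo "SCALA_HOME=$SCALA_HOME"
```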
Spark:
#Java and Scala should be installed before installing Spark.
#Get the Spark 1.6.1 binary pre-built for Hadoop 2.6:
wget http://apache.cs.utah.edu/spark/spark-1.6.1/spark-1.6.1-bin-hadoop2.6.tgz
#Extract the file:
tar xvzf spark-1.6.1-bin-hadoop2.6.tgz
#Update SPARK_HOME in ~/.bashrc and execute the following command to update the current session:
source ~/.bashrc
#Add SPARK_HOME to PATH:
export PATH=$PATH:$SPARK_HOME/bin:$SPARK_HOME/sbin
#Verify the variable:
echo $SPARK_HOME
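The SPARK_HOME update can be sketched as follows, assuming the tarball was extracted in the ubuntu user's home directory:

```shell
# Append to ~/.bashrc. The path assumes the tarball was extracted
# under /home/ubuntu.
export SPARK_HOME=/home/ubuntu/spark-1.6.1-bin-hadoop2.6
export PATH="$PATH:$SPARK_HOME/bin:$SPARK_HOME/sbin"
echo "SPARK_HOME=$SPARK_HOME"
```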
Sparkling Water:
#The latest Spark pre-built for Hadoop should be installed, with SPARK_HOME pointing to the installation:
export SPARK_HOME="/path/to/spark/installation"
#To launch a local Spark cluster with 3 worker nodes, each with 2 cores and 1024 MB of memory, export the MASTER variable:
export MASTER="local-cluster[3,2,1024]"
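The local-cluster master string packs three numbers: worker count, cores per worker, and memory per worker in MB. A quick sketch:

```shell
# local-cluster[N,C,M]: N workers with C cores and M MB of memory each.
# 3 workers x 2 cores x 1024 MB:
export MASTER="local-cluster[3,2,1024]"
echo "$MASTER"
```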
#Download and run Sparkling Water
wget http://h2o-release.s3.amazonaws.com/sparkling-water/rel-1.6/5/sparkling-water-1.6.5.zip
unzip sparkling-water-1.6.5.zip
cd sparkling-water-1.6.5
bin/sparkling-shell --conf "spark.executor.memory=1g"
CUDA Toolkit:
#In order to build or run TensorFlow with GPU support, both NVIDIA's CUDA Toolkit (>= 7.0) and cuDNN (>= v2) need to be installed.
#To install CUDA toolkit, run:
wget http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1410/x86_64/cuda-repo-ubuntu1410_7.0-28_amd64.deb
sudo dpkg -i cuda-repo-ubuntu1410_7.0-28_amd64.deb
sudo apt-get update
sudo apt-get install cuda
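Once the packages are installed, a quick check confirms the compiler is reachable; this sketch assumes the default install prefix /usr/local/cuda and is harmless to run on a machine without CUDA:

```shell
# Print the CUDA compiler version if nvcc is on PATH, otherwise a hint.
if command -v nvcc >/dev/null 2>&1; then
  CUDA_STATUS=$(nvcc --version)
else
  CUDA_STATUS="nvcc not found; add /usr/local/cuda/bin to PATH"
fi
echo "$CUDA_STATUS"
```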
cuDNN:
#To install cuDNN, download cudnn-7.0-linux-x64-v4.0-prod.tgz from NVIDIA (free registration and a short questionnaire are required).
#Then transfer it to your EC2 instance's home directory.
tar -zxf cudnn-7.0-linux-x64-v4.0-prod.tgz &&
rm cudnn-7.0-linux-x64-v4.0-prod.tgz
sudo cp cuda/lib64/libcudnn* /usr/local/cuda/lib64
sudo cp cuda/include/cudnn.h /usr/local/cuda/include
#Reboot the system
sudo reboot
#Update environment variables as shown below:
export CUDA_HOME=/usr/local/cuda
export CUDA_ROOT=/usr/local/cuda
export PATH=$PATH:$CUDA_ROOT/bin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$CUDA_ROOT/lib64
Bazel:
#To install Bazel (>= v0.2), run:
sudo apt-get install pkg-config zip g++ zlib1g-dev
wget https://github.com/bazelbuild/bazel/releases/download/0.3.0/bazel-0.3.0-installer-linux-x86_64.sh
chmod +x bazel-0.3.0-installer-linux-x86_64.sh
./bazel-0.3.0-installer-linux-x86_64.sh --user
TensorFlow:
#Download and install TensorFlow:
wget https://storage.googleapis.com/tensorflow/linux/gpu/tensorflow-0.9.0rc0-cp27-none-linux_x86_64.whl
sudo pip install --upgrade tensorflow-0.9.0rc0-cp27-none-linux_x86_64.whl
#To build TF from source, first clone the TensorFlow repository and check out the release branch, then configure with GPU support enabled:
git clone -b r0.9 https://github.com/tensorflow/tensorflow
cd tensorflow
./configure
#To build TensorFlow, run:
bazel build -c opt --config=cuda //tensorflow/cc:tutorials_example_trainer
bazel build -c opt --config=cuda //tensorflow/tools/pip_package:build_pip_package
bazel-bin/tensorflow/tools/pip_package/build_pip_package /tmp/tensorflow_pkg
#Install the wheel produced in /tmp/tensorflow_pkg (the exact filename depends on the version you checked out):
sudo pip install --upgrade /tmp/tensorflow_pkg/tensorflow-0.8.0-py2-none-any.whl
Run the H2O TensorFlow deep learning demo:
#Since we want to open the IPython notebook remotely, we use the IP and port options. To start the notebook:
cd sparkling-water-1.6.5/
IPYTHON_OPTS="notebook --no-browser --ip='*' --port=54321" bin/pysparkling
#Note that the port specified in the above command must be open in the instance's security group.
Open http://PublicIP:54321 in a browser to reach the IPython notebook console.
Click on TensorFlowDeepLearning.ipynb
Refer to the demo video for details.
#Sample .bashrc contents:
export JAVA_HOME=/usr/lib/jvm/java-8-oracle
export SCALA_HOME=/usr/share/java
export SPARK_HOME=/home/ubuntu/spark-1.6.1-bin-hadoop2.6
export MASTER="local-cluster[3,2,1024]"
export PATH=$PATH:$JAVA_HOME/bin:$SPARK_HOME/bin:$SPARK_HOME/sbin
export CUDA_HOME=/usr/local/cuda
export CUDA_ROOT=/usr/local/cuda
export PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/jvm/java-8-oracle/bin:/home/ubuntu/spark-1.6.1-bin-hadoop2.6/bin:/home/ubuntu/spark-1.6.1-bin-hadoop2.6/sbin:/usr/local/cuda/bin:/home/ubuntu/bin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda/lib64
Troubleshooting:
1) ERROR: Getting java.net.UnknownHostException while starting spark-shell
Solution:
Make sure /etc/hosts has an entry for the hostname.
E.g.: 127.0.0.1 hostname
2) ERROR: Getting "Could not find .egg-info directory in install record" error during IPython installation
Solution:
sudo pip install --upgrade setuptools pip
3) ERROR: Can’t find swig while configuring TF
Solution:
sudo apt-get install swig
4) ERROR: “Ignoring gpu device (device: 0, name: GRID K520, pci bus id: 0000:00:03.0) with Cuda compute capability 3.0. The minimum required Cuda capability is 3.5”
Solution:
Specify 3.0 when ./configure prompts for the list of Cuda compute capabilities.
Please note that each additional compute capability significantly increases your build time and binary size.
5) ERROR: Could not insert ‘nvidia_352’: Unknown symbol in module, or unknown parameter (see dmesg)
Solution:
sudo apt-get install linux-image-extra-virtual
6) ERROR: Cannot find ./util/python/python_include
Solution:
sudo apt-get install python-dev
7) Find the public IP address of the instance
Solution:
curl http://169.254.169.254/latest/meta-data/public-ipv4
Demo Videos