October 10th, 2018

What does NVIDIA’s RAPIDS platform mean for the Data Science community?

Category: Community, Data Science, GPU, H2O Driverless AI, H2O4GPU, Machine Learning, XGBoost

Today NVIDIA announced the launch of RAPIDS, a suite of software libraries that enables GPU acceleration for data science workflows. We’re excited to partner with NVIDIA to bring GPU-accelerated open source technology to the machine learning and AI community.

“Machine learning is transforming businesses and NVIDIA GPUs are speeding them up. With the support of the open source communities and customers, H2O.ai made machine learning on GPUs mainstream and won recognition as a leader in data science and machine learning platforms by Gartner. NVIDIA’s support of the GPU machine learning community with RAPIDS, its open-source data science libraries, is a timely effort to grow the GPU data science ecosystem and an endorsement of our common mission to bring AI to the data center. Thanks to our partnership, H2O Driverless AI powered by NVIDIA GPUs has been on an exponential adoption curve — making AI faster, cheaper and easier.” – Sri Ambati, CEO and Founder, H2O.ai

Let’s look at the announcement in a bit more detail. The new software stack sets out to accelerate the entire data science and analytics workflow by focusing on three building blocks:

• DataFrame – cuDF – This is a dataframe-manipulation library based on Apache Arrow that accelerates loading, filtering, and manipulation of data for model training data preparation. The Python bindings of the core-accelerated CUDA DataFrame manipulation primitives mirror the Pandas interface for seamless onboarding of Pandas users.

• Machine Learning Libraries – cuML – This collection of GPU-accelerated machine learning libraries will eventually provide GPU versions of all machine learning algorithms available in Scikit-Learn.

• Graph Analytics Libraries – cuGraph – This is a framework and collection of graph analytics libraries.
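To make the "mirrors the Pandas interface" point concrete, here is a minimal sketch, assuming cuDF accepts the same basic calls as Pandas for construction, boolean filtering, and group-by aggregation. The data and the function name `mean_fare_by_city` are hypothetical and exist only for illustration. The script uses cuDF when it is installed and usable, and otherwise falls back to Pandas.

```python
# Prefer cuDF when it is installed and usable; otherwise fall back to Pandas.
# The broad except is deliberate: importing cuDF can fail for reasons other
# than a missing package (for example, no usable GPU on the machine).
try:
    import cudf as xdf
except Exception:
    import pandas as xdf


def mean_fare_by_city(frame_lib):
    """Filter and aggregate a toy table using only Pandas-style calls."""
    # Hypothetical toy data, not taken from the announcement.
    df = frame_lib.DataFrame({
        "city": ["SF", "NY", "SF", "NY", "LA"],
        "fare": [10.0, 20.0, 30.0, 40.0, 5.0],
    })
    kept = df[df["fare"] >= 10.0]                  # boolean-mask filtering
    result = kept.groupby("city")["fare"].mean()   # group-by aggregation
    # A cuDF result lives on the GPU; copy it to the host before converting.
    if hasattr(result, "to_pandas"):
        result = result.to_pandas()
    return result.to_dict()


summary = mean_fare_by_city(xdf)
print(summary)
```

The function body never names a specific library, which is the practical meaning of "seamless onboarding": in simple cases like this one, switching backends comes down to the import line.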

Many of the other packages in the architecture diagram have been available for a while, but this announcement brings them together with a promise of integration and ease of installation and use. cuDNN and cuGraph (previously called nvGRAPH) in particular are very popular and are used by many developers. NVIDIA’s linear algebra and math libraries, which include cuBLAS, the CUDA Math Library, and others, serve as building blocks for many frameworks, including ours here at H2O.ai.

H2O.ai is committed to accelerating automatic machine learning on NVIDIA GPUs. Nearly a year ago, H2O.ai demonstrated, for the first time in the industry, that statistical machine learning algorithms can be accelerated with GPUs through our H2O4GPU package (GitHub). This now powers our pathbreaking commercial offering, Driverless AI, which brings automatic machine learning to the enterprise. As a major contributor to XGBoost GPU and a leader in AI and ML, we are pleased to see the development of RAPIDS, and we hope to see more open source development for GPU-accelerated machine learning.

The new announcements around cuDF and cuML are the successors to the GOAI project, of which H2O.ai was a founding member. As the open source leader in AI and ML, we love that NVIDIA is contributing new technology for the AI community. There are two key developments here. The first is the adoption of Apache Arrow as the standard data structure across all the different libraries, which allows for easy integration with the growing ecosystem that now supports Arrow. The second is the Python bindings for cuDF, which mimic the Pandas interface. This can potentially accelerate data munging and transformation by an order of magnitude.
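Why a shared data structure eases integration can be sketched with the Python standard library alone. The example below does not use the Arrow API; it only illustrates the underlying idea, on the assumption that the benefit comes from a columnar layout in which each field is one contiguous typed buffer that a second library can read without copying. The column names and the helper `column_total` are hypothetical.

```python
from array import array

# Row-oriented records: one Python tuple per row.
rows = [(1, 0.5), (2, 1.5), (3, 2.5)]

# Column-oriented layout: each field becomes one contiguous typed buffer.
columns = {
    "id": array("q", [r[0] for r in rows]),      # signed 64-bit integers
    "score": array("d", [r[1] for r in rows]),   # 64-bit floats
}


def column_total(buf):
    """Stand-in for a second library that consumes a shared column."""
    return sum(buf)


# A memoryview exposes the same bytes to another consumer without copying.
score_view = memoryview(columns["score"])
total_before = column_total(score_view)

# A write through the original column is visible through the view,
# which shows that both names refer to the same memory.
columns["score"][0] = 9.5
total_after = column_total(score_view)

print(total_before, total_after, score_view.nbytes)
```

When every library agrees on the buffer layout, handing a column from a dataframe library to a machine learning library is a matter of passing a reference, with no conversion step in between.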

We are pleased to see NVIDIA embrace data science and machine learning, which validates the core mission and vision that we’ve been driving for 7 years. We believe that machine learning will be the key part of any company’s AI strategy and transformation. We look forward to contributing to the RAPIDS project with our best-of-breed open source algorithms and to using the underlying libraries in our Driverless AI enterprise platform.

About the Author

Vinod Iyengar, VP of Products

Vinod is VP of Products at H2O.ai. He leads all product marketing efforts, new product development and integrations with partners. Vinod comes with over 10 years of Marketing & Data Science experience in multiple startups. He was the founding employee for his previous startup, Activehours (Earnin), where he helped build the product and bootstrap the user acquisition with growth hacking. He has worked to grow the user base for his companies from almost nothing to millions of customers. He’s built models to score leads, reduce churn, increase conversion, prevent fraud and many more use cases. He brings a strong analytical side and a metrics driven approach to marketing. When he is not busy hacking, Vinod loves painting and reading. He is a huge foodie and will eat anything that doesn’t crawl, swim or move.
