Laying a Strong Foundation for Data Science Work

Return to page

Platform
Generative AI

Why H2O.ai
End-to-end GenAI platform built for air-gapped, on-premises or cloud VPC deployments. Own every part of the stack--own your data and your prompts.

Enterprise h2oGPTe
AI for documents & data: connect any LLM/embedding models, fully scalable w/K8s, includes guardrails, summarization, cost controls, and customization options.

Danube-1.8B
Introducing the new state-of-the-art open model
GenAI App Store
Develop, deploy and share safe and trusted applications for your organization

Gov GenAI App Store
See the power of GenAI’s potential with public sector use cases

h2oGPT and H2O LLM
Create private, offline chatbot applications with open source H2O LLM Studio

Platform
H2O AI Cloud. State-of-the-Art AI Cloud Platform
Predictive AI

H2O Driverless AI
Democratizing AI with Automated Machine Learning

H2O-3
Open Source Distributed Machine Learning

H2O Document AI
Extracting Data with Intelligence

H2O Hydrogen Torch
No-Code Deep Learning

H2O Wave
Open source low-code AI AppDev Framework
H2O Label Genie
AI-powered Data Labeling

H2O AI Feature Store
Infuse Your Data with Intelligence

H2O MLOps
Model Hosting, Monitoring and Deployment

H2O AI AppStore
Industry and Use Case AI Apps
Solutions
Industry Solutions

Financial Services

Government

Health

Insurance

Manufacturing

Marketing

Retail

Telecommunications
Use Cases

Financial Services
From Credit Scoring and Customer Churn to Anti-Money Laundering

Government
Use Responsible AI in Government

Health
From Clinical Workflow to Predicting ICU Transfers

Insurance
From Claims Management to Fraud Mitigation

Manufacturing
From Predictive Maintenance to Transportation Optimization

Marketing
From Content Personalization to Lead Scoring

Retail
From Assortment Optimization to Pricing Optimization

Telecommunications
From Predictive Customer Support to Predictive Fleet Maintenance
View All
- H2O.ai Hospital Occupancy Simulator
  
  Track, predict, and manage COVID-19 related hospital admissions
- Strategic Transformation
  
  Use the H2O AI Cloud to make your company an AI company
Customers

View All Case Studies

FINANCIAL SERVICES

Learn how CBA is boosting AI capabilities to generate better customer and community outcomes, at greater pace and scale.

TELECOM

Learn how AT&T is transforming into an AI Company with H2O.ai

HEALTHCARE

Learn how USCF Health is applying H2O Document AI to automate workflows in healthcare

ENERGY

Learn how AES is transforming its energy business with AI and H2O.ai

FINANCIAL INDUSTRIES

Learn now IFFCO-Tokio uses the H2O AI Cloud to save over $1M annually by transforming their fraud prediction processes

MARKETING

Learn how Epsilon is increasing its customers' marketing ROI with H2O.ai
Partners
Partners

Find a Partner

Become a Partner

Powered by H2O.ai

Partner University
Resources
Resources

H2O University

Documentation

Resources Archive

Wiki

Customer Support Portal

What is an AI Cloud?

Research Papers

Blog
Open Source

Downloads

h2oGPT and H2O LLM

H2O-3

H2O AutoML

H2O Wave

Sparkling Water
- Join H2O University
  
  Gain expertise through engaging courses and earn certifications to thrive on your AI journey.
- Support
  
  Get help and technology from the experts in H2O and access to Enterprise Team
Events
Events

Events

Webinar
H2O GenAI World

Make with H2O
- H2O.ai Wiki
  
  Read the H2O.ai wiki for up-to-date resources about artificial intelligence and machine learning.
- Responsible AI
  
  Learn the best practices for building responsible AI models and applications
Company
Company

About Us

Team

Democratize AI

Why GenAI With H2O.ai?

AI4Conservation

AI4Good

Careers

Contact Us
News

Press Releases

Awards
- What is an AI Cloud?
  
  A high-scale elastic environment for the AI lifecycle
- 2023 Gartner® Magic Quadrant™
  
  H2O.ai is recognized as a Visionary in 2023 Gartner® Magic Quadrant™ for Cloud AI Developer Services

Request Live Demo

By William Merchan, CSO, DataScience.com

In the past few years, data science has become the cornerstone of enterprise companies’ efforts to understand how to deliver better customer experiences. Even so, when DataScience.com commissioned Forrester to survey over 200 data-driven businesses last year, only 22% reported they were leveraging big data well enough to get ahead of their competition.
That’s because there’s a big difference between building predictive models and putting them into production effectively. Data science teams need the support of IT from the very beginning to ensure that issues with large-scale data management, governance, and access don’t stand in the way of operationalizing key insights about your customers. However, many enterprise companies are still treating IT involvement as an afterthought, which ultimately delays the timeline for seeing value from their data science efforts.
There are many ways that better IT management can help scale the impact of data science at your organization. Three best practices include using containers for data science environments, managing compute resources effectively, and putting work into production faster with the help of tools. Here’s how it’s done.
1. Using software containers is one of the most impactful steps you can take to implement IT management best practices . These standardized development environments ensure that the hard work your data scientists put into building predictive models won’t go to waste when it’s time to deploy their code. Without a container-based workflow, a data scientist starting a new analysis must either wait for IT to build an environment from scratch, or build one themselves using the unique combination of packages and resources they prefer — and waiting for those to install or compile.
There are two major issues associated with both of these approaches: they don’t scale, and they’re slow. When data scientists are individually responsible for configuring environments as needed, their work isn’t reproducible — if it’s used in a different environment, it might not even run. Containers put the power in the hands of IT to standardize environment configuration in advance using images, which are snapshots of containers. Data scientists can launch environments from those images — which have already been vetted by IT — saving a lot of time in the long run.
2. Provide ample computing power to support your data scientists’ analysis from start to finish. Empowering them to spin up compute resources in the cloud as needed ensures they never get held up by limited computing power. It also eliminates the potential additional cost of maintaining unnecessary nodes. The same idea applies to on-prem data centers. IT must carefully monitor the expansion of data science work and scale resources accordingly. It may seem obvious, but IHS Markit reports that companies not anticipating this need lose approximately $700 billion a year to IT downtime.
3. Put data science work into production right away to start seeing its value earlier on. Imagine your data science team has built a recommender system to predict what products a customer is likely to enjoy based on the products he or she has already purchased. Even if you’re satisfied with the model’s accuracy and have identified some unexpected relationships that should inform your targeting strategies, this information still needs to be integrated into your application or website for it to be valuable.
Traditionally, the pipeline that delivers those recommendations to your customers would be built by engineers and require extensive support from IT. The rise of microservices, however, gives data scientists the opportunity to deploy models as APIs that can be integrated directly into an application.
If you’re among the 78% of companies not fully realizing the return on your data science investment, chances are there’s room to improve the IT foundation you’ve laid. To learn more about the next steps, find out how to take an agile approach to data science .
About the Author
William Merchan leads business and corporate development, partner initiatives, and strategy at DataScience.com as chief strategy officer. He most recently served as SVP of Strategic Alliances and GM of Dynamic Pricing at MarketShare, where he oversaw global business development and partner relationships, and successfully led the company to a $450 million acquisition by Neustar.

Explore similar content by topic

Data Science IT

H2O.ai Team

At H2O.ai, democratizing AI isn’t just an idea. It’s a movement. And that means that it requires action. We started out as a group of like minded individuals in the open source community, collectively driven by the idea that there should be freedom around the creation and use of AI.

Today we have evolved into a global company built by people from a variety of different backgrounds and skill sets, all driven to be part of something greater than ourselves. Our partnerships now extend beyond the open-source community to include business customers, academia, and non-profit organizations.

Generative AI

Predictive AI

Industry Solutions

Use Cases

H2O.ai Hospital Occupancy Simulator

Strategic Transformation

View All Case Studies

FINANCIAL SERVICES

TELECOM

HEALTHCARE

ENERGY

FINANCIAL INDUSTRIES

MARKETING

Partners

Resources

Open Source

Join H2O University

Support

Events

H2O.ai Wiki

Responsible AI

Company

What is an AI Cloud?

2023 Gartner® Magic Quadrant™

BLOG

Laying a Strong Foundation for Data Science Work

Explore similar content by topic

H2O.ai Team

Ready to see the H2O.ai platform in action?

Why H2O.ai

Products

Resources

Insights