An Overview of Python’s Datatable package
June 4, 2019 Data Science H2O H2O Driverless AI Python Technical Technical Posts
This blog originally appeared on Towardsdatascience.com. “There were 5 Exabytes of information created between the dawn of civilization through 2003, but that much information is now created every 2 days”: Eric Schmidt. If you are an R user, chances are that you have already been using the data.table package. Data.table is an extension of the data.frame package in R. It’s also […]
Building AI/ML models on Lending Club Data, with H2O.ai — Part 1
March 28, 2019 Beginners Community Data Journalism Data Science Technical Posts Tutorials
Lending Club publishes its basic loan databases to the public and a full version to its customers, anonymized of course. You can find the download page from this link. The publicly downloadable loan data has various attributes: more than 150 columns that have categorical, numeric, text and date fields. It also has a ‘loan_status’ text column […]
Finally, You Can Plot H2O Decision Trees in R
January 15, 2019 Data Science Machine Learning R Technical Technical Posts Tutorials
Creating and plotting decision trees for the models created in H2O will be the main objective of this post. [Figure 1. Decision Tree Visualization in R] Decision Trees with H2O: With release 3.22.0.1, H2O-3 (a.k.a. open source H2O or simply H2O) added to its family of tree-based algorithms (which already included DRF, […]
The Making of H2O Driverless AI – Automatic Machine Learning
December 5, 2018 AutoML Community H2O Driverless AI H2O World H2O4GPU Makers Technical Technical Posts
It is my pleasure to share with you some never-before-exposed nuggets and insights from the making of H2O Driverless AI, our latest automatic machine learning product on our mission to democratize AI. This has been truly a team effort, and I couldn’t be more proud of our brilliant makers who continue to relentlessly […]
H2O announces GPU Open Analytics Initiative with MapD & Continuum
May 8, 2017 Community GPU Technical Technical Posts
H2O.ai, Continuum Analytics, and MapD Technologies have announced the formation of the GPU Open Analytics Initiative (GOAI) to create common data frameworks enabling developers and statistical researchers to accelerate data science on GPUs. GOAI will foster the development of a data science ecosystem on GPUs by allowing resident applications to interchange data seamlessly and efficiently. […]