Machine Learning with R and H2O
May 2020: Seventh Edition
Contents
Section | Title | Page |
---|---|---|
1 | Introduction | 5 |
2 | What is H2O? | 6 |
3 | Installation | 7 |
3.1 | Installing R | 8 |
3.2 | Installing H2O from R | 8 |
3.3 | Example Code | 9 |
3.4 | Citation | 9 |
4 | H2O Initialization | 9 |
4.1 | Launching from R | 9 |
4.2 | Launching from the Command Line | 11 |
4.3 | Launching on Hadoop | 11 |
4.4 | Checking Cluster Status | 12 |
5 | Data Preparation in R | 12 |
5.1 | Notes | 13 |
6 | Models | 14 |
6.1 | Supervised Learning | 14 |
6.2 | Unsupervised Learning | 15 |
6.3 | Miscellaneous | 15 |
6.4 | Modeling Constructs | 15 |
7 | Demo: GLM | 15 |
8 | Data Manipulation in R | 18 |
8.1 | Importing Files | 18 |
8.2 | Uploading Files | 19 |
8.3 | Finding Factors | 19 |
8.4 | Converting to Factors | 19 |
8.5 | Converting Data Frames | 20 |
8.6 | Transferring Data Frames | 20 |
8.7 | Renaming Data Frames | 21 |
8.8 | Viewing Column Names | 21 |
8.9 | Getting Minimum and Maximum Values | 22 |
8.10 | Getting Quantiles | 22 |
8.11 | Summarizing Data | 23 |
8.12 | Summarizing Data in a Table | 24 |
8.13 | Generating Random Numbers | 25 |
8.14 | Splitting Frames | 26 |
8.15 | Getting Frames | 27 |
8.16 | Getting Models | 27 |
8.17 | Listing H2O Objects | 27 |
8.18 | Removing H2O Objects | 28 |
8.19 | Adding Functions | 28 |
9 | Running Models | 29 |
9.1 | Gradient Boosting Machine (GBM) | 29 |
9.2 | Generalized Linear Models (GLM) | 31 |
9.3 | K-means | 33 |
9.4 | Principal Components Analysis (PCA) | 34 |
9.5 | Predictions | 34 |
10 | Appendix: Commands | 35 |
10.1 | Dataset Operations | 35 |
10.2 | General Data Operations | 36 |
10.3 | Methods from Group Generics | 37 |
10.4 | Other Aggregations | 40 |
10.5 | Data Munging | 40 |
10.6 | Data Modeling | 41 |
10.7 | H2O Cluster Operations | 43 |
11 | Acknowledgments | 45 |
12 | References | 45 |
13 | Authors | 46 |
To read the eBook, click the download link above.