Search Button
RSS icon Sort by:
sparklyr: R interface for Apache Spark
by Vinod Iyengar October 7, 2016 Community R Sparkling Water

This post is reposted from Rstudio’s announcement on sparklyr – Rstudio’s extension for Spark Connect to Spark from R. The sparklyr package provides a complete dplyr backend. Filter and aggregate Spark datasets then bring them into R for analysis and visualization. Use Spark’s distributed machine learning library from R. Create extensions that call the full […]

Read More
Spam Detection with Sparkling Water and Spark Machine Learning Pipelines
Spam Detection with Sparkling Water and Spark Machine Learning Pipelines
by Jakub Hava June 15, 2016 Sparkling Water Technical Tutorials

This short post presents the “ham or spam” demo, which has already been posted earlier by Michal Malohlava, using our new API in latest Sparkling Water for Spark 1.6 and earlier versions, unifying Spark and H2O Machine Learning pipelines. It shows how to create a simple Spark Machine Learning pipeline and a model based on […]

Read More
two_block code
Databricks and H2O Make it Rain with Sparkling Water
by Michal Malohlava December 1, 2015 Demos Sparkling Water

  **This blog post was first posted on the Databricks blog here Databricks provides a cloud-based integrated workspace on top of Apache Spark for developers and data scientists. has been an early adopter of Apache Spark and has developed Sparkling Water to seamlessly integrate’s machine learning library on top of Spark. In this […]

Read More
1 2