Machine Learning for Adtech
November 19, 2013 Uncategorized [EN]Characteristics of advertising data: tens of thousands of columns or more (top 100k or 1 m sites) high collinearity factors: eg demographics, with a strong correlation between eg income and education collinearity: sports fans follow nfl + espn + bleacher report + fox sports; users of ravelry also shop etsy. Those features are certainly not […]
Making films is not too different from startups
November 19, 2013 Uncategorized [EN]Quentin Tarantino, Ang Lee and other great directors discuss making films, creative process, attention to detail and inspiring & directing one's team to do great work.
H2O goes to CodeMesh in London
November 18, 2013 Uncategorized [EN]An API for Distributed Computing We have defined an API and built an open-source platform for dealing with in-memory distributed data. We’ve used it to built state-of-the-art predictive modeling and analytics (e.g. GLMNET, GBM, Random Forest) that’s 1000x faster than the disk-bound alternatives, and 100x faster than R (we love R but it’s tooo slow […]
H2O goes to qconsf
November 13, 2013 Uncategorized [EN]Math Algorithms have primarily been the domain of desktop data science. With the success of scalable algorithms at Google, Amazon, and Netflix, there is an ever growing demand for sophisticated algorithms over big data. In this talk, we get a ringside view in the making of the world's most scalable and fastest machine learning framework, […]
Distributed Deep Learning with H2O in the Cloud @ Ebay
November 12, 2013 Uncategorized [EN]Cyprien Noel will present hand-picked algorithms that work on H2O at scale and a survey of the space. We will walk users through the a couple of datasets (mnist) and demonstrate the power of Multi-layer Neural Networks at Scale in EC2. Learn more and sign up at http://www.meetup.com/Silicon-Valley-Big-Data-Science/events/132780102/
Predictable Rise of Physicists: Domain Science
November 8, 2013 Uncategorized [EN]For years, I secretly suspected that a lot of our math came from Physics. Some of the greatest leaps in math were made closely alongside the greatest discoveries in Physics. Calculus. QED. Turing. The physics of our businesses is grounded in a complex systems understanding of domain. When Data science gets finally freed from time-sapping […]
Frontier Big Data Meetup – Scalability & Availability
November 4, 2013 Uncategorized [EN]Come see Sri present on November 5th! 1. Sam Hamilton, Vice President of Data Technology at PayPal 2. SriSatish Ambati, Co-founder & CEO, 0xData 3. Sourav Mazumder, Technology Head of Big Data Practices, Infosys 4. Bruce Templeton, Co-founder & CEO, NephoScale At Room B3 in Mission City Ballroom, Santa Clara Convention Center Agenda 7-7:30PM: Registration & […]
Pivotal hosts 0xdata – Distributed Random Forest, GBM, GLM & API for Big Data Algos
November 4, 2013 Uncategorized [EN]Distributed Machine Learning has come of age, just in time to meet the challenges of Big Data. We will present an API for extending and rolling your own Algorithms or use powerful contest-winning Gradient Boosting Machine, Generalized Linear Modeling and Random Forest at scale. Demo and Fireworks using big datasets from within familiar R interface […]