BIG VS. LITTLE: P-Values and Coefficients
June 7, 2013 Uncategorized [EN]The Quick and Dirty: For the moment let’s assume that we have some a priori hypothesis, and we want to test. We can talk about two things: how big the relationship is and how strong it is. P-values don’t care about big – they only care about strong. To get a sense for this recall […]
Chocolate Cake
June 7, 2013 Uncategorized [EN]Chocolate Cake (Wednesday, June 5, 2013) You know how sometimes you have one bite of really good chocolate cake, or a really amazing peach and totally assume that you could eat another 30lbs of whatever without regard for good manners or physical limitations? Yeah. Decreasing marginal returns dictate that it almost always turns out that […]
Data Science is NOT Rocket Science
June 7, 2013 Uncategorized [EN]Finding myself at 0x is a lot less like starting fresh in a new profession and more like choosing cultural expatriation – it is a whole new (beautiful) world. On my first day everyone spoke what I was relatively sure should be English, but it felt like they were actually speaking in their own dialect […]
Meetup: Distributed Random Forest at SF Data Mining
May 16, 2013 Uncategorized [EN]Come watch Jan Vitek present Distributed Random Forest at SF Data Mining group.
Big Data Science Practice + Algo Implementation
May 10, 2013 Uncategorized [EN]In this double header we present a practitioners close view of the science and an engineer’s close view of design and implementation of distributed algorithm. Day in the Life of a Data Scientist – Chris Pouliot In this session, Netflix analytical leader Chris Pouliot shares his experience building a large team of data scientists at […]
Better Big Data Algorithms with H2O by 0xdata
May 10, 2013 Uncategorized [EN]Manhattan loves data + math better than any one! Join us on our first New York City meetup talking high-scale algos at Pivotal Labs, Union Sq, NYC Cliff and I will walk through a Big GLM over large datasets and deep dive in parallelizing and distributing algorithms over distributed array-let datastructures.
H2O Hack Data Meetup
May 3, 2013 Uncategorized [EN]Hack Data with Math, H2O Meetup We derive insights from Airline Dataset – We analyze airline take off and landing dataset of the past 20years and infer about how flying has changed (more delays, different airports) after 9/11?
Hack Data with Math using H2O – Silicon Valley Big Data Science Meetup at Google
May 1, 2013 Uncategorized [EN]Thanks for attending! Presentations: Cliff’s H2O and API for Big Data Math Talk JanVitek’s talk on Distributed Random Forest Cliff & Jan will present a deep dive into H2O and Hacking Big Data with Math. We locked down the Venue – Google, Building 43, 1600 Amphitheatre Parkway, Mountain View, CA, 94040. Can’t wait for the […]