Two packed days of demos, keynotes and training classes on this very cool open source machine learning framework with over 15 speakers. I have used this against HDP, HDP Spark 1.6 and a standalone Spark 1.6 cluster and it performed very well. Download H2O for Hadoop or Sparkling Water for Spark here. The product includes an awesome UI / data scientist notebook for rapid development of models. I will be attending and report on the interesting talks. H2O is a very interesting open source machine learning/deep learning framework and UI that works on top of Hadoop, Spark or stand-alone. One unique feature is it’s ability to generate a POJO from a model that can then be used in regular Java programs or in a Hive UDF.
For more information, see the presentation my friend, Dr. Fogelson, did in Princeton on using H2O for Predicting Repeat Shoppers.
Contact me for a 20% discount.
H2O supports all the machine learning algorithms you would expect like GBM, Decision Trees, K-Means, Deep Learning, Naïve Bayes and more. H2O is very mature and has been in production for years. H2O is certified on the Hortonworks HDP platform.
This tutorial is pretty awesome as you can build a POJO and then use it as a Hive UDF.
Take a look at this very cool Visual Introduction to Machine Learning.