NJ Data Science Meetup – How to Build Data Analytics Applications with Spark and Hortonworks

Workshop – How to build data Analytics app & Reco Engine using Spark + Horton

Thursday, Mar 17, 2016, 7:00 PM

Princeton University – Lewis Library Rm 122
Washington Road and Ivy Lane, Princeton, NJ 08544 Princeton, NJ

16 Data Scientists Attending

Agendaa) Hands-on – Build a Data analytics application using SPARK, Hortonworks, and Zeppelin. The session explains RDD concepts, DataFrames, sqlContext, use SparkSQL for working with DataFrames and explore graphical abilities of Zeppelin.b) Follow along – Build a Recommendation Engine – This will show how to build a whole web app with predictive…

Check out this Meetup →

Scala Days 2016 NYC

Scala Days 2016 Schedule have been announced!

Highlights:

Beyond Shuffling: Scaling Apache Spark
by Holden Karau @holdenkarau

Scala: The Unpredicted Lingua Franca for Data Science
by Andy Petrella @noootsab and Dean Wampler@deanwampler

Build a Recommender System in Apache Spark and Integrate It Using Akka
by Willem Meints @willem_meints

Implementing Microservices with Scala and Akka
by Vaughn Vernon @VaughnVernon

Microservices based off Akka cluster at iHeartRadio
by Kailuo Wang @kailuowang

Building a High-Performance Database with Scala, Akka & Spark
by Evan Chan @evanfchan

Large scale graph analysis using Scala and Akka
by Ben Fonarov @chuwiey

Distributed Real-Time Stream Processing: Why and How
by Petr Zapletal @petr_zapletal

Deep Learning and NLP with Spark
by Andy Petrella @noootsab

 

Fans of Scala, Spark, Big Data, Machine Learning, Real-time computing, Stream processing, functional programming and reactive programming all have great talks to choose from.   Tons of great speakers including the developer of Spark Notebook, top people in Scala and a good representation from leading industry users.

IMG_20150929_082701

Free Hadoop, Spark, Big Data Training

Free Hadoop Training List

http://bigdatauniversity.com/courses/introduction-to-solr/

Spark Fundamentals I

 

Spark Fundamentals II

http://bigdatauniversity.com/courses/text-analytics-essentials/

Hadoop Fundamentals I

Big Data Fundamentals

Hadoop Developer Day Event

Introduction to Pig

 

http://www.coreservlets.com/hadoop-tutorial/

Accessing Hadoop Data Using Hive

 

Using HBase for Real-time Access to your Big Data – Version 2

 

Introduction to Scala

https://www.udemy.com/hadoopstarterkit/?dtcode=VAN2bNB4geMV

https://www.udemy.com/big-data-basics-hadoop-mapreduce-hive-pig-spark/?dtcode=6oLh2qk4geNz

https://www.udemy.com/data-analytics-using-hadoop-in-2-hours/?dtcode=Wa8s2vV4geNW

https://www.udemy.com/data-analytics-using-hadoop-in-2-hours/learn/

https://www.udemy.com/hadoopstarterkit/learn

https://developer.yahoo.com/hadoop/

http://adooputorialraining180.teachable.com/courses/hadoop-free-course-training-tutorial

http://www.ibm.com/developerworks/data/library/techarticle/dm-1209hadoopbigdata/

https://www.mapr.com/services/mapr-academy/big-data-hadoop-online-training

https://www.mapr.com/services/mapr-academy/hadoop-essentials

https://www.mapr.com/services/mapr-academy/Developing-Hadoop-Applications

https://www.mapr.com/training/hadoop-demand-training/dev-325

https://www.mapr.com/services/mapr-academy/developing-hbase-applications-basics-training-course-on-demand

https://www.mapr.com/services/mapr-academy/developing-hbase-applications-advanced-training-course-on-demand

https://www.udemy.com/hadoopstarterkit/learn/

https://www.mapr.com/services/mapr-academy/apache-spark-essentials

https://www.mapr.com/services/mapr-academy/build-monitor-apache-spark-applications

https://www.mapr.com/services/mapr-academy/apache-hive-essentials-training-course-on-demand

https://www.mapr.com/services/mapr-academy/apache-pig-essentials-training-course-on-demand

https://www.mapr.com/services/mapr-academy/apache-drill-training-course-on-demand

https://www.mapr.com/services/mapr-academy/apache-drill-architecture-training-course-on-demand

MapReduce and YARN

https://www.udemy.com/big-data-basics-hadoop-mapreduce-hive-pig-spark/

https://www.edx.org/course/scalable-machine-learning-uc-berkeleyx-cs190-1x

https://www.edx.org/course/introduction-big-data-apache-spark-uc-berkeleyx-cs100-1x

Pivotal HDB

http://academy.pivotal.io/course/141303

http://www.cloudera.com/content/www/en-us/resources/training/cloudera-essentials-for-apache-hadoop-the-motivation-for-hadoop.html

 

Tools

Cloudera VM Download

http://www.cloudera.com/content/www/en-us/downloads/quickstart_vms/5-5.html

Cask VM Download

http://www.cloudera.com/content/www/en-us/downloads/cdap.html