305-651-6500

Jobs    Everything

Select a Metro Area
When:
November 17, 2018 @ 9:00 am – 4:00 pm
2018-11-17T09:00:00-05:00
2018-11-17T16:00:00-05:00
Where:
CIC Miami
1951 NW 7th Ave #600
Miami, FL 33136
USA

Big Data and Spark Training

Overview

This one-day Spark training is an introduction to the foundations of Spark and how to use it on the Databricks environment. Spark is a unified computing engine and a set of libraries for parallel processing of big data. Companies like Netflix, Yahoo and eBay have used Spark to achieve lightning-fast processing for large-scale data, and more and more companies are implementing the tool.

You will leave this training with an understanding of the foundational concepts of Spark and its architecture. You will see use cases and learn how to write simple applications using Spark.

For this training, you’ll use your own computer and the databricks environment to work through examples — please do not forget to bring a laptop!

Lunch will be provided.

 

Who Should Attend?

Business Intelligence Analysts

Data Analysts

Database Developers

Database Administrators

SQL Developers

Data Engineers

**Knowledge of at least one programming language is highly recommended**

 

About the Instructor

Daniel Cadenas is a computer engineer with 18+ year of experience in Business Intelligence, Data Warehousing and Big Data. Currently, working as Big Data Engineer at Ultimate Software, he is involved in building applications using Spark and Hadoop (HDP). Daniel is also a Data Scientist enthusiast with hands-on experience with machine learning and deep learning. Daniel has taught different classes during his career such as DB2, Cognos, Microstrategy, Data Stage and others.

Agenda

  • Spark 2 Architecture
  • Databricks Community Edition
  • Spark RDD
  • Spark SQL/DataFrames
  • Spark Streaming
  • Spark ML – Machine Learning
  • Performance and tuning considerations