Apache Spark Programming (Spark 105): 3 day Instructor Led Public Class (Warsaw)

Overview:

This three-day, onsite instructor-led course

Location: Warsaw, Poland (training address to be listed)

This  course  is  designed  for  data  engineers,  analysts,  architects;  software  engineers;  IT  operations;  and  technical  managers  interested  in  a  thorough,  hands-on  overview  of  Apache  Spark.    This  course  covers  the  same  material  as  our  three-day  Apache  Spark  Programming  course.

The  course  covers  the  core  APIs  for  using  Spark,  fundamental  mechanisms  and  basic  internals  of  the  framework,  SQL  and  other  high-level  data  access  tools,  as  well  as  Spark’s  streaming  capabilities  and  machine  learning  APIs.

Each  topic  includes  slide  and  lecture  content  along  with  hands-on  use  of  Spark  through  an  elegant  web-based  notebook  environment.  Inspired  by  tools  like  IPython/Jupyter,  notebooks  allow  attendees  to  code  jobs,  data  analysis  queries,  and  visualizations  using  their  own  Spark  cluster,  accessed  through  a  web  browser.  All  class  code  is  directly  usable  with  pure  open-source  Spark  or  any  commercial  Spark  distribution.

Objectives - after taking this class you will be able to:

  • Describe ​Spark’s ​fundamental ​mechanics
  • Use ​the ​core ​Spark ​APIs ​to ​operate ​on ​data
  • Articulate ​and ​implement ​typical ​use ​cases ​for ​Spark
  • Build ​data ​pipelines ​with ​SparkSQL ​and ​DataFrames
  • Analyze ​Spark ​jobs ​using ​the ​UIs ​and ​logs
  • Create ​Streaming ​and ​Machine ​Learning ​jobs

Modules:

  • Spark ​Overview
  • RDD ​Fundamentals
  • SparkSQL ​and ​DataFrames
  • Spark ​Job ​Execution
  • Cluster ​Architectures ​for ​Spark
  • Intro ​to ​Spark ​Streaming
  • Machine ​Learning ​Basics
Cost: ​$2500 ​per ​person

Requirements:

All ​participants ​will ​need ​a ​laptop ​with ​updated ​versions ​of ​Chrome ​or ​Firefox ​(Internet ​Explorer ​and ​Safari ​are ​not ​supported) ​

About Databricks:

Databricks’ mission is to accelerate innovation for its customers by unifying Data Science, Engineering and Business. Databricks’ founders started the Spark research project at UC Berkeley that later became Apache Spark. Databricks provides a Unified Analytics Platform powered by Apache Spark for data science teams to collaborate with data engineering and lines of business to build data products. Users achieve faster time-to-value with Databricks by creating analytic workflows that go from ETL and interactive exploration to production. The company also makes it easier for its users to focus on their data by providing a fully managed, scalable, and secure cloud infrastructure that reduces operational complexity and total cost of ownership. Databricks, venture-backed by Andreessen Horowitz, NEA and Battery Ventures, among others, has a global customer base that includes Viacom, Shell and HP. For more information, visit www.databricks.com.

Apache, Apache Spark and Spark are trademarks of the Apache Software Foundation.

Meet the trainers

Amadeusz

Software architect with Big Data processing and machine learning background. Has experience with designing, developing and deploying various solutions – from stream machine learning solution to isolated software sandbox. Amadeusz Conducts training for Apache Cassandra and Apache Spark libraries and holds BSc in Computer science, as well as Apache Spark Developer certificate.

Marcin

Has experience in developing web applications using Scala for backend and AngularJS with TypeScript for the frontend. He is an enthusiast of clean and well-tested code. Marcin is an AWS Associate-level Certified Solutions Architect and is on his way toward Engineer degree in Computer Science at Warsaw University of Technology. His thesis is related to sequential pattern mining using Spark.

Scheduled trainings

03-05.12

Apache Spark Programming: 3 day Instructor Led Public Class, Warsaw, Poland

Apply for training

Apply for training

Apache Spark Programming: 3 day Instructor Led Public Class, Warsaw, Poland

If you want to apply for this training, fill out this form and send us a message. We will contact you soon with more details.