Type to search LearningTree.com

Do you mean "{{response.correctedQuery}}" ?

Sorry, no results were found for your query.

Please check your spelling and try your search again.

 

Big Data









Preferred method of contact?

Apache Spark Programming with Scala for Big Data Solutions

COURSE TYPE

Practitioner

Course Number

1262

Duration

4 Days

Enroll

About This Course: The availability of large data sets presents new opportunities and challenges to organizations of all sizes. This course explains Spark best practices and provides the Spark development training and programming skills to develop solutions that run on the Apache Spark platform. Additionally, you learn to test and deploy Big Data solutions on commodity clusters.

You Will Learn How To

  • Develop applications with Spark
  • Work with the libraries for SQL, Streaming and Machine Learning
  • Map real-world problems to parallel algorithms
  • Building business applications that integrate with Spark

Important Course Information

Requirements:

  • Professional experience in programming at the level of:
  • Three to six months of experience in a object-oriented programming language

Course Outline

  • Introduction to Spark
  • Defining Big Data and Big Computation
  • What is Spark?
  • What are the benefits of Spark?
  • The Challenge of Parallelizing Applications

Scaling-out applications

  • Identifying the performance limitations of a modern CPU
  • Scaling traditional parallel processing models

Designing parallel algorithms

  • Fostering parallelism through functional programming
  • Mapping real-world problems to effective parallel algorithms
  • Defining the Spark Architecture

Parallelizing data structures

  • Partitioning data across the cluster using Resilient Distributed Datasets (RDD) and DataFrames
  • Apportioning task execution across multiple nodes
  • Running applications with the Spark execution model

The anatomy of a Spark cluster

  • Creating resilient and fault-tolerant clusters
  • Achieving scalable distributed storage

Managing the cluster

  • Monitoring and administering Spark applications
  • Visualizing execution plans and results
  • Developing Spark Applications

Selecting the development environment

  • Performing exploratory programming via the Spark shell
  • Building stand-alone Spark applications

Working with the Spark APIs

  • Programming with Scala and other supported languages
  • Building applications with the core APIs
  • Enriching applications with the bundled libraries
  • Manipulating Structured Data with Spark SQL

Querying structured data

  • Processing queries with DataFrames and embedded SQL
  • Extending SQL with User-Defined Functions (UDFs)
  • Exploiting Parquet and JSON formatted data sets

Integrating with external systems

  • Connecting to databases with JDBC
  • Executing Hive queries in external applications
  • Processing Streaming Data in Spark

What is streaming?

  • Implementing sliding window operations
  • Determining state from continuous data
  • Processing simultaneous streams
  • Improving performance and reliability

Streaming data sources

  • Streaming from built-in sources (e.g., log files, Twitter sockets, Kinesis, Kafka)
  • Developing custom receivers
  • Processing with the streaming API and Spark SQL
  • Performing Machine Learning with Spark

Classifying observations

  • Predicting outcomes with supervised learning
  • Building a decision tree classifier

Identifying patterns

  • Grouping data using unsupervised learning
  • Clustering with the k-means method
  • Creating Real-World Applications

Building Spark-based business applications

  • Exposing Spark via a RESTful web service
  • Generating Spark-based dashboards

Spark as a service

  • Cloud vs. on-premises
  • Choosing a service provider (eg, AWS, Azure, Databricks)
  • The Future of Spark
  • Scaling to massive cluster sizes
  • Enhancing security on multi-tenant clusters
  • Tracking the ongoing commercialization of Spark
  • Project Tungsten: pushing performance closer to the limits of modern hardware
  • Working with existing projects powered by Spark
  • Re-architecting Spark for mobile platforms
Show complete outline
Show Less

Course Schedule

Attend this live, instructor-led course In-Class or Online via AnyWare.

Hassle-Free Enrollment: No advance payment required.
Tuition due 30 days after your course.

Jan 24 - 27 Herndon, VA/AnyWare Enroll Now

How would you like to attend?

Live, Online via AnyWare
In-Class

Mar 7 - 10 Rockville, MD/AnyWare Enroll Now

How would you like to attend?

Live, Online via AnyWare
In-Class

Jun 27 - 30 New York/AnyWare Enroll Now

How would you like to attend?

Live, Online via AnyWare
In-Class

Jul 25 - 28 Herndon, VA/AnyWare Enroll Now

How would you like to attend?

Live, Online via AnyWare
In-Class

Sep 5 - 8 Rockville, MD/AnyWare Enroll Now

How would you like to attend?

Live, Online via AnyWare
In-Class

Guaranteed to Run

Bring this Course to Your Organization and Train Your Entire Team
For more information, call 1-888-843-8733 or click here

Tuition

Standard

$2990

Government

$2659

Course Tuition Includes:

After-Course Instructor Coaching
When you return to work, you are entitled to schedule a free coaching session with your instructor for help and guidance as you apply your new skills.

Free Course Exam
You can take your course exam on the last day of your course and receive a Certificate of Achievement with the designation "Awarded with Distinction."

Prev
Next

Questions

Call 1-888-843-8733 or click here »

An experienced training advisor will happily answer any questions you may have and alert you to any tuition savings to
which you or your organization may be entitled.

Training Hours

Standard Course Hours: 9:00 am – 4:30 pm
*Informal discussion with instructor about your projects or areas of special interest: 4:30 pm – 5:30 pm


FREE Online Course Exam (if applicable) – Last Day: 3:30 pm – 4:30 pm
By successfully completing your FREE online course exam, you will:

  • Have a record of your growth and learning results.
  • Bring proof of your progress back to your organization
  • Earn credits toward industry certifications (if applicable)
  • Make progress toward one or more Learning Tree Specialist & Expert Certifications (if applicable)

Enhance Your Credentials with Professional Certification

Learning Tree's comprehensive training and exam preparation guarantees that you will gain the knowledge and confidence to achieve professional certification and advance your career.

Earn 23 Credits from NASBA

This course qualifies for 23 CPE credits from the National Association of State Boards of Accountancy CPE program. Read more ...

- ,

Prev
Next
s