Type to search LearningTree.com

Do you mean "{{response.correctedQuery}}" ?

Sorry, no results were found for your query.

Please check your spelling and try your search again.

 

Business Intelligence Training









Preferred method of contact?

Extracting Business Value from Big Data with Pig, Hive & Impala

COURSE TYPE

Practitioner

Course Number

1254

Duration

4 Days

Enroll

About This Course: This course provides the knowledge to leverage Pig and Hive to prepare and analyze large data sets on Hadoop. Productivity is increased by allowing the programmer to avoid low-level Java coding characteristic of MapReduce and rapidly clean, filter, impose structure and query data to obtain information of value that allows more informed and timely business decisions.

You Will Learn How To

  • Manipulate complex data sets stored in Hadoop for competitive advantage without writing complex Java code
  • Automate the transfer of data into Hadoop storage with Flume and Sqoop
  • Filter data with Extract-Transform-Load (ETL) operations using Pig
  • Query multiple data sets for analysis with Pig and Hive
  • Perform real-time queries on Hadoop data with Impala and Shark

Course Outline

  • The Hadoop Ecosystem
  • Hadoop overview
  • Surveying the Hadoop components
  • Defining the Hadoop architecture
  • Exploring HDFS and MapReduce

Storing data in HDFS

  • Achieving reliable and secure storage
  • Monitoring storage metrics
  • Controlling HDFS from the Command Line

Parallel processing with MapReduce

  • Detailing the MapReduce approach
  • Transferring algorithms not data
  • Dissecting the key stages of a MapReduce job

Automating data transfer

  • Facilitating data Ingress and Egress
  • Aggregating data with Flume
  • Configuring data fan in and fan out
  • Moving relational data with Sqoop
  • Executing Data Flows with Pig

Describing characteristics of Apache Pig

  • Contrasting Pig with MapReduce
  • Identifying Pig use cases
  • Pinpointing key Pig configurations

Structuring unstructured data

  • Representing data in Pig's data model
  • Running Pig Latin commands at the Grunt Shell
  • Expressing transformations in Pig Latin Syntax
  • Invoking Load and Store functions
  • Performing ETL with Pig

Transforming data with Relational Operators

  • Creating new relations with joins
  • Reducing data size by sampling
  • Extending Pig with user–defined functions

Filtering data with Pig

  • Consolidating data sets with unions
  • Partitioning data sets with splits
  • Injecting parameters into Pig scripts
  • Manipulating Data with Hive

Leveraging business advantages of Hive

  • Factoring Hive into components
  • Imposing structure on data with Hive

Organizing data in Hive Data Warehouse

  • Creating Hive databases and tables
  • Contrasting available data types in Hive
  • Loading and storing data efficiently with SerDes

Designing data layout for maximum performance

  • Populating tables from queries
  • Partitioning Hive Tables for optimal queries
  • Composing HiveQL queries
  • Extracting Business Value with HiveQL

Performing joins on unstructured data

  • Distinguishing joins available in Hive
  • Optimizing join structure for performance

Pushing HiveQL to the limit

  • Sorting, distributing and clustering data
  • Reducing query complexity with views
  • Improving query performance with indexes

Deploying Hive in production

  • Designing Hive schemas
  • Setting up data compression
  • Debugging Hive scripts

Streamlining storage management with HCatalog

  • Unifying the data view with HCatalog
  • Leveraging HCatalog to access the Hive metastore
  • Communicating via the HCatalog interfaces
  • Populating a Hive table from Pig
  • Interacting with Hadoop Data in Real Time

Parallel processing with Impala

  • Dissecting the core components of Impala
  • Submitting queries to Impala
  • Accessing Hive data from Impala

Unleashing the Spark framework

  • Reducing data access times with Shark
  • Querying Hive data with Shark
Show complete outline
Show Less

Course Schedule

Attend this live, instructor-led course In-Class or Online via AnyWare.

Hassle-Free Enrollment: No advance payment required.
Tuition due 30 days after your course.

Jan 17 - 20 New York/AnyWare Enroll Now

How would you like to attend?

Live, Online via Anyware
In-Class

Jan 31 - Feb 3 Herndon, VA/AnyWare Enroll Now

How would you like to attend?

Live, Online via Anyware
In-Class

Mar 14 - 17 Toronto/AnyWare Enroll Now

How would you like to attend?

Live, Online via Anyware
In-Class

Apr 10 - 13 AnyWare Enroll Now

How would you like to attend?

Live, Online via Anyware

May 30 - Jun 2 Herndon, VA/AnyWare Enroll Now

How would you like to attend?

Live, Online via Anyware
In-Class

Jun 13 - 16 Toronto/AnyWare Enroll Now

How would you like to attend?

Live, Online via Anyware
In-Class

Sep 26 - 29 Herndon, VA/AnyWare Enroll Now

How would you like to attend?

Live, Online via Anyware
In-Class

Guaranteed to Run

Bring this Course to Your Organization and Train Your Entire Team
For more information, call 1-888-843-8733 or click here

Tuition

Standard

$2990

Government

$2659

Course Tuition Includes:

After-Course Instructor Coaching
When you return to work, you are entitled to schedule a free coaching session with your instructor for help and guidance as you apply your new skills.

Free Course Exam
You can take your course exam on the last day of your course and receive a Certificate of Achievement with the designation "Awarded with Distinction."

Prev
Next

Questions

Call 1-888-843-8733 or click here »

An experienced training advisor will happily answer any questions you may have and alert you to any tuition savings to
which you or your organization may be entitled.

Training Hours

Standard Course Hours: 9:00 am – 4:30 pm
*Informal discussion with instructor about your projects or areas of special interest: 4:30 pm – 5:30 pm


FREE Online Course Exam (if applicable) – Last Day: 3:30 pm – 4:30 pm
By successfully completing your FREE online course exam, you will:

  • Have a record of your growth and learning results.
  • Bring proof of your progress back to your organization
  • Earn credits toward industry certifications (if applicable)
  • Make progress toward one or more Learning Tree Specialist & Expert Certifications (if applicable)

Enhance Your Credentials with Professional Certification

Learning Tree's comprehensive training and exam preparation guarantees that you will gain the knowledge and confidence to achieve professional certification and advance your career.

This course qualifies for 23 CPE credits from the National Association of State Boards of Accountancy CPE program. Read more ...

- ,

Prev
Next