Extracting Business Value from Big Data with Pig, Hive & Impala



4 Days

Increase productivity by avoiding the low-level Java coding characteristic of MapReduce, and rapidly begin extracting business value for competitive advantage. In this big data training course, you will learn to access previously inaccessible data, gather and feed data into Hadoop for storage, transform and filter data using Pig, and extract value using Hive, Impala, and Spark.

You Will Learn How To

  • Manipulate complex data sets stored in Hadoop for competitive advantage
  • Automate the transfer of data into Hadoop storage with Flume and Sqoop
  • Filter data with Extract-Transform-Load (ETL) operations using Pig
  • Query multiple data sets for analysis with Pig and Hive
  • Perform real-time queries on Hadoop data with Impala and Shark

Important Course Information

Recommended Experience:

  • Knowledge of databases and SQL

Course Outline

The Hadoop Ecosystem

Hadoop overview

  • Surveying the Hadoop components
  • Defining the Hadoop architecture
  • Exploring HDFS and MapReduce

Storing data in HDFS

  • Achieving reliable and secure storage
  • Monitoring storage metrics
  • Controlling HDFS from the command line

Parallel processing with MapReduce

  • Detailing the MapReduce approach
  • Transferring algorithms not data
  • Dissecting the key stages of a MapReduce job

Automating data transfer

  • Facilitating data ingress and egress
  • Aggregating data with Flume
  • Configuring data fan-in and fan-out
  • Moving relational data with Sqoop

Executing Data Flows with Pig

Describing characteristics of Apache Pig

  • Contrasting Pig with MapReduce
  • Identifying Pig use cases
  • Pinpointing key Pig configurations

Structuring unstructured data

  • Representing data in Pig's data model
  • Running Pig Latin commands at the Grunt shell
  • Expressing transformations in Pig Latin syntax
  • Invoking Load and Store functions

Performing ETL with Pig

Transforming data with relational operators

  • Creating new relations with joins
  • Reducing data size by sampling
  • Extending Pig with user-defined functions

Filtering data with Pig

  • Consolidating data sets with unions
  • Partitioning data sets with splits
  • Injecting parameters into Pig scripts

Manipulating Data with Hive

Leveraging business advantages of Hive

  • Factoring Hive into components
  • Imposing structure on data with Hive

Organizing data in the Hive data warehouse

  • Creating Hive databases and tables
  • Contrasting available data types in Hive
  • Loading and storing data efficiently with SerDes

Designing data layout for maximum performance

  • Populating tables from queries
  • Partitioning Hive tables for optimal queries
  • Composing HiveQL queries

Extracting Business Value with HiveQL

Performing joins on unstructured data

  • Distinguishing joins available in Hive
  • Optimizing join structure for performance

Pushing HiveQL to the limit

  • Sorting, distributing and clustering data
  • Reducing query complexity with views
  • Improving query performance with indexes

Deploying Hive in production

  • Designing Hive schemas
  • Setting up data compression
  • Debugging Hive scripts

Streamlining storage management with HCatalog

  • Unifying the data view with HCatalog
  • Leveraging HCatalog to access the Hive metastore
  • Communicating via the HCatalog interfaces
  • Populating a Hive table from Pig

Interacting with Hadoop Data in Real Time

Parallel processing with Impala

  • Dissecting the core components of Impala
  • Submitting queries to Impala
  • Accessing Hive data from Impala

Unleashing the Spark framework

  • Reducing data access times with Shark
  • Querying Hive data with Shark

Course Schedule

Attend this live, instructor-led course In-Class or Online via AnyWare.

Hassle-Free Enrollment: No advance payment required.
Tuition due 30 days after your course.

  • May 30 - Jun 2: Herndon, VA/AnyWare
  • Jun 13 - 16: Toronto/AnyWare
  • Sep 26 - 29: Herndon, VA/AnyWare
  • Oct 10 - 13: AnyWare
  • Oct 31 - Nov 3: Toronto/AnyWare

Bring this Course to Your Organization and Train Your Entire Team
For more information, call 1-888-843-8733.

Course Tuition Includes:

After-Course Instructor Coaching
When you return to work, you are entitled to schedule a free coaching session with your instructor for help and guidance as you apply your new skills.

After-Course Computing Sandbox
You'll be given remote access to a preconfigured virtual machine so you can redo your hands-on exercises, develop and test new code, and experiment with the same software used in your course.

Free Course Exam
You can take your course exam on the last day of your course and receive a Certificate of Achievement with the designation "Awarded with Distinction."



Call 1-888-843-8733.

An experienced training advisor will happily answer any questions you may have and alert you to any tuition savings to which you or your organization may be entitled.

Training Hours

Standard Course Hours: 9:00 am – 4:30 pm
*Informal discussion with instructor about your projects or areas of special interest: 4:30 pm – 5:30 pm

FREE Online Course Exam (if applicable) – Last Day: 3:30 pm – 4:30 pm
By successfully completing your FREE online course exam, you will:

  • Have a record of your growth and learning results
  • Bring proof of your progress back to your organization
  • Earn credits toward industry certifications (if applicable)
  • Make progress toward one or more Learning Tree Specialist & Expert Certifications (if applicable)

Enhance Your Credentials with Professional Certification

Learning Tree's comprehensive training and exam preparation guarantee that you will gain the knowledge and confidence to achieve professional certification and advance your career.

Earn 23 Credits from NASBA

This course qualifies for 23 CPE credits from the National Association of State Boards of Accountancy CPE program.
