Introduction to AI, Data Science & Machine Learning with Python

Duration: 5 days
Sandbox: Yes
Language: English
29 NASBA CPE Credits (live, in-class training only)
Level: Foundation

Data science is a field that has exploded in popularity in recent years, and for good reason. Companies across industries are increasingly relying on data to inform their decision-making, and skilled data scientists are in high demand. In this comprehensive course, you'll learn the foundational skills and techniques you need to succeed in this exciting field.

You'll start by exploring the role of a data scientist and the lifecycle of data science efforts within an organization. Then, you'll dive into the technical skills you need, such as using Python and its relevant libraries for data analysis and visualization, preprocessing unstructured data, and building AI/ML models.

You'll also explore key machine learning algorithms, including linear regression, decision tree classifiers, and clustering algorithms. And, you'll learn how to apply these techniques to real-world problems, such as predicting customer churn and building recommendation engines.

Throughout the data science training, you'll have the opportunity to work on hands-on exercises and projects, allowing you to practice your skills and build your portfolio. By the end of the course, you'll have a deep understanding of the data science process, the tools and techniques used by data scientists, and the ability to apply these skills to real-world problems.

Data Science Training in Python Delivery Methods

In-Person
Online
Upskill your whole team by bringing Private Team Training to your facility.

Data Science Training in Python Course Information

In this course, you will:
- Consider how traditional Predictive Machine Learning techniques combine Generative AI, Agentic AI, Multimodal AI and Deepseek-R1 approaches in the current Data Science landscape.
- Translate everyday business questions and problems into Machine Learning tasks to make data-driven decisions.
- Use Python Pandas, Matplotlib & Seaborn libraries to explore, analyze, and visualize data from various sources, including the web, word documents, email, NoSQL stores, databases, and data warehouses.
- Explore the concepts behind Foundation Models, Generative Pre-trained Transformers (GPTs), and Retrieval Augmented Generation (RAGs).
- Train a Machine Learning Classifier using different algorithmic techniques from the Scikit-Learn library, such as Decision Trees, Logistic Regression, and Neural Networks.
- Re-segment your customer market using K-Means and Hierarchical algorithms to better align products and services to customer needs.
- Discover hidden customer behaviors from Association Rules and build a Recommendation Engine based on behavioral patterns.
- Investigate relationships & flows between people and business-relevant entities using Social Network Analysis.
- Build predictive models of revenue and other numeric variables using Linear Regression.
- Test your knowledge with the included end-of-course exam.
- Leverage continued support with after-course one-on-one instructor coaching and computing sandbox.
Training Prerequisites

None.

Data Science Training in Python Course Outline

Module 1: The Role of a Data Scientist: Combining Technical and Non-Technical Skills

What is the required skillset of a Data Scientist?
Combining the technical and non-technical roles of a Data Scientist
The difference between a Data Scientist and a Data Engineer
Exploring the entire lifecycle of Data Science efforts within the organization
Turning business questions into Machine Learning (ML) and Artificial Intelligence (AI) models
Exploring diverse and wide-ranging data sources that you can use to answer business questions
Explore the concepts behind Foundation Models, Generative Pre-trained Transformers (GPTs), and Retrieval Augmented Generation (RAGs)

Module 2: Data Manipulation and Visualization using Python's Pandas and Matplotlib Libraries

Introducing the features of Python that are relevant to Data Scientists and Data Engineers
Viewing Data Sets using Python’s Pandas library
Importing, exporting, and working with all forms of data, from Relational Databases to Google Images
Using Python Selecting, Filtering, Combining, Grouping, and Applying Functions from Python's Pandas library
Dealing with Duplicates, Missing Values, Rescaling, Standardizing, and Normalizing Data
Visualizing data for both exploration and communication with the Pandas, Matplotlib, and Seaborn Python libraries

Module 3: Preprocessing and Analyzing Unstructured Data with Natural Language Processing

Preprocessing Unstructured Data such as web adverts, emails, and blog posts for AI/ML models
Exploring the most popular approaches to Natural Language Processing (NLP), such as stemming and "stop" words
Preparing a term-document matrix (TDM) of unstructured documents for analysis
Review the architectures of Foundation Models, Generative Pre-trained Transformers (GPTs), and Retrieval Augmented Generation (RAGs)
Look at how Data Scientists can integrate Large Language Models (LLMs) in their work

Module 4: Linear Regression and Feature Engineering for Business Problem Solving

Expressing a business problem, such as customer revenue prediction, as a linear regression task
Assessing variables as potential Predictors of the required Target (e.g., Education as a predictor of Salary Build)
Interpreting and Evaluating a Linear Regression model in Python using measures such as RMSE
Exploring the Feature Engineering possibilities to improve the Linear Regression model

Module 5: Classification Models and Evaluation for Predictive Analysis

Learning how AI/ML Classifiers are built and used to make predictions such as Customer Churn
Exploring how AI/ML Classification models are built using Training, Test, and Validation
Evaluating the strength of a Decision Tree Classifier

Module 6: Alternative Approaches to Classification and Model Evaluation

Examining alternative approaches to classification
Considering how Activation Functions are integral to Logistic Regression Classifiers
Delve into the architecture of Neural Networks and investigate the explosive growth of Deep Learning approaches in AI
Exploring the probability foundations of Naive Bayes classifiers
Reviewing different approaches to measuring the performance of AI/ML Classification Models
Reviewing ROC curves, AUC measures, Precision, Recall, and Confusion Matrices

Module 7: Clustering Techniques for Customer and Product Segmentation

Uncovering new ways of segmenting your customers, products, or services using clustering algorithms
Exploring what the concept of similarity means to humans and how you can implement it programmatically through distance measures on descriptive variables
Performing top-down clustering with Python’s Scikit-Learn K-Means algorithm
Performing bottom-up clustering with Scikit-Learn’s hierarchical clustering algorithm
Examining clustering techniques on unstructured data (e.g., Tweets, Emails, Documents, etc.)

Module 8: Association Rules and Recommender Systems for Business Applications

Building models of customer behaviors or business events from logged data using Association Rules
Evaluating the strength of these models through probability measures of support, confidence, and lift
Employing feature engineering approaches to improve the models
Building a recommender for your customers that is unique to your product/service offering

Module 9: Network Analysis for Organizational Insights

Analyzing your organization, its people, and its environment as a network of inter-relationships
Visualizing these relationships to uncover previously unseen business insights
Exploring ego-centric and socio-centric methods of analyzing connections critical to your organization

Module 10: Big Data Analytics, Communication, and Ethics

Examining Cloud (Microsoft, Amazon, Google) approaches to handling Big Data analytics
Exploring the communications and ethics aspects of being a Data Scientist
Discuss the ethical implications of recent developments in AI
Surveying the paths of continual learning for a Data Scientist

Select Commercial or Government Pricing

$3,350

5-day instructor-led training course
After-course coaching available
After-course computing sandbox access included
End-of-course exam included

Would you like to take this course as a team?

Guaranteed-to-Run - You can rest assured this class will not be canceled. Delivery format is subject to change – please call Customer Service if you have any questions.
Jul 20 - 24 9:00 AM - 4:30 PM EDT

Virtual
Guaranteed-to-Run - You can rest assured this class will not be canceled. Delivery format is subject to change – please call Customer Service if you have any questions.
Jul 27 - 31 9:00 AM - 4:30 PM EDT

Virtual
Aug 17 - 21 9:00 AM - 4:30 PM EDT

Ottawa or Virtual
Aug 24 - 28 10:00 AM - 5:30 PM EDT

Austin or Virtual
Aug 31 - Sep 4 9:00 AM - 4:30 PM EDT

Virtual
Guaranteed-to-Run - You can rest assured this class will not be canceled. Delivery format is subject to change – please call Customer Service if you have any questions.
Sep 14 - 18 9:00 AM - 4:30 PM EDT

Herndon, VA or Virtual
Sep 21 - 25 9:00 AM - 4:30 PM EDT

Herndon, VA or Virtual
Oct 5 - 9 9:00 AM - 4:30 PM EDT

Toronto or Virtual
Oct 19 - 23 9:00 AM - 4:30 PM EDT

New York or Virtual
Oct 26 - 30 11:00 AM - 6:30 PM EDT

Aurora, CO or Virtual
Nov 2 - 6 9:00 AM - 4:30 PM EST

Ottawa or Virtual
Nov 16 - 20 10:00 AM - 5:30 PM EST

Austin or Virtual
Dec 7 - 11 9:00 AM - 4:30 PM EST

Herndon, VA or Virtual
Dec 14 - 18 9:00 AM - 4:30 PM EST

Washington, DC or Virtual
Jan 4 - 8 9:00 AM - 4:30 PM EST

Toronto or Virtual
Jan 11 - 15 9:00 AM - 4:30 PM EST

New York or Virtual
Jan 25 - 29 12:00 PM - 7:30 PM EST

San Francisco or Virtual
Feb 1 - 5 9:00 AM - 4:30 PM EST

Ottawa or Virtual
Feb 8 - 12 10:00 AM - 5:30 PM EST

Austin or Virtual
Mar 1 - 5 9:00 AM - 4:30 PM EST

Herndon, VA or Virtual
Mar 8 - 12 9:00 AM - 4:30 PM EST

Washington, DC or Virtual
Apr 5 - 9 9:00 AM - 4:30 PM EDT

New York or Virtual
Apr 12 - 16 9:00 AM - 4:30 PM EDT

Toronto or Virtual
Apr 26 - 30 11:00 AM - 6:30 PM EDT

Aurora, CO or Virtual
May 3 - 7 10:00 AM - 5:30 PM EDT

Austin or Virtual
May 10 - 14 9:00 AM - 4:30 PM EDT

Ottawa or Virtual
May 17 - 21 9:00 AM - 4:30 PM EDT

Herndon, VA or Virtual
Jun 7 - 11 9:00 AM - 4:30 PM EDT

Toronto or Virtual
Jun 7 - 11 9:00 AM - 4:30 PM EDT

Washington, DC or Virtual
Jun 21 - 25 9:00 AM - 4:30 PM EDT

New York or Virtual

Scroll to view additional course dates

$2,975

5-day instructor-led training course
After-course coaching available
After-course computing sandbox access included
End-of-course exam included

Would you like to take this course as a team?

Guaranteed-to-Run - You can rest assured this class will not be canceled. Delivery format is subject to change – please call Customer Service if you have any questions.
Jul 20 - 24 9:00 AM - 4:30 PM EDT

Virtual
Guaranteed-to-Run - You can rest assured this class will not be canceled. Delivery format is subject to change – please call Customer Service if you have any questions.
Jul 27 - 31 9:00 AM - 4:30 PM EDT

Virtual
Aug 17 - 21 9:00 AM - 4:30 PM EDT

Ottawa or Virtual
Aug 24 - 28 10:00 AM - 5:30 PM EDT

Austin or Virtual
Aug 31 - Sep 4 9:00 AM - 4:30 PM EDT

Virtual
Guaranteed-to-Run - You can rest assured this class will not be canceled. Delivery format is subject to change – please call Customer Service if you have any questions.
Sep 14 - 18 9:00 AM - 4:30 PM EDT

Herndon, VA or Virtual
Sep 21 - 25 9:00 AM - 4:30 PM EDT

Herndon, VA or Virtual
Oct 5 - 9 9:00 AM - 4:30 PM EDT

Toronto or Virtual
Oct 19 - 23 9:00 AM - 4:30 PM EDT

New York or Virtual
Oct 26 - 30 11:00 AM - 6:30 PM EDT

Aurora, CO or Virtual
Nov 2 - 6 9:00 AM - 4:30 PM EST

Ottawa or Virtual
Nov 16 - 20 10:00 AM - 5:30 PM EST

Austin or Virtual
Dec 7 - 11 9:00 AM - 4:30 PM EST

Herndon, VA or Virtual
Dec 14 - 18 9:00 AM - 4:30 PM EST

Washington, DC or Virtual
Jan 4 - 8 9:00 AM - 4:30 PM EST

Toronto or Virtual
Jan 11 - 15 9:00 AM - 4:30 PM EST

New York or Virtual
Jan 25 - 29 12:00 PM - 7:30 PM EST

San Francisco or Virtual
Feb 1 - 5 9:00 AM - 4:30 PM EST

Ottawa or Virtual
Feb 8 - 12 10:00 AM - 5:30 PM EST

Austin or Virtual
Mar 1 - 5 9:00 AM - 4:30 PM EST

Herndon, VA or Virtual
Mar 8 - 12 9:00 AM - 4:30 PM EST

Washington, DC or Virtual
Apr 5 - 9 9:00 AM - 4:30 PM EDT

New York or Virtual
Apr 12 - 16 9:00 AM - 4:30 PM EDT

Toronto or Virtual
Apr 26 - 30 11:00 AM - 6:30 PM EDT

Aurora, CO or Virtual
May 3 - 7 10:00 AM - 5:30 PM EDT

Austin or Virtual
May 10 - 14 9:00 AM - 4:30 PM EDT

Ottawa or Virtual
May 17 - 21 9:00 AM - 4:30 PM EDT

Herndon, VA or Virtual
Jun 7 - 11 9:00 AM - 4:30 PM EDT

Toronto or Virtual
Jun 7 - 11 9:00 AM - 4:30 PM EDT

Washington, DC or Virtual
Jun 21 - 25 9:00 AM - 4:30 PM EDT

New York or Virtual

Scroll to view additional course dates

Speak with a Learning Consultant

Inquiring For

Myself

Team/Group

First Name*

Last Name*

Email*

Phone Number*

Company*

Job Title

I'm interested In

* Required Fields

Need Help Finding The Right Training Solution?

Our training advisors are here for you.

Data Science Training in Python Course FAQs

The AI ML Data Science Python Course is a training course that teaches participants how to use Python libraries to build, evaluate, and deploy Machine Learning (ML) and Artificial Intelligence (AI) models to gain insights from data.

This course covers every stage of the Data Science Lifecycle and teaches you how to manage, transform, and visualize raw data to create predictive models to help you find and evaluate future opportunities. You will also learn how to translate everyday business questions and problems into Machine Learning tasks to make data-driven decisions.

This course covers a wide range of topics, including data analysis, structured and unstructured data, natural language processing, sentiment analysis, data points, customer segments, logistic regression, random forests, nearest neighbors, business intelligence, target customers, data models, trading strategies, marketing campaigns, and programming language.

This course is designed for anyone who wants to gain foundational knowledge of data science, including both technical and non-technical beginners. It is particularly relevant for data scientists, analysts, and other professionals who work with data and want to improve their skills.

This course helps you gain business intelligence by teaching you how to uncover previously unseen business insights. You will analyze your organization, its people, and its environment as a network of inter-relationships and visualize these relationships to gain insights.

You will also learn how to build predictive models of revenue and other numeric variables using linear regression and how to use these models to inform business decisions.

Schedules are busy, but big data training online makes it easy to level up your career. If you need Big Data online training, we've got you covered. Our AnyWare course delivery option gives you the advantages of a live classroom right from the comfort of your computer screen–no matter where you are.

Yes. There are various opportunities to build a model and analyze issues throughout the training.

You just need an interest in gaining foundational knowledge of data science. This data scientist training course is designed for technical and non-technical beginners.

Marketing strategy is an important topic in this course as it teaches participants how to uncover new ways of segmenting customers, products, or services using clustering algorithms.

You will also learn how to build models of customer behaviors or business events from logged data using association rules and how to use this information to build a recommender for your customers that is unique to your product/service offering.

Natural language processing is used in this course to preprocess unstructured data such as web adverts, emails, and blog posts for AI/ML models. You will explore the most popular approaches to NLP, such as stemming and "stop" words, and learn how to prepare a term-document matrix (TDM) of unstructured documents for analysis.

This course covers a variety of data models, including linear regression, logistic regression, random forests, and nearest neighbors. You will learn how to express a business problem, such as customer revenue prediction, as a linear regression task and how to assess variables as potential predictors of the required target. You will also explore alternative approaches to classification, including neural networks and deep learning.

Sentiment analysis is used in this course to analyze customer feedback and opinions about products or services. You will learn how to perform sentiment analysis on customer reviews and use this information to inform marketing campaigns and trading strategies.

This course teaches participants how to work with both structured and unstructured data. You will learn how to use Python libraries to import, export, and work with all forms of data, from relational databases to Google Images. You will also explore clustering techniques on unstructured data such as tweets, emails, and documents.

Introduction to AI, Data Science & Machine Learning with Python

Data Science Training in Python Delivery Methods

Data Science Training in Python Course Information

Data Science Training in Python Course Outline

Module 1: The Role of a Data Scientist: Combining Technical and Non-Technical Skills

Module 2: Data Manipulation and Visualization using Python's Pandas and Matplotlib Libraries

Module 3: Preprocessing and Analyzing Unstructured Data with Natural Language Processing

Module 4: Linear Regression and Feature Engineering for Business Problem Solving

Module 5: Classification Models and Evaluation for Predictive Analysis

Module 6: Alternative Approaches to Classification and Model Evaluation

Module 7: Clustering Techniques for Customer and Product Segmentation

Module 8: Association Rules and Recommender Systems for Business Applications

Module 9: Network Analysis for Organizational Insights

Module 10: Big Data Analytics, Communication, and Ethics

Live Instructor $3,350

Live Instructor $2,975

Data Science Training in Python Course FAQs

What is the AI ML Data Science Python Course about?

What are some of the topics covered in this course?

Who is this course for?

How does this course help me gain business intelligence?

Does Learning Tree offer Big data online training?

Does this include any practical, hands-on learning?

What background do I need?

What is the role of marketing strategy in this course?

How is natural language processing used in this course?

What are some of the data models that are covered in this course?

How is sentiment analysis used in this course?

How is structured and unstructured data used in this course?

Related Courses