Instructors

Join DataTrained's certified curriculum and learn every skill from the industry's best thought leaders.


Shankargouda Tegginmani

Data Scientist, Accenture

Shankar is a Data Scientist with 14 years of experience. He currently works at Accenture and has experience in telecom, healthcare, finance, and banking products.


Andrew Labeodan

Data Scientist, Centrica

Andrew is a Data Scientist with 14 years of experience. His expertise spans healthcare and energy utilities, and he is known for successfully implementing data analytics in oncology.


Adedolapo Ogunlade

Data Scientist at Kaggle

Adedolapo is a seasoned Data Scientist at Kaggle, specializing in Machine Learning, Statistical Analysis, and Data Visualization.


Ioannis Petridis

Data Scientist at KLDiscovery

Ioannis Petridis is a Data Scientist with 9+ years of experience, currently working at KLDiscovery. He has a deep passion for applying lean Data Science and Machine Learning solutions to solve business problems and deliver impactful, innovative products.

Data Science Course Syllabus

Best-in-class content by leading faculty and industry leaders in the form of live sessions, pre-recorded videos, projects, case studies, industry webinars, and assignments.


Detailed Syllabus of Data Science Course

  • {{$this_course->content_hours}}+ Hours of Content
  • 80+ Live Sessions
  • 15 Tools and Software

Comprehensive Curriculum

The curriculum has been designed by faculty from IITs and expert industry professionals.


Foundations

The Foundations bundle comprises two courses in which you will tackle statistics and coding head-on. Together they build a strong base for the rest of the program.

This course will introduce you to Python, a programming language widely used in Artificial Intelligence and Machine Learning. We will start with basic ideas before moving on to the language's important building blocks, such as keywords, syntax, and statement structure. The course will take you from the basic principles of AI and ML to the crucial ideas in Python, one of the most widely used and effective programming languages in today's market. Its simple, readable syntax is often compared to plain English.

Python Basics

Python is a popular high-level programming language with a simple, easy-to-understand syntax that focuses on readability. This module will guide you through the foundations of Python programming, culminating in the execution of your first Python program.
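As a sketch of where this module leads, a first Python program might cover variables, formatted output, and a loop. All the values below are made up for illustration:

```python
# A first Python program: variables, formatted output, and a loop.
greeting = "Hello, Data Science!"
print(greeting)

# Basic arithmetic and f-string formatting
hours_of_content = 300
print(f"This course offers {hours_of_content}+ hours of content.")

# Iterating over a list
tools = ["Python", "NumPy", "Pandas"]
for tool in tools:
    print(tool, "is a data science tool")
```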

Anaconda Installation - Jupyter notebook operation

Using Jupyter Notebook, you will learn how to use Python for Artificial Intelligence and Machine Learning. This open-source web tool lets us create and share documents containing narrative text, visualizations, mathematics, and live code.

Python functions, packages and other modules

Functions and packages enable code reusability and software modularity. In this module, you will learn how to understand and use Python functions and packages for AI.
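To illustrate the reusability idea, here is a minimal sketch: a hypothetical `standardize` function defined once and reused, alongside a function imported from the standard-library `math` module:

```python
import math  # a module from Python's standard library

def standardize(value, mean, std):
    """Return the z-score: how many standard deviations a value lies from the mean."""
    return (value - mean) / std

# Reuse the function instead of repeating the formula everywhere
z = standardize(75, mean=60, std=10)
print(z)              # 1.5

# Functions can also come from imported modules
print(math.sqrt(16))  # 4.0
```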

NumPy, Pandas, Visualization tools

In this module, you will learn how to use Pandas, Matplotlib, NumPy, and Seaborn, the most frequently used Python libraries, to explore data sets. You will also find out how to present large amounts of data as simple graphs with plotting libraries such as Seaborn and Matplotlib.
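A small sketch of this workflow, using a made-up sales dataset (the numbers and column names are hypothetical); the plotting step is shown only as a comment since it needs a display:

```python
import numpy as np
import pandas as pd

# A small, made-up dataset of daily sales
sales = np.array([120, 135, 150, 110, 160])
df = pd.DataFrame({"day": ["Mon", "Tue", "Wed", "Thu", "Fri"],
                   "sales": sales})

print(df.describe())          # quick numeric summary via Pandas
print("mean:", sales.mean())  # vectorized statistics via NumPy

# Plotting (requires matplotlib / seaborn):
#   import matplotlib.pyplot as plt
#   df.plot(x="day", y="sales", kind="bar"); plt.show()
```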

Working with various data structures in Python, Pandas, Numpy

Understanding data structures is among the core components of Data Science, and data structures also underpin AI and ML tasks such as voice and image processing. In this module, you will learn about data structures such as DataFrames, tuples, lists, and arrays, and exactly how to use them in Python.
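The four structures named above can be sketched side by side; the scores and student labels here are invented for illustration:

```python
import numpy as np
import pandas as pd

scores = [88, 92, 79]                   # list: ordered and mutable
point = (3.5, 7.2)                      # tuple: ordered and immutable
arr = np.array(scores)                  # NumPy array: fast element-wise math
frame = pd.DataFrame({"student": ["A", "B", "C"],
                      "score": scores}) # DataFrame: labeled 2-D table

scores.append(95)    # lists can grow in place
print(arr + 1)       # arrays support vectorized arithmetic
print(frame["score"].max())
```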

In this module, you will learn the terminology and concepts central to Exploratory Data Analysis and Machine Learning. You will study the tools required to assess and extract meaningful insights from data, from a simple average to the more advanced process of finding statistical evidence to support or reject a hypothesis.

Descriptive Statistics

Descriptive Statistics is the branch of data analysis concerned with describing and summarising data sets, whether a sample of a country's production figures or the salaries of a company's employees. This module will teach you how to compute descriptive statistics for Machine Learning in Python.
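As a minimal sketch, Python's standard-library `statistics` module can summarise a data set; the salary figures below are hypothetical, with a deliberate outlier to show why the median is more robust than the mean:

```python
import statistics

# Hypothetical monthly salaries (in thousands), with one outlier
salaries = [40, 45, 50, 50, 55, 60, 120]

print("mean:  ", statistics.mean(salaries))    # pulled up by the outlier
print("median:", statistics.median(salaries))  # robust to the outlier
print("mode:  ", statistics.mode(salaries))
print("stdev: ", round(statistics.stdev(salaries), 2))
```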

Inferential Statistics

In this module, you will use Python to study the core ideas of estimating quantities and evaluating hypotheses from data. You will also learn how a sample can yield insights about a large population, such as all the employees of a company, that could never be gathered manually.

Probability & Conditional Probability

Probability is a quantitative tool for examining uncertainty, such as the chance of an event occurring in a random experiment. The probability of an event occurring given that other events have occurred is known as conditional probability. In this module, you will learn probability and conditional probability in Python for Machine Learning.
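The conditional-probability formula P(B | A) = P(A and B) / P(A) can be worked through with a small invented example (the survey numbers are hypothetical):

```python
# Hypothetical survey of 100 customers:
# 40 bought product A, and 25 bought both A and B.
p_a = 40 / 100          # P(A)
p_a_and_b = 25 / 100    # P(A and B)

# Conditional probability: P(B | A) = P(A and B) / P(A)
p_b_given_a = p_a_and_b / p_a
print(p_b_given_a)  # among buyers of A, 62.5% also bought B
```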

Hypothesis Testing

In this module, you will learn how to use Python for hypothesis testing in Machine Learning. In applied statistics, hypothesis testing is one of the crucial steps for conducting experiments based on observed data.
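One way to sketch the idea without any statistics library is a permutation test; the A/B conversion figures below are invented, and this is only an illustration of the null-hypothesis logic, not a prescribed method from the course:

```python
import random

random.seed(42)

# Hypothetical A/B test: conversion counts for two page variants
group_a = [12, 14, 11, 13, 15, 12, 14]
group_b = [15, 17, 14, 16, 18, 15, 17]

observed = sum(group_b) / len(group_b) - sum(group_a) / len(group_a)

# Under the null hypothesis the group labels are interchangeable,
# so shuffle them many times and count how often a difference at
# least as large as the observed one arises by chance.
pooled = group_a + group_b
trials, count = 10_000, 0
for _ in range(trials):
    random.shuffle(pooled)
    diff = sum(pooled[7:]) / 7 - sum(pooled[:7]) / 7
    if diff >= observed:
        count += 1

p_value = count / trials
print("observed difference:", observed)
print("p-value:", p_value)  # a small p-value is evidence against the null
```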

Machine Learning

Machine Learning is a branch of Artificial Intelligence that allows software programs to improve their prediction accuracy without being explicitly programmed to do so. You will learn Machine Learning methods from fundamental to advanced, including the most frequently used classical ML algorithms across these categories.

In this module, you will learn what supervised machine learning algorithms are, how they operate, and their two main applications: classification and regression.

Linear Regression - Simple, Multiple regression

Linear Regression is one of the most popular Machine Learning algorithms for predictive analysis. It is an algorithm that assumes a linear relationship between the dependent and independent variables.
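Simple linear regression can be sketched directly from the least-squares formulas (slope = Cov(x, y) / Var(x)); the x/y data below are made up to lie roughly on y = 2x:

```python
# Simple linear regression by ordinary least squares (hypothetical data)
x = [1, 2, 3, 4, 5]
y = [2.1, 4.0, 6.2, 7.9, 10.1]  # roughly y = 2x

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# slope = Cov(x, y) / Var(x); the line passes through (mean_x, mean_y)
slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) \
        / sum((xi - mean_x) ** 2 for xi in x)
intercept = mean_y - slope * mean_x

print("slope:", round(slope, 3))
print("prediction for x=6:", round(intercept + slope * 6, 2))
```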

Logistic regression

Logistic Regression is one of the most popular machine learning algorithms. It is a fundamental classification technique that uses independent variables to predict binary outcomes such as 0 or 1, positive or negative, and true or false. In this module, you will learn the Logistic Regression concepts used in Machine Learning.
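At the heart of logistic regression is the sigmoid function, which turns a linear score into a probability. In this sketch the coefficients are invented (standing in for values a fitting procedure would learn), predicting pass/fail from hours studied:

```python
import math

def sigmoid(z):
    """Map any real number into the (0, 1) interval."""
    return 1 / (1 + math.exp(-z))

# Hypothetical fitted coefficients: pass/fail from hours studied
intercept, coef = -4.0, 1.0

def predict_proba(hours):
    return sigmoid(intercept + coef * hours)

print(round(predict_proba(2), 3))           # low probability of passing
print(round(predict_proba(6), 3))           # high probability of passing
print(1 if predict_proba(6) >= 0.5 else 0)  # thresholded class label
```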

K-NN classification

k-Nearest Neighbours (k-NN) is another widely used algorithm: a basic machine learning method for addressing both regression and classification problems. In this module, you will learn how to use it. You will also understand why it is known as a lazy algorithm. Interesting, right?
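A minimal hand-rolled sketch of k-NN classification (the 2-D points and "red"/"blue" labels are invented):

```python
from collections import Counter
import math

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    dists = sorted((math.dist(features, query), label)
                   for features, label in train)
    top_labels = [label for _, label in dists[:k]]
    return Counter(top_labels).most_common(1)[0][0]

# Hypothetical 2-D points in two well-separated groups
train = [
    ((1.0, 1.0), "red"),  ((1.5, 2.0), "red"),  ((2.0, 1.5), "red"),
    ((6.0, 6.0), "blue"), ((6.5, 7.0), "blue"), ((7.0, 6.5), "blue"),
]

print(knn_predict(train, (2.0, 2.0)))  # near the red cluster
print(knn_predict(train, (6.5, 6.5)))  # near the blue cluster
```

Note there is no training step at all: every distance is computed at prediction time, which is why k-NN is called a lazy algorithm.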

Support vector machines

Support Vector Machine (SVM) is another important machine learning technique for regression and classification problems. In this module, you will learn how to put the algorithm into practice and understand several ways of classifying data.

In this Machine Learning online course, we explore the limits of standalone supervised models and then discover a number of ways to address them, such as ensemble approaches.

Decision Trees

The Decision Tree algorithm is an important member of the supervised learning family and, unlike some others, can be used to solve both regression and classification problems. The goal of using a Decision Tree is to construct a model that predicts the class or value of the target variable by learning simple decision rules inferred from prior data.

Random Forests

Random Forest is a common supervised learning technique. It builds multiple decision trees on different subsets of the original dataset and combines their predictions, by majority vote for classification or by averaging for regression, to improve accuracy.

Bagging and Boosting

Bagging is used when the aim is to decrease the variance of a decision tree classifier. It averages the predictions of several trees trained on random subsets of the data, which is far more reliable than a single decision tree classifier.

Boosting is a technique for generating a sequence of predictors. Learners are trained sequentially: early learners fit simple models to the data, and each subsequent learner focuses on the errors of its predecessors.

In this module, you will study what unsupervised learning algorithms are, how they operate, and what applications they can be used for, such as clustering and dimensionality reduction.

K-means clustering

In Machine Learning and Data Science, k-means clustering is a common unsupervised learning method for clustering problems. In this module, you will learn how the algorithm works and how to use it.
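The algorithm alternates two steps, assigning points to the nearest centroid and moving each centroid to the mean of its points, which can be sketched in a few lines of NumPy. The points and starting centroids below are invented:

```python
import numpy as np

def kmeans(points, centroids, iterations=10):
    """A minimal k-means: alternate assignment and centroid-update steps."""
    for _ in range(iterations):
        # Assign each point to its nearest centroid
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Move each centroid to the mean of its assigned points
        centroids = np.array(
            [points[labels == k].mean(axis=0) for k in range(len(centroids))]
        )
    return labels, centroids

# Two well-separated hypothetical clusters
points = np.array([[1.0, 1.0], [1.2, 0.8], [0.9, 1.1],
                   [8.0, 8.0], [8.2, 7.9], [7.8, 8.1]])
init = np.array([[0.0, 0.0], [10.0, 10.0]])  # deliberate starting guesses

labels, centroids = kmeans(points, init)
print(labels)     # the first three points form one cluster, the rest the other
print(centroids)
```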

Hierarchical clustering

Hierarchical Clustering is a machine learning algorithm for creating a cluster hierarchy, or tree-like structure, that groups an unlabeled dataset into nested clusters. This module will help you use this technique.

Principal Component Analysis

PCA is a dimensionality reduction technique for reducing a model's complexity, for example by reducing the number of input variables in a predictive model to avoid overfitting. PCA is also a well-known ML approach in Python, and this module will cover all that you need to know about it.
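The core of PCA can be sketched from first principles: center the data, take the eigenvectors of the covariance matrix, and project onto the leading one. The synthetic data below is generated to vary mostly along one direction:

```python
import numpy as np

# Synthetic 2-D data that mostly varies along one diagonal direction
rng = np.random.default_rng(0)
t = rng.normal(size=100)
data = np.column_stack([t, 0.5 * t + rng.normal(scale=0.1, size=100)])

# PCA via eigen-decomposition of the covariance matrix
centered = data - data.mean(axis=0)
cov = np.cov(centered, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)   # eigh returns ascending order
order = eigvals.argsort()[::-1]          # sort descending by variance
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

explained = eigvals / eigvals.sum()
print("explained variance ratio:", explained.round(3))

# Project onto the first principal component: 2 features -> 1 feature
reduced = centered @ eigvecs[:, :1]
print("reduced shape:", reduced.shape)
```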

DBSCAN

Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is used to identify arbitrary-shaped clusters and clusters containing noise. You will learn how this algorithm helps identify the outliers in a group.

Advanced Techniques

EDA - Part1

Exploratory Data Analysis (EDA) is the process of analyzing data using different tools and techniques. You will learn data standardization and how to represent data through different graphs to assess and make decisions for various business use cases. You will also learn the essential encoding techniques.

EDA - Part2

You will also get the opportunity to handle null values and apply outlier-handling and other preprocessing techniques to prepare data for a machine learning model.

Feature Engineering

Feature Engineering is the process of extracting features from an organization's raw data using domain expertise. A feature is a property shared by independent units that can be used for prediction or analysis. In this module, you will learn how this works.

Feature Selection

Feature selection, also called attribute selection, variable selection, or variable subset selection, is the process of selecting a subset of relevant features for use in model development. You will learn many techniques for performing feature selection.
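One of the simplest such techniques is a variance threshold: drop features that barely vary, since they carry little predictive signal. This is a hand-rolled sketch on an invented dataset (libraries such as scikit-learn also provide this filter):

```python
# Variance-threshold feature selection (hand-rolled sketch)

def variance(column):
    mean = sum(column) / len(column)
    return sum((v - mean) ** 2 for v in column) / len(column)

# Hypothetical dataset: rows are samples, columns are features.
# The middle feature is constant, so it carries no information.
rows = [
    [1.0, 0.0, 10.0],
    [2.0, 0.0, 20.0],
    [3.0, 0.0, 30.0],
]
columns = list(zip(*rows))  # transpose rows into per-feature columns

selected = [i for i, col in enumerate(columns) if variance(col) > 0.01]
print("kept feature indices:", selected)  # the constant column is dropped
```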

Model building techniques

Here you will learn different model-building techniques using different tools.

Model Tuning techniques

In this module, you will learn how to enhance model performance using advanced techniques such as GridSearchCV, RandomizedSearchCV, cross-validation strategies, etc.

Building Pipeline

What is a modeling pipeline and how does it work? It is a set of data-preparation steps, modeling functions, and prediction-transform routines organized in a logical order, allowing you to specify, evaluate, and use the whole sequence as an atomic unit.
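The "steps applied in order, as one unit" idea can be sketched by hand; the step names, the imputation rule, and the data below are all invented for illustration (scikit-learn's `Pipeline` is the production version of this pattern):

```python
# A toy pipeline: named steps applied in order, as one atomic unit.

def fill_missing(values):
    """Replace None with the mean of the known values."""
    known = [v for v in values if v is not None]
    mean = sum(known) / len(known)
    return [mean if v is None else v for v in values]

def min_max_scale(values):
    """Rescale values into the [0, 1] range."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

pipeline = [("impute", fill_missing), ("scale", min_max_scale)]

def run_pipeline(steps, data):
    for name, step in steps:   # each step feeds the next
        data = step(data)
    return data

result = run_pipeline(pipeline, [1.0, None, 3.0, 5.0])
print(result)
```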

ChatGPT Essentials

ChatGPT is a revolutionary AI chatbot that provides users with powerful tools for content generation, prompt ideas, and more. This module will help you understand the various capabilities of this technology, including its strengths and limitations.

GPT-3 (Generative Pre-trained Transformer 3) is a state-of-the-art language model developed by OpenAI. Learn about GPT-3 and its capabilities, such as natural language understanding and prompt engineering.

Prompt engineering is the practice of designing and crafting effective prompts, or input instructions, for language models like GPT-3 to guide their generation of desired outputs. Learn to leverage the power of prompt engineering to become 10x more productive.

Explainable AI and model interpretability are becoming increasingly important as AI models are used in critical domains, such as healthcare, finance, and legal systems, where accountability, fairness, and transparency are crucial. Learn to interpret machine learning models using LIME & SHAP.

Dive into GPT models and their architecture. Develop an understanding of concepts like Reinforcement Learning from Human Feedback (RLHF), one-shot learning, and few-shot learning.

Learn to build an AI evaluator that automatically assesses students' exam submissions by leveraging GPT, eliminating the need to train NLP models from scratch.

Understand and learn to build a Graph Neural Network (GNN) model.

Gain an in-depth understanding of advanced GNNs.

Generative models can be used in a wide range of applications, including image generation, text generation, speech synthesis, music composition, and more. Build an understanding of text-to-image and image-to-text models.