Schedule
-
EventDateDescriptionCourse Material
-
Lecture10/24/2024
ThursdayIntroduction SessionMain Topics:
- Course Format and Logistics
- Introductory Definitions and Concepts
-
Lecture10/31/2024
ThursdayDescriptive Statistics and Exploratory Data AnalysisMain Topics:
- Mean, median, mode, etc.
- Variance, standard deviation, etc.
- Guassian distribution
- Chebyshev’s inequality
- Skewness and kurtosis
- Data Quality
- DataFrames
- Visualization, Charts and Graphs
- Outliers
- Correlation Coefficient
Suggested Readings:
- Chapter 2 and 3 of The Orange Book of Machine Learning.
-
Lecture11/07/2024
ThursdayData Cleaning and Cross ValidationMain Topics:
- Missing Values
- Outliers
- Data De-duplication
- Feature Encoding
- Train test split
- Cross validation
- Data Leakage
- Covariate Shift
Suggested Readings:
- Chapter 4 and 5 of The Orange Book of Machine Learning.
-
Lecture11/14/2024
ThursdayRegressionMain Topics:
- Regression Basics
- Linear Regression
- Polynomial Regression
- Metrics
- Overfitting and Underfitting
- Conformal Prediction
Suggested Readings:
- Chapter 6 of The Orange Book of Machine Learning.
-
Lecture11/21/2024
ThursdayClassificationMain Topics:
- Logistic Regression
- Logloss function
- Metrics
- Imbalanced classification
- Decision Trees
Suggested Readings:
- Chapter 7 of The Orange Book of Machine Learning.
-
Lecture11/28/2024
ThursdayEnsemble, Hyperparameter Optimization, and Feature SelectionMain Topics:
- Random Forest
- AdaBoost
- Gradient Boosting
- Hyperparameter Optimization
- Feature Selection
- Feature Engineering
- PCA
Suggested Readings:
- Chapter 8, 9, 10, and 11 of The Orange Book of Machine Learning.
-
Lecture01/06/2025
MondayExplainabilityMain Topics:
- SHAP
- LIME
- Generative AI Explainability
- Transparent AI
Suggested Readings: