Advanced Machine Learning Course (2025)

Oversampling techniques (Random Oversampling, SMOTE, ADASYN, BorderlineSMOTE)
Undersampling methods (Random Undersampling, Tomek Links, NearMiss)
Combined approaches (SMOTETomek, SMOTEENN)
Ensemble methods for imbalanced learning (Balanced Random Forest, EasyEnsemble)
Cost-sensitive learning and class weights

Linear Models & Regularization

Linear regression with feature engineering
Logistic regression for classification
Regularization techniques (L1, L2, ElasticNet)
Feature selection and dimensionality reduction

Missing Data Handling

Data quality checks and diagnostics
Univariate imputation methods (pandas and scikit-learn)
Multivariate imputation techniques (KNN, Iterative Imputer)
Time series interpolation methods

Marketing Analytics

Marketing Mix Modeling (MMM) for budget allocation
Multi-Touch Attribution (MTA) analysis
ROI calculation for marketing channels
Time series modeling for marketing impact

Hyperparameter Optimization

Grid Search for exhaustive parameter search
Randomized Search for efficient exploration
Bayesian optimization using Optuna
Advanced tuning strategies and best practices

Feature Engineering & Preprocessing

Advanced feature engineering techniques
Handling dirty data and data quality issues
Denoising using machine learning models
Polynomial features and feature interactions
Scikit-learn preprocessing pipelines
Feature-engine transformations

Time Series Forecasting

Facebook Prophet for trend and seasonality analysis
Theta method for exponential smoothing
Automated forecasting with StatsForecast
Handling holidays and special events
Multi-step ahead forecasting

02 Deep Learning

Deep Learning & Neural Networks

Autoencoders for data reconstruction and dimensionality reduction
Generative Adversarial Networks (GANs) - DCGAN implementation
Deep learning for feature learning and representation

Natural Language Processing

Introduction to NLP
BERT and GPT architectures

Transfer Learning

Transfer learning techniques and applications

03 Generative AI

Intro to HuggingFace

Introduction to the HuggingFace ecosystem

LLMs Fine Tuning

Fine-tuning Large Language Models
Fine-tuning with HuggingFace
Fine-tuning with OpenAI

LLMs Intro

Introduction to Large Language Models
RAG Applications with LangChain
OpenAI and Ollama APIs

04 MLOps

MLOps

Experiment tracking using MLflow
Experiment tracking using Weights & Biases (wandb)
Model versioning and registry
Artifact logging (models, plots, metrics)
Model deployment and serving
Reproducibility and collaboration workflows

Note

The repo got renamed form adv_ml_ds to advanced_machine_learning. The old URLs and your existing GitHub repo should all work as is (thanks to GitHub automatic redirects)

Environment Setup

Below are instructions for setting up virtual environments using different tools.

Using uv (Recommended)

This project uses uv for dependency management, which is significantly faster than standard pip.

Install uv (if not already installed):

# On macOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# On Windows
powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

# Or via pip
pip install uv

Sync the environment: This command creates the virtual environment and installs all dependencies defined in uv.lock (or pyproject.toml).
```
uv sync
```

Activate the environment:

source .venv/bin/activate  # On macOS/Linux
# or
.venv\Scripts\activate     # On Windows

Alternatively, you can run commands directly within the environment using uv run:

uv run jupyter lab

Using venv (Python built-in)

Create a virtual environment:
```
python3.12 -m venv dev1
```

Activate the environment:

source dev1/bin/activate  # On macOS/Linux
# or
dev1\Scripts\activate     # On Windows

Deactivate the environment:
```
deactivate
```

Using conda

Create a conda environment:
```
conda create -n dev1 python=3.12
```
Activate the environment:
```
conda activate dev1
```
Deactivate the environment:
```
conda deactivate
```

Installing Packages

Using uv (Recommended)

To add a new package to the project and update pyproject.toml and uv.lock:

uv add ipykernel pandas matplotlib scikit-learn seaborn

This ensures that all dependencies are tracked and reproducible.

Using pip

pip install ipykernel pandas matplotlob scikit-learn seaborn

Using conda

conda install ipykernel pandas matplotlob scikit-learn seaborn

Running shell commands from Notebooks

Shell commands can be executed within a Jupyter Notebook by prefixing the command with an exclamation mark (!). This allows users to interact with the underlying operating system directly from within their notebook environment.

!uv pip install ipykernel pandas matplotlob scikit-learn seaborn

Getting Started with Git

Cloning the Repository

To get a copy of this repository on your local machine:

git clone https://github.com/tatwan/adv_ml_ds.git
cd adv_ml_ds

Updating the Repository

To update your local copy with the latest changes from the remote repository:

git pull origin main

Handling Modifications and Conflicts

If you've made local modifications and want to update:

Commit your changes first (if you want to keep them):

git add .
git commit -m "Your commit message"
git pull origin main

Stash your changes (if you want to temporarily save them):

git stash
git pull origin main
git stash pop  # To restore your changes

If conflicts occur during pull:
- Git will notify you of conflicts
- Edit the conflicted files to resolve conflicts
- Stage the resolved files:
```
git add <resolved_file>
```
- Complete the merge:
```
git commit
```
Force update (use with caution, this will overwrite local changes):
```
git reset --hard origin/main
```

Python environments in VS Code

Read the official page on the topic

Jupyter Notebooks in VS Code

Read the official page on the topic

Data Science in VS Code tutorial

Read the official page on the topic

Manage Jupyter Kernels in VS Code

Read the official page on the topic

Quickstart for GitHub Codespaces

Read the official page on the topic

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
activities		activities
content		content
images		images
.gitattributes		.gitattributes
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

tatwan/advanced_machine_learning

Folders and files

Latest commit

History

Repository files navigation

Advanced Machine Learning Course (2025)

Table of Contents

Course Content Summary

01 Classical ML

Anomaly Detection

AutoML & Low-Code ML

Model Validation & Cross-Validation

Model Drift & Retraining

Ensemble Methods

Model Explainability & Interpretability

Advanced Linear Models

Imbalanced Data Handling

Linear Models & Regularization

Missing Data Handling

Marketing Analytics

Hyperparameter Optimization

Feature Engineering & Preprocessing

Time Series Forecasting

02 Deep Learning

Deep Learning & Neural Networks

Natural Language Processing

Transfer Learning

03 Generative AI

Intro to HuggingFace

LLMs Fine Tuning

LLMs Intro

04 MLOps

MLOps

Environment Setup

Using uv (Recommended)

Using venv (Python built-in)

Using conda

Installing Packages

Using uv (Recommended)

Using pip

Using conda

Running shell commands from Notebooks

Getting Started with Git

Cloning the Repository

Updating the Repository

Handling Modifications and Conflicts

Python environments in VS Code

Jupyter Notebooks in VS Code

Data Science in VS Code tutorial

Manage Jupyter Kernels in VS Code

Quickstart for GitHub Codespaces

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages