
A code implementation of MoDeGPT

This repository implements MoDeGPT, the LLM compression method from the ICLR 2025 conference paper "Modular Decomposition for Large Language Model Compression."

1. Set-up

  1. Clone this repository

```
git clone https://github.com/cbacary/MoDeGPT.git
cd MoDeGPT
```
  2. Install packages

using uv:

```
uv venv --python 3.12
uv pip install -r requirements.txt
```
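If you prefer not to use uv, a plain virtual environment with pip should work as well (a minimal sketch, assuming Python 3.12 is available on your system):

```
python3.12 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```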

2. Run compression

Example usage for Llama2-7b, skipping the MLP compression stage and using 128 calibration samples:

```
python run_modegpt.py \
  --model meta-llama/Llama-2-7b-hf \
  --compression_ratio 0.25 \
  --calib_size 128 \
  --eval_size 128 \
  --calibs_batch_size 16 \
  --output_dir ./compressed_output/llama2-7b \
  --device 0 \
  --skip mlp
```
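Assuming run_modegpt.py saves the compressed checkpoint to --output_dir in Hugging Face format (check the script for the exact save behavior), reloading it would look roughly like this sketch:

```python
# Sketch only, not this repo's documented API: reload the compressed
# checkpoint written to --output_dir with the transformers library.
from transformers import AutoModelForCausalLM, AutoTokenizer

path = "./compressed_output/llama2-7b"  # matches --output_dir above
tokenizer = AutoTokenizer.from_pretrained(path)
# Loading may require the patched modeling classes from patchers/,
# since the compressed layer shapes differ from the stock model.
model = AutoModelForCausalLM.from_pretrained(path)
```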

3. Additional Information

Currently tested against OPT, Llama2-7b, and Llama3-8b models. For Llama3-8b, calibration and evaluation against the Alpaca dataset gave better results; to enable it, set ALPACA=True at the top of run_modegpt.py.
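That is, edit the flag near the top of run_modegpt.py (exact placement per the script):

```python
ALPACA = True  # use the Alpaca dataset for calibration/eval (Llama3-8b)
```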

To support additional model architectures, you will need to modify or create your own patched version of the model's modeling file (this differs for each architecture). See patchers/OPTRebuild.py, patchers/LlamaRebuild.py, and the patch_config function in patchers/patch.py. The process mostly involves initializing the linear layers with their compressed dimensions, but other steps may be required depending on the architecture you are working with; a sketch of the core idea follows.
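For a rough sense of what the rebuild step does, here is a hypothetical sketch (the function name and structure are illustrative, not this repo's API; the real logic lives in patchers/LlamaRebuild.py and patchers/patch.py):

```python
# Hypothetical illustration: re-initialize each linear layer with its
# compressed dimensions so the smaller weight matrices load cleanly.
import torch.nn as nn

def rebuild_gated_mlp(mlp: nn.Module, hidden_size: int, compressed_dim: int) -> nn.Module:
    """Shrink a Llama-style gated MLP's intermediate dimension.

    `compressed_dim` is the intermediate size after compression;
    the surrounding hidden size stays unchanged.
    """
    mlp.gate_proj = nn.Linear(hidden_size, compressed_dim, bias=False)
    mlp.up_proj = nn.Linear(hidden_size, compressed_dim, bias=False)
    mlp.down_proj = nn.Linear(compressed_dim, hidden_size, bias=False)
    return mlp
```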
