Skip to content
@neuralmagic

Neural Magic

Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM

Pinned Loading

  1. deepsparse deepsparse Public archive

    Sparsity-aware deep learning inference runtime for CPUs

    Python 3.2k 192

Repositories

Showing 10 of 84 repositories
  • neuralmagic/model-validation-configs’s past year of commit activity
    2 0 0 1 Updated Dec 18, 2025
  • research Public

    Repository to enable research flows

    neuralmagic/research’s past year of commit activity
    Python 3 0 0 3 Updated Dec 18, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    neuralmagic/vllm’s past year of commit activity
    Python 16 Apache-2.0 12,155 0 22 Updated Dec 18, 2025
  • lighteval Public Forked from huggingface/lighteval

    Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

    neuralmagic/lighteval’s past year of commit activity
    Python 0 MIT 404 0 0 Updated Dec 18, 2025
  • pytorch Public Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    neuralmagic/pytorch’s past year of commit activity
    Python 1 26,958 0 4 Updated Dec 16, 2025
  • lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval

    Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

    neuralmagic/lmms-eval’s past year of commit activity
    Python 0 463 0 11 Updated Dec 17, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    neuralmagic/lm-evaluation-harness’s past year of commit activity
    Python 5 MIT 2,927 0 1 Updated Dec 16, 2025
  • tpu-inference Public Forked from vllm-project/tpu-inference

    TPU inference for vLLM, with unified JAX and PyTorch support.

    neuralmagic/tpu-inference’s past year of commit activity
    Python 0 Apache-2.0 61 0 0 Updated Dec 16, 2025
  • axolotl Public Forked from axolotl-ai-cloud/axolotl

    Go ahead and axolotl questions

    neuralmagic/axolotl’s past year of commit activity
    Python 0 Apache-2.0 1,235 0 5 Updated Dec 14, 2025
  • nm-vllm Public archive Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    neuralmagic/nm-vllm’s past year of commit activity
    Python 267 12,155 0 0 Updated Dec 4, 2025