Skip to main content
Ctrl+K

Olive latest documentation

  • Overview
  • Getting started
  • How Tos
  • Features
    • Reference
    • Blogs
  • Overview
  • Getting started
  • How Tos
  • Features
  • Reference
  • Blogs
  • Blogs

Blogs#


Exploring Optimal Quantization Settings for Small Language Models

An exploration of how Olive applies different quantization strategies such as GPTQ, mixed precision, and QuaRot to optimize small language models for efficiency and accuracy.
Exploring Optimal Quantization Settings for Small Language Models

previous

Python Interface

next

Exploring Optimal Quantization Settings for Small Language Models with Olive

© Copyright 2023-2025, Olive Dev team.

Built with the PyData Sphinx Theme 0.16.1.