RepoLightning AILightning AIpublished May 4, 2023seen 5d

Lightning-AI/litgpt

Python

Open original ↗

Captured source

source ↗
published May 4, 2023seen 5dcaptured 15hhttp 200method plain

Lightning-AI/litgpt

Description: 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Language: Python

License: Apache-2.0

Stars: 13419

Forks: 1455

Open issues: 260

Created: 2023-05-04T17:46:11Z

Pushed: 2026-06-09T15:04:52Z

Default branch: main

Fork: no

Archived: no

README:

Looking for GPUs?

Over 340,000 developers use Lightning Cloud - purpose-built for PyTorch and PyTorch Lightning.

Finetune, pretrain, and inference LLMs Lightning fast ⚡⚡

Every LLM is implemented from scratch with no abstractions and full control, making them blazing fast, minimal, and performant at enterprise scale.

Enterprise ready - Apache 2.0 for unlimited enterprise use. ✅ Developer friendly - Easy debugging with no abstraction layers and single file implementations. ✅ Optimized performance - Models designed to maximize performance, reduce costs, and speed up training. ✅ Proven recipes - Highly-optimized training/finetuning recipes tested at enterprise scale.

Quick start

Install LitGPT

pip install 'litgpt[extra]'

Load and use any of the [20+ LLMs](#choose-from-20-llms):

from litgpt import LLM

llm = LLM.load("microsoft/phi-2")
text = llm.generate("Fix the spelling: Every fall, the family goes to the mountains.")
print(text)
# Corrected Sentence: Every fall, the family goes to the mountains.

✅ Optimized for fast inference ✅ Quantization ✅ Runs on low-memory GPUs ✅ No layers of internal abstractions ✅ Optimized for production scale

Advanced install options

Install from source:

git clone https://github.com/Lightning-AI/litgpt
cd litgpt
# if using uv
uv sync --all-extras
# if using pip
pip install -e ".[extra,compiler,test]"

[Explore the full Python API docs](tutorials/python-api.md).

---

Choose from 20+ LLMs

Every model is written from scratch to maximize performance and remove layers of abstraction:

| Model | Model size | Author | Reference | |----|----|----|----| | Llama 3, 3.1, 3.2, 3.3 | 1B, 3B, 8B, 70B, 405B | Meta AI | Meta AI 2024 | | Code Llama | 7B, 13B, 34B, 70B | Meta AI | Rozière et al. 2023 | | CodeGemma | 7B | Google | Google Team, Google Deepmind | | Gemma 2 | 2B, 9B, 27B | Google | Google Team, Google Deepmind | | Phi 4 | 14B | Microsoft Research | Abdin et al. 2024 | | Qwen2.5 | 0.5B, 1.5B, 3B, 7B, 14B, 32B, 72B | Alibaba Group | Qwen Team 2024 | | Qwen2.5 Coder | 0.5B, 1.5B, 3B, 7B, 14B, 32B | Alibaba Group | Hui, Binyuan et al. 2024 | | R1 Distill Llama | 8B, 70B | DeepSeek AI | DeepSeek AI 2025 | | ... | ... | ... | ... |

See full list of 20+ LLMs

All models

| Model | Model size | Author | Reference | |----|----|----|----| | CodeGemma | 7B | Google | Google Team, Google Deepmind | | Code Llama | 7B, 13B, 34B, 70B | Meta AI | Rozière et al. 2023 | | Falcon | 7B, 40B, 180B | TII UAE | TII 2023 | | Falcon 3 | 1B, 3B, 7B, 10B | TII UAE | TII 2024 | | FreeWilly2 (Stable Beluga 2) | 70B | Stability AI | Stability AI 2023 | | Function Calling Llama 2 | 7B | Trelis | Trelis et al. 2023 | | Gemma | 2B, 7B | Google | Google Team, Google Deepmind | | Gemma 2 | 9B, 27B | Google | Google Team, Google Deepmind | | Gemma 3 | 1B, 4B, 12B, 27B | Google | Google Team, Google Deepmind | | Llama 2 | 7B, 13B, 70B | Meta AI | Touvron et al. 2023 | | Llama 3.1 | 8B, 70B | Meta AI | Meta AI 2024 | | Llama 3.2 | 1B, 3B | Meta AI | Meta AI 2024 | | Llama 3.3 | 70B | Meta AI | Meta AI 2024 | | Mathstral | 7B | Mistral AI | Mistral AI 2024 | | MicroLlama | 300M | Ken Wang | MicroLlama repo | | Mixtral MoE | 8x7B | Mistral AI | Mistral AI 2023 | | Mistral | 7B, 123B | Mistral AI | Mistral AI 2023 | | Mixtral MoE | 8x22B | Mistral AI | Mistral AI 2024 | | OLMo | 1B, 7B | Allen Institute for AI (AI2) | Groeneveld et al. 2024 | | OpenLLaMA | 3B, 7B, 13B | OpenLM Research | Geng & Liu 2023 | | Phi 1.5 & 2 | 1.3B, 2.7B | Microsoft Research | Li et al. 2023 | | Phi 3 | 3.8B | Microsoft Research | [Abdin et al....

Excerpt shown — open the source for the full document.