# notebooks
**Repository Path**: sunlcc/unsloth-notebooks
## Basic Information
- **Project Name**: notebooks
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: LGPL-3.0
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2026-03-22
- **Last Updated**: 2026-03-22
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
## 📒 Fine-tuning Notebooks
Below are Colab notebooks, organized by model. You can also view all [notebooks in our docs](https://unsloth.ai/docs/get-started/unsloth-notebooks).
The notebooks run locally and feature data prep, training and inference. Read our [fine-tuning guide](https://unsloth.ai/docs/get-started/fine-tuning-llms-guide).
### Main Notebooks
| Model | Type | Notebook Link |
|-----------------------------|----------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| **Qwen3.5 (4B)** | Vision | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_5_%284B%29_Vision.ipynb) |
| **Qwen3.5 (2B)** | Vision | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_5_%282B%29_Vision.ipynb) |
| **gpt-oss (20B)** | Fine-tuning | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/gpt-oss-%2820B%29-Fine-tuning.ipynb) |
| **gpt-oss (20B)** | GRPO | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/gpt-oss-%2820B%29-GRPO.ipynb) |
| **Qwen3 (14B)** | Conversational | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_%2814B%29-Reasoning-Conversational.ipynb) |
| **Qwen3-VL (8B)** | Vision | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_VL_%288B%29-Vision.ipynb) |
| **Qwen3-Embedding (0.6B)** | Embeddings | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_Embedding_(0_6B).ipynb) |
| **Qwen3: Advanced GRPO** | GRPO | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_%284B%29-GRPO.ipynb) |
| **Gemma 3 (4B)** | Vision | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Gemma3_%284B%29-Vision.ipynb) |
| **Gemma 3N (4B)** | Audio | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Gemma3N_%284B%29-Audio.ipynb) |
| **embeddinggemma (300M)** | Embeddings | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/EmbeddingGemma_%28300M%29.ipynb) |
| **Mistral Ministral 3 (3B)**| Vision | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Ministral_3_VL_%283B%29_Vision.ipynb) |
| **Mistral v0.3 (7B)** | Vision | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Mistral_v0.3_(7B)-Alpaca.ipynb) |
| **Llama 3.1 (8B) Alpaca** | Alpaca | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_%288B%29-Alpaca.ipynb) |
| **Llama 3.2 (1B + 3B)**| Conversational | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.2_%281B_and_3B%29-Conversational.ipynb) |
| **Phi-4 (14B)** | Conversational | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Phi_4-Conversational.ipynb) |
| **Orpheus-TTS (3B)** | TTS | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Orpheus_%283B%29-TTS.ipynb) |
### GRPO & Reinforcement Learning Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) gpt oss** **(20B)** | GRPO |
|
| **gpt oss** **(20B)** | GRPO |
|
| **gpt oss** **(20B)** | GRPO |
|
| **Phi 4** **(14B)** | GRPO |
|
| **Llama3.1** **(8B)** | GRPO |
|
| **Qwen3** **(4B)** | GRPO |
|
| **Gemma3** **(1B)** | GRPO |
|
| **Qwen2.5** **(3B)** | GRPO |
|
| **LFM2.5** **(1.2B)** | GRPO |
|
| **DeepSeek R1 0528 Qwen3** **(8B)** | GRPO |
|
| **Mistral v0.3** **(7B)** | GRPO |
|
### Text-to-Speech (TTS) Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Orpheus** **(3B)** | TTS |
|
| **Llasa TTS** **(3B)** | TTS |
|
| **Sesame CSM** **(1B)** | TTS |
|
| **Oute TTS** **(1B)** | TTS |
|
| **Llasa TTS** **(1B)** | TTS |
|
| **Spark TTS** **(0.5B)** | TTS |
|
### Vision (Multimodal) Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Pixtral** **(12B)** | Vision |
|
| **ERNIE 4 5 VL 28B A3B PT** | Vision |
|
| **Llama3.2** **(11B)** | Vision |
|
| **Qwen3 VL** **(8B)** | Vision |
|
| **Qwen3 VL** **(8B)** | Vision GRPO |
|
| **Qwen3 5** **(4B)** | Vision |
|
| **Qwen3 5** **(2B)** | Vision |
|
| **Qwen3 5** **(0 8B)** | Vision |
|
| **Ministral3 VL** **(3B)** | Vision |
|
| **Gemma3N** **(4B)** | Vision |
|
| **Gemma3** **(4B)** | Vision |
|
| **Gemma3** **(4B)** | Vision GRPO |
|
| **Qwen2.5 VL** **(7B)** | Vision |
|
| **Qwen2.5 VL** **(7B)** | Vision GRPO |
|
| **LFM2.5 VL** **(1.6B)** | Vision |
|
| **Qwen2 VL** **(7B)** | Vision |
|
### Embedding Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **EmbeddingGemma** **(300M)** | |
|
| **All MiniLM L6 v2** | |
|
| **Qwen3 Embedding** **(4B)** | |
|
| **Qwen3 Embedding** **(0 6B)** | |
|
| **BGE M3** | |
|
| **ModernBert** | |
|
| **ModernBERT** **(Large)** | Classification |
|
### Speech-to-Text (STT) Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Whisper** **(Large)** | Fine Tuning |
|
### OCR Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Deepseek OCR** **(3B)** | Fine Tuning |
|
| **Deepseek OCR** **(3B)** | Evaluation |
|
| **Deepseek OCR** **(3B)** | Eval |
|
| **Deepseek OCR 2** **(3B)** | |
|
| **Paddle OCR** **(1B)** | Vision |
|
### BERT Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **ModernBert** | |
|
| **ModernBERT** **(Large)** | Classification |
|
### Deepseek Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Deepseek OCR** **(3B)** | Fine Tuning |
|
| **Deepseek OCR** **(3B)** | Evaluation |
|
| **Deepseek OCR** **(3B)** | Eval |
|
| **Deepseek OCR 2** **(3B)** | |
|
### ERNIE Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **ERNIE 4 5 VL 28B A3B PT** | Vision |
|
| **ERNIE 4 5 21B A3B PT** | Conversational |
|
### GLM Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) GLM Flash(80GB)** | |
|
### GPT-OSS Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) gpt oss** **(120B)** | Fine Tuning |
|
| **gpt oss** **(20B)** | GRPO 2048 |
|
| **gpt oss** **(20B)** | Fine Tuning |
|
| **gpt oss** **(20B)** | Fine Tuning |
|
| **gpt oss BNB** **(20B)** | Inference |
|
| **(OpenEnv) gpt oss** **(20B)** | GRPO 2048 |
|
| **(DGX Spark) gpt oss** **(20B)** | GRPO 2048 |
|
| **gpt oss BF16** **(20B)** | GRPO 2048 |
|
| **(OpenEnv) gpt oss BF16** **(20B)** | GRPO 2048 |
|
| **gpt oss MXFP4** **(20B)** | Inference |
|
### Gemma Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **EmbeddingGemma** **(300M)** | |
|
| **FunctionGemma** **(270M)** | Tool Calling |
|
| **FunctionGemma** **(270M)** | Mobile Actions |
|
| **FunctionGemma** **(270M)** | Inference |
|
| **FunctionGemma** **(270M)** | Conversational |
|
| **(A100) Gemma3** **(27B)** | Conversational |
|
| **CodeGemma** **(7B)** | Conversational |
|
| **Gemma3N** **(4B)** | Vision |
|
| **Gemma3N** **(4B)** | Multimodal |
|
| **Gemma3N** **(4B)** | Audio |
|
| **Gemma3N** **(2B)** | Inference |
|
| **Gemma3** **(4B)** | Vision |
|
| **Gemma3** **(4B)** | Vision GRPO |
|
| **Gemma3** **(4B)** | Conversational |
|
| **Gemma3** **(270M)** | Conversational |
|
| **Gemma3** **(270M)** | |
|
| **Gemma2** **(9B)** | Alpaca |
|
| **Gemma2** **(2B)** | Alpaca |
|
### Granite Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Granite4.0** **(3B)** | Conversational |
|
| **Granite4.0** **(350M)** | Conversational |
|
### Linear Attention Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Liquid LFM2** **(1.2B)** | Conversational |
|
| **Liquid LFM2** | Conversational |
|
| **Falcon H1** **(0.5B)** | Alpaca |
|
| **Falcon H1** | Alpaca |
|
### Llama Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) Llama3.3** **(70B)** | Conversational |
|
| **Llama3.2** **(1B)** | RAFT |
|
| **Llama3.2** **(1B)** | FP8 GRPO |
|
| **Llama3.2** **(1B and 3B)** | Conversational |
|
| **Llama3.2** **(11B)** | Vision |
|
| **Llama3.1** **(8B)** | Inference |
|
| **Llama3.1** **(8B)** | Alpaca |
|
| **Llama3** **(8B)** | Ollama |
|
| **Llama3** **(8B)** | ORPO |
|
| **Llama3** **(8B)** | Conversational |
|
| **Llama3** **(8B)** | Alpaca |
|
| **TinyLlama** **(1.1B)** | Alpaca |
|
### Mistral Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Magistral** **(24B)** | Reasoning Conversational |
|
| **Mistral Small** **(22B)** | Alpaca |
|
| **Pixtral** **(12B)** | Vision |
|
| **Mistral Nemo** **(12B)** | Alpaca |
|
| **Zephyr** **(7B)** | DPO |
|
| **Mistral** **(7B)** | Text Completion |
|
| **Ministral3** **(3B)** | GRPO Sudoku |
|
| **Ministral3 VL** **(3B)** | Vision |
|
| **Mistral v0.3** **(7B)** | Conversational |
|
| **Mistral v0.3** **(7B)** | CPT |
|
| **Mistral v0.3** **(7B)** | Alpaca |
|
### Nemotron Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) Nemotron Nano 3 30B A3B** | |
|
| **(A100) Nemotron 3 Nano 30B A3B** | |
|
### Paddle Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Paddle OCR** **(1B)** | Vision |
|
### Phi Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Phi 4** | Conversational |
|
| **Phi 3.5 Mini** | Conversational |
|
| **Phi 3 Medium** | Conversational |
|
### Qwen Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) Qwen3** **(32B)** | Reasoning Conversational |
|
| **(A100) Qwen 3 5 27B(80GB)** | |
|
| **TinyQwen3 MoE** | |
|
| **Qwen3** **(8B)** | FP8 GRPO |
|
| **Qwen3** **(4B)** | Thinking |
|
| **Qwen3** **(4B)** | QAT |
|
| **Qwen3** **(4B)** | Conversational |
|
| **Qwen3** **(14B)** | Reasoning Conversational |
|
| **Qwen3** **(14B)** | Alpaca |
|
| **Qwen3** **(14B)** | |
|
| **Qwen3** **(0.6B)** | Reasoning Conversational |
|
| **Qwen3** **(0 6B)** | |
|
| **Qwen3 VL** **(8B)** | Vision |
|
| **Qwen3 VL** **(8B)** | Vision GRPO |
|
| **Qwen3 MoE** | |
|
| **Qwen3 Embedding** **(4B)** | |
|
| **Qwen3 Embedding** **(0 6B)** | |
|
| **Qwen3 5** **(4B)** | Vision |
|
| **Qwen3 5** **(2B)** | Vision |
|
| **Qwen3 5** **(0 8B)** | Vision |
|
| **Qwen3 5 MoE** | |
|
| **Qwen2.5** **(7B)** | Alpaca |
|
| **Qwen2.5 VL** **(7B)** | Vision |
|
| **Qwen2.5 VL** **(7B)** | Vision GRPO |
|
| **Qwen2.5 Coder** **(14B)** | Conversational |
|
| **Qwen2.5 Coder** **(1.5B)** | Tool Calling |
|
| **Qwen2** **(7B)** | Alpaca |
|
| **Qwen2 VL** **(7B)** | Vision |
|
### Specific use-case Notebooks
| Usecase | Model | Notebook Link |
| --- | --- | --- |
| Text Classification | Llama 3.1 (8B) | [](https://colab.research.google.com/github/timothelaborie/text_classification_scripts/blob/main/unsloth_classification.ipynb) |
| Tool Calling | Qwen2.5-Coder (1.5B) | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen2.5_Coder_(1.5B)-Tool_Calling.ipynb) |
| Multiple Datasets | | [](https://colab.research.google.com/drive/1njCCbE1YVal9xC83hjdo2hiGItpY_D6t?usp=sharing) |
| KTO | Qwen2.5-Instruct (1.5B) | [](https://colab.research.google.com/drive/1MRgGtLWuZX4ypSfGguFgC-IblTvO2ivM?usp=sharing) |
| Inference Chat UI | LLaMa 3.2 Vision | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Unsloth_Studio.ipynb) |
| Conversational | LLaMa 3.2 (1B and 3B) | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.2_(1B_and_3B)-Conversational.ipynb) |
| ChatML | Mistral (7B) | [](https://colab.research.google.com/drive/15F1xyn8497_dUbxZP4zWmPZ3PJx1Oymv?usp=sharing) |
| Text Completion | Mistral (7B) | [](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Mistral_(7B)-Text_Completion.ipynb) |
### Other Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **LFM2.5** **(1.2B)** | Text Completion |
|
| **LFM2.5** **(1.2B)** | Conversational |
|
| **LFM2.5** **(1.2B)** | |
|
| **LFM2.5 VL** **(1.6B)** | Vision |
|
| **Unsloth** | Studio |
|
| **Synthetic Data Hackathon** | Synthetic Data |
|
| **NeMo Gym Sudoku** | |
|
| **NeMo Gym Multi Environment** | |
|
| **CodeForces cot Finetune for Reasoning on CodeForces** | Reasoning |
|
# 📒 Kaggle Notebooks
Click for all our Kaggle notebooks categorized by model:
### GRPO & Reinforcement Learning Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) gpt oss** **(20B)** | GRPO |
|
| **gpt oss** **(20B)** | GRPO |
|
| **gpt oss** **(20B)** | GRPO |
|
| **Phi 4** **(14B)** | GRPO |
|
| **Llama3.1** **(8B)** | GRPO |
|
| **Qwen3** **(4B)** | GRPO |
|
| **Gemma3** **(1B)** | GRPO |
|
| **Qwen2.5** **(3B)** | GRPO |
|
| **DeepSeek R1 0528 Qwen3** **(8B)** | GRPO |
|
| **Mistral v0.3** **(7B)** | GRPO |
|
### Text-to-Speech (TTS) Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Orpheus** **(3B)** | TTS |
|
| **Llasa TTS** **(3B)** | TTS |
|
| **Sesame CSM** **(1B)** | TTS |
|
| **Oute TTS** **(1B)** | TTS |
|
| **Llasa TTS** **(1B)** | TTS |
|
| **Spark TTS** **(0.5B)** | TTS |
|
### Vision (Multimodal) Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Pixtral** **(12B)** | Vision |
|
| **ERNIE 4 5 VL 28B A3B PT** | Vision |
|
| **Llama3.2** **(11B)** | Vision |
|
| **Qwen3 VL** **(8B)** | Vision |
|
| **Qwen3 VL** **(8B)** | Vision GRPO |
|
| **Ministral3 VL** **(3B)** | Vision |
|
| **Gemma3N** **(4B)** | Vision |
|
| **Gemma3** **(4B)** | Vision |
|
| **Gemma3** **(4B)** | Vision GRPO |
|
| **Qwen2.5 VL** **(7B)** | Vision |
|
| **Qwen2.5 VL** **(7B)** | Vision GRPO |
|
| **Qwen2 VL** **(7B)** | Vision |
|
### Embedding Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **EmbeddingGemma** **(300M)** | |
|
| **All MiniLM L6 v2** | |
|
| **Qwen3 Embedding** **(4B)** | |
|
| **Qwen3 Embedding** **(0 6B)** | |
|
| **BGE M3** | |
|
| **ModernBert** | |
|
| **ModernBERT** **(Large)** | Classification |
|
### Speech-to-Text (STT) Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Whisper** **(Large)** | Fine Tuning |
|
### OCR Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Deepseek OCR** **(3B)** | Fine Tuning |
|
| **Deepseek OCR** **(3B)** | Evaluation |
|
| **Deepseek OCR** **(3B)** | Eval |
|
| **Deepseek OCR 2** **(3B)** | |
|
| **Paddle OCR** **(1B)** | Vision |
|
### BERT Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **ModernBert** | |
|
| **ModernBERT** **(Large)** | Classification |
|
### Deepseek Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Deepseek OCR** **(3B)** | Fine Tuning |
|
| **Deepseek OCR** **(3B)** | Evaluation |
|
| **Deepseek OCR** **(3B)** | Eval |
|
| **Deepseek OCR 2** **(3B)** | |
|
### ERNIE Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **ERNIE 4 5 VL 28B A3B PT** | Vision |
|
| **ERNIE 4 5 21B A3B PT** | Conversational |
|
### GPT-OSS Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) gpt oss** **(120B)** | Fine Tuning |
|
| **gpt oss** **(20B)** | Fine Tuning |
|
| **gpt oss** **(20B)** | Fine Tuning |
|
| **gpt oss BNB** **(20B)** | Inference |
|
| **gpt oss MXFP4** **(20B)** | Inference |
|
### Gemma Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **EmbeddingGemma** **(300M)** | |
|
| **(A100) Gemma3** **(27B)** | Conversational |
|
| **CodeGemma** **(7B)** | Conversational |
|
| **Gemma3N** **(4B)** | Vision |
|
| **Gemma3N** **(4B)** | Multimodal |
|
| **Gemma3N** **(4B)** | Audio |
|
| **Gemma3N** **(2B)** | Inference |
|
| **Gemma3** **(4B)** | Vision |
|
| **Gemma3** **(4B)** | Vision GRPO |
|
| **Gemma3** **(4B)** | Conversational |
|
| **Gemma3** **(270M)** | Conversational |
|
| **Gemma2** **(9B)** | Alpaca |
|
| **Gemma2** **(2B)** | Alpaca |
|
### Granite Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Granite4.0** **(3B)** | Conversational |
|
| **Granite4.0** **(350M)** | Conversational |
|
### Linear Attention Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Liquid LFM2** **(1.2B)** | Conversational |
|
| **Falcon H1** **(0.5B)** | Alpaca |
|
### Llama Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) Llama3.3** **(70B)** | Conversational |
|
| **Llama3.2** **(1B)** | RAFT |
|
| **Llama3.2** **(1B)** | FP8 GRPO |
|
| **Llama3.2** **(1B and 3B)** | Conversational |
|
| **Llama3.2** **(11B)** | Vision |
|
| **Llama3.1** **(8B)** | Inference |
|
| **Llama3.1** **(8B)** | Alpaca |
|
| **Llama3** **(8B)** | Ollama |
|
| **Llama3** **(8B)** | ORPO |
|
| **Llama3** **(8B)** | Conversational |
|
| **Llama3** **(8B)** | Alpaca |
|
| **TinyLlama** **(1.1B)** | Alpaca |
|
### Mistral Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Magistral** **(24B)** | Reasoning Conversational |
|
| **Mistral Small** **(22B)** | Alpaca |
|
| **Pixtral** **(12B)** | Vision |
|
| **Mistral Nemo** **(12B)** | Alpaca |
|
| **Zephyr** **(7B)** | DPO |
|
| **Mistral** **(7B)** | Text Completion |
|
| **Ministral3** **(3B)** | GRPO Sudoku |
|
| **Ministral3 VL** **(3B)** | Vision |
|
| **Mistral v0.3** **(7B)** | Conversational |
|
| **Mistral v0.3** **(7B)** | CPT |
|
| **Mistral v0.3** **(7B)** | Alpaca |
|
### Nemotron Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) Nemotron Nano 3 30B A3B** | |
|
| **(A100) Nemotron 3 Nano 30B A3B** | |
|
### Paddle Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Paddle OCR** **(1B)** | Vision |
|
### Phi Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Phi 4** | Conversational |
|
| **Phi 3.5 Mini** | Conversational |
|
| **Phi 3 Medium** | Conversational |
|
### Qwen Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **(A100) Qwen3** **(32B)** | Reasoning Conversational |
|
| **Qwen3** **(8B)** | FP8 GRPO |
|
| **Qwen3** **(4B)** | Thinking |
|
| **Qwen3** **(4B)** | QAT |
|
| **Qwen3** **(4B)** | Conversational |
|
| **Qwen3** **(14B)** | Reasoning Conversational |
|
| **Qwen3** **(14B)** | Alpaca |
|
| **Qwen3** **(14B)** | |
|
| **Qwen3 VL** **(8B)** | Vision |
|
| **Qwen3 VL** **(8B)** | Vision GRPO |
|
| **Qwen3 Embedding** **(4B)** | |
|
| **Qwen3 Embedding** **(0 6B)** | |
|
| **Qwen2.5** **(7B)** | Alpaca |
|
| **Qwen2.5 VL** **(7B)** | Vision |
|
| **Qwen2.5 VL** **(7B)** | Vision GRPO |
|
| **Qwen2.5 Coder** **(14B)** | Conversational |
|
| **Qwen2.5 Coder** **(1.5B)** | Tool Calling |
|
| **Qwen2** **(7B)** | Alpaca |
|
| **Qwen2 VL** **(7B)** | Vision |
|
### Other Notebooks
| Model | Type | Notebook Link |
| --- | --- | --- |
| **Unsloth** | Studio |
|
| **CodeForces cot Finetune for Reasoning on CodeForces** | Reasoning |
|
## Known Issues / Environment Notes
- **NumPy 2.x ↔ soxr**: NumPy 2.x breaks soxr, causing Unsloth import failures. Pin `numpy<2` to resolve. Use `pip install --force-reinstall "numpy<2"` if needed. _Impact: Prevents Unsloth from running._
- **soxr reinstall**: `pip install --force-reinstall soxr` can pull NumPy 2.x back unless using `--no-deps`. Use `pip install --force-reinstall --no-deps soxr` to avoid this. _Impact: May reintroduce NumPy 2.x and break Unsloth imports._
- **typing_extensions**: Older typing_extensions can break torch import (TypeIs missing) until upgraded. Upgrade with `pip install --upgrade typing_extensions`. _Impact: Prevents PyTorch from importing correctly._
- **Resolver warnings**: Pinning `numpy<2` can cause pip resolver warnings with SciPy/Numba; typically non-fatal. _Impact: Cosmetic warnings only, does not affect functionality._
- **ROCm / triton_key**: LoRA backward can crash under `torch.compile` if Triton lacks `triton_key`; workaround is to disable Inductor/compile on ROCm (handled in code now, but worth noting). _Impact: May cause training crashes on AMD GPUs when using torch.compile._
# ✨ Contributing to Notebooks
If you'd like to contribute to our notebooks, here's a guide to get you started:
1. **Find the Template:** We've provided a template notebook called `Template_Notebook.ipynb` in the root directory of this project. This template contains the basic structure and formatting guidelines for all notebooks in this collection.
2. **Create Your Notebook:**
* Make a copy of `Template_Notebook.ipynb`.
* Rename the copied file to follow this naming convention:
* **LLM Notebooks:** `-.ipynb` (e.g., `Mistral_v0.3_(7B)-Alpaca.ipynb`)
* **Vision Notebooks:** `-Vision.ipynb` (e.g., `Llava_v1.6_(7B)-Vision.ipynb`)
* **Example of ``:** `Alpaca`, `Conversational`, `CPT`, `DPO`, `ORPO`, `Text_Completion`, `CSV`, `Inference`, `Unsloth_Studio`
3. **Place in `original_template`:** Once your notebook is ready, move it to the `original_template` directory.
4. **Update Notebooks:** Run the following command in your terminal:
```bash
python update_all_notebooks.py
```
This script will automatically:
* Copy your notebook from `original_template` to the `notebooks` directory.
* Update the notebook's internal sections (like Installation, News) to ensure consistency.
* Add your notebook to the appropriate list in this `README.md` file.
5. **Create a Pull Request:** After that, just create a pull request (PR) to merge your changes, making it available for everyone!
* We appreciate your contributions and look forward to reviewing your notebooks!