Commit 18af400

chore(model gallery): add gemma-3-12b-it-qat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
1 parent 6af3f46 commit 18af400

1 file changed: +18 -0 lines changed

gallery/index.yaml

Lines changed: 18 additions & 0 deletions
@@ -78,6 +78,24 @@
     - filename: gemma-3-1b-it-Q4_K_M.gguf
       sha256: 8ccc5cd1f1b3602548715ae25a66ed73fd5dc68a210412eea643eb20eb75a135
       uri: huggingface://ggml-org/gemma-3-1b-it-GGUF/gemma-3-1b-it-Q4_K_M.gguf
+- !!merge <<: *gemma3
+  name: "gemma-3-12b-it-qat"
+  urls:
+    - https://huggingface.co/google/gemma-3-12b-it
+    - https://huggingface.co/vinimuchulski/gemma-3-12b-it-qat-q4_0-gguf
+  description: |
+    This model corresponds to the 12B instruction-tuned version of the Gemma 3 model in GGUF format using Quantization Aware Training (QAT). The GGUF corresponds to Q4_0 quantization.
+
+    Thanks to QAT, the model is able to preserve similar quality as bfloat16 while significantly reducing the memory requirements to load the model.
+
+    You can find the half-precision version here.
+  overrides:
+    parameters:
+      model: gemma-3-12b-it-q4_0.gguf
+  files:
+    - filename: gemma-3-12b-it-q4_0.gguf
+      sha256: 6f1bb5f455414f7b46482bda51cbfdbf19786e21a5498c4403fdfc03d09b045c
+      uri: huggingface://vinimuchulski/gemma-3-12b-it-qat-q4_0-gguf/gemma-3-12b-it-q4_0.gguf
 - !!merge <<: *gemma3
   name: "qgallouedec_gemma-3-27b-it-codeforces-sft"
   urls:
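
The new entry pins the GGUF file to a sha256 digest and points at it with a huggingface:// URI. As a rough illustration of how those two fields can be checked by hand, the sketch below splits the URI into its owner, repository, and file parts and compares a locally downloaded file against the pinned digest. The function names are hypothetical and this is not LocalAI's own code; it assumes the file has already been downloaded by some other means.

```python
import hashlib

# Values copied from the gallery entry added in this commit.
GALLERY_URI = (
    "huggingface://vinimuchulski/gemma-3-12b-it-qat-q4_0-gguf/"
    "gemma-3-12b-it-q4_0.gguf"
)
EXPECTED_SHA256 = "6f1bb5f455414f7b46482bda51cbfdbf19786e21a5498c4403fdfc03d09b045c"


def split_gallery_uri(uri):
    """Split 'huggingface://owner/repo/file' into (owner, repo, file).

    Hypothetical helper: it only handles the simple three-part form
    that appears in this diff.
    """
    prefix = "huggingface://"
    if not uri.startswith(prefix):
        raise ValueError(f"not a huggingface:// URI: {uri}")
    parts = uri[len(prefix):].split("/", 2)
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected owner/repo/file in: {uri}")
    return tuple(parts)


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash the file in chunks so a multi-gigabyte GGUF is never fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def file_matches_entry(path, expected_sha256=EXPECTED_SHA256):
    """Return True when the local file's digest equals the pinned sha256."""
    return sha256_of_file(path) == expected_sha256.lower()
```

Calling file_matches_entry("gemma-3-12b-it-q4_0.gguf") on a downloaded copy returns True only if the file is byte-identical to the one the gallery entry was written against.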
