Gemma Intro

Authors: Lorenzo Cesconetto, Pedro Gengo.

Running with llama.cpp

Pre-requisite: You must have llama.cpp already installed and setup.
Download from HF (please see HF cli download reference):

pip install "huggingface_hub[hf_transfer]" # hf_transfer enables faster download
HF_HUB_ENABLE_HF_TRANSFER=1 huggingface-cli download google/gemma-2b-it gemma-2b-it.gguf # must set HF_HUB_ENABLE_HF_TRANSFER=1 for a faster download

Quantized model:

./llama.cpp/quantize ../models_converted/gemma-2b-it.gguf ../models_quantized/gemma-2b-it-q8.gguf q8_0

Running:

./llama.cpp/main 2>/dev/null \
    --model ./models_quantized/gemma-2b-it-q8.gguf \
    --ctx-size 8192 \
    --batch-size 8 \
    --keep -1 \
    --n-predict -1 \
    --color \
    --interactive-first \
    --temp 0.1 \
    --reverse-prompt "<start_of_turn>user " \
    --in-suffix "<start_of_turn>model " \
    --in-prefix "<end_of_turn> "

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Demo_ViniCarida_Gemma.ipynb		Demo_ViniCarida_Gemma.ipynb
Gemma TFUGSP.pdf		Gemma TFUGSP.pdf
RAG_with_Gemma_2B.ipynb		RAG_with_Gemma_2B.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Gemma Intro

Running with llama.cpp

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

lorenzocesconetto/gemma-intro

Folders and files

Latest commit

History

Repository files navigation

Gemma Intro

Running with llama.cpp

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages