Sapling Logo

Gemma vs. Guanaco

LLM Comparison


Gemma

Gemma

Overview

Gemma is a family of lightweight open models from Google built using the same processes used for the the larger Gemini models.


Gemma was first released as a family of open models from Google -- 2B and 7B-parameter models, as of February 2024 -- intended for developers and compute-constrained devices. Despite their size, Gemma models compare favorably to other models of the same size such as the Mistral 7B model.


Initial release: 2024-02-21

Guanaco

Guanaco

Overview

Guanaco is an LLM based off the QLoRA 4-bit finetuning method developed by Tim Dettmers et. al. in the UW NLP group. Guanaco achieves 99% ChatGPT performance on the Vicuna benchmark.


Guanaco is an LLM that uses a finetuning method called LoRA that was developed by Tim Dettmers et. al. in the UW NLP group. With QLoRA, it becomes possible to finetune up to a 65B parameter model on a 48GB GPU without loss of performance relative to a 16-bit model. The Guanaco model family outperforms all previously released models on the Vicuna benchmark. However, given the models are based off of the LLaMA model family, commercial use is not permitted.


Initial release: 2023-05-23

Looking for an LLM API/SDK that works out of the box? No prompts or ad hoc guardrails.

Sapling API
More Comparisons

Gemma

Guanaco

Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License Custom Noncommercial
Model Sizes 2B, 7B 7B, 13B, 33B, 65B