Gemma 2 vs. Guanaco: Which LLM is Better?

LLM Comparison

Gemma 2

Overview

Gemma 2 is the successor to the Gemma family of lightweight open models from Google, built using the same processes used for the larger Gemini models.

Gemma 2 is the successor to the Gemma family of open models and includes larger models (9B and 27B parameters) with outsized performance across benchmarks. Through a combination of techniques, including training on twice as much data, knowledge distillation, model merging, and architectural improvements such as sliding window attention and logit soft-capping, Gemma 2 outperforms models of similar size (such as Llama 3), and the 27B parameter model is competitive with models more than twice its size (such as Llama 3 70B).
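Two of the architectural techniques named above can be illustrated in a few lines. The following is a minimal sketch, not Gemma 2's actual implementation: logit soft-capping is shown as a scaled tanh, and sliding window attention as a causal mask limited to recent positions. The function names, the cap value, and the window size are illustrative assumptions, not values taken from this page.

```python
import math

def soft_cap(logits, cap):
    """Squash each logit smoothly so its magnitude never exceeds `cap`.

    Small values pass through almost unchanged; large values saturate.
    """
    return [cap * math.tanh(x / cap) for x in logits]

def sliding_window_mask(seq_len, window):
    """Return mask[i][j] == True when query position i may attend to key j.

    Attention is causal (j <= i) and restricted to the `window` most
    recent positions, including the current one.
    """
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]

# Illustrative values only (assumptions, not Gemma 2's settings).
capped = soft_cap([0.0, 1.0, 1000.0], cap=30.0)
mask = sliding_window_mask(seq_len=4, window=2)
```

With a window of 2, the last token in a 4-token sequence attends only to itself and the token before it, which keeps per-token attention cost bounded as the sequence grows.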

Initial release: 2024-06-27

Reference

https://blog.google/technology/developers/google-gemma-2/

Guanaco

Overview

Guanaco is an LLM finetuned with QLoRA, a 4-bit finetuning method developed by Tim Dettmers et al. in the UW NLP group. Guanaco reaches about 99% of ChatGPT's performance level on the Vicuna benchmark.

Guanaco is an LLM that uses a finetuning method called QLoRA, developed by Tim Dettmers et al. in the UW NLP group. QLoRA makes it possible to finetune a model of up to 65B parameters on a single 48GB GPU without loss of performance relative to 16-bit finetuning. The Guanaco model family outperforms all previous openly released models on the Vicuna benchmark. However, because the models are based on the LLaMA model family, commercial use is not permitted.
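A rough back-of-the-envelope calculation shows why 4-bit weights matter for the 48GB figure. This sketch counts only the memory of the base model weights; it ignores quantization constants, adapter weights, activations, and optimizer state, so it is a lower bound and not a full memory budget. The helper name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(num_params, bits_per_param):
    """Memory for the model weights alone, in gigabytes (10**9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9

params = 65e9  # the 65B parameter model mentioned above

print(weight_memory_gb(params, 16))  # 130.0 -> far above 48 GB
print(weight_memory_gb(params, 4))   # 32.5  -> leaves headroom on a 48 GB GPU
```

At 16 bits per parameter the weights alone exceed a 48GB GPU several times over, while at 4 bits they fit with room left for the remaining training state.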

Initial release: 2023-05-23

Reference

https://github.com/artidoro/qlora

Feature	Gemma 2	Guanaco
Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License	Custom	Noncommercial
Model Sizes	2.6B, 9B, 27B	7B, 13B, 33B, 65B
