Guanaco vs. Mistral: Which LLM is Better?

Guanaco vs. Mistral

LLM Comparison

Guanaco

Overview

Guanaco is an LLM based off the QLoRA 4-bit finetuning method developed by Tim Dettmers et. al. in the UW NLP group. Guanaco achieves 99% ChatGPT performance on the Vicuna benchmark.

Guanaco is an LLM that uses a finetuning method called LoRA that was developed by Tim Dettmers et. al. in the UW NLP group. With QLoRA, it becomes possible to finetune up to a 65B parameter model on a 48GB GPU without loss of performance relative to a 16-bit model. The Guanaco model family outperforms all previously released models on the Vicuna benchmark. However, given the models are based off of the LLaMA model family, commercial use is not permitted.

Initial release: 2023-05-23

Reference

https://github.com/artidoro/qlora

Mistral

Overview

Developed by some of the researchers behind Llama, the Mistral large language models were an early standard for accessible and performant open source models.

Mistral AI offers 7B and mixture-of-experts models (8x7B Mixtral and 8x22B Mixtral) that are competitive or better than commercial models of similar size. Available under the Apache 2.0 license, the Mistral models are now also available via most cloud vendors. The latest Mixtral 8x22B represents state-of-the-art performance in the open-source domain. Mistral AI has also begun offering proprietary Small, Large, and Edge models via their business API.

Initial release: 2023-09-27

Reference

https://mistral.ai/technology/#models

	Guanaco	Mistral
Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License	Noncommercial	Apache 2.0
Model Sizes	7B, 13B, 33B, 65B	7B, 8x7B, 8x22B

Guanaco vs. Mistral

LLM Comparison

Guanaco

Overview

Reference

Further Reading

Mistral

Overview

Reference

Further Reading

Guanaco

Mistral

Other Guanaco Comparisons

Other Mistral Comparisons

Undecided?