Guanaco vs. MPT: Which LLM is Better?

Guanaco vs. MPT

LLM Comparison

Guanaco

Overview

Guanaco is an LLM based off the QLoRA 4-bit finetuning method developed by Tim Dettmers et. al. in the UW NLP group. Guanaco achieves 99% ChatGPT performance on the Vicuna benchmark.

Guanaco is an LLM that uses a finetuning method called LoRA that was developed by Tim Dettmers et. al. in the UW NLP group. With QLoRA, it becomes possible to finetune up to a 65B parameter model on a 48GB GPU without loss of performance relative to a 16-bit model. The Guanaco model family outperforms all previously released models on the Vicuna benchmark. However, given the models are based off of the LLaMA model family, commercial use is not permitted.

Initial release: 2023-05-23

Reference

https://github.com/artidoro/qlora

MPT

Overview

MPT-7B and MPT-30B are a set of models that are part of MosaicML's Foundation Series. Trained on 1T tokens, the developers state that MPT-7B matches the performance of LLaMA while also being open source, while MPT-30B outperforms the original GPT-3. In addition to the base model, the developers also offer MPT-Instruct, MPT-Chat, and MPT-7B-StoryWriter-65k+, the last of which is trained on a context length of 65K tokens.

Initial release: 2023-05-05

Reference

https://www.mosaicml.com/blog/mpt-30b

	Guanaco	MPT
Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License	Noncommercial	Apache 2.0
Model Sizes	7B, 13B, 33B, 65B	7B, 30B

Guanaco vs. MPT

LLM Comparison

Guanaco

Overview

Reference

Further Reading

MPT

Overview

Reference

Further Reading

Guanaco

MPT

Other Guanaco Comparisons

Other MPT Comparisons

Undecided?