LLaMA vs. MPT: Which LLM is Better?

LLaMA vs. MPT

LLM Comparison

LLaMA

Overview

LLaMA was previously Meta AI's most performant LLM available for researchers and noncommercial use cases. It has since been succeeded by Llama 2.

The model that launched a frenzy in open-source instruct-finetuned models, LLaMA is Meta AI's more parameter-efficient, open alternative to large commercial LLMs. Despite being smaller than many commercial models, LLaMA outperformed the gold standard GPT-3 on many benchmarks, with the primary drawback being that its access remains gated to researchers with restrictions on commercial use.

Initial release: 2023-02-24

Reference

https://ai.facebook.com/blog/large-language-model-llama-meta-ai/

MPT

Overview

MPT-7B and MPT-30B are a set of models that are part of MosaicML's Foundation Series. Trained on 1T tokens, the developers state that MPT-7B matches the performance of LLaMA while also being open source, while MPT-30B outperforms the original GPT-3. In addition to the base model, the developers also offer MPT-Instruct, MPT-Chat, and MPT-7B-StoryWriter-65k+, the last of which is trained on a context length of 65K tokens.

Initial release: 2023-05-05

Reference

https://www.mosaicml.com/blog/mpt-30b

	LLaMA	MPT
Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License	Noncommercial	Apache 2.0
Model Sizes	7B, 13B, 33B, 65B	7B, 30B

LLaMA vs. MPT

LLM Comparison

LLaMA

Overview

Reference

Further Reading

MPT

Overview

Reference

Further Reading

LLaMA

MPT

Other LLaMA Comparisons

Other MPT Comparisons

Undecided?