Llama 2 vs. MPT: Which LLM is Better?

Llama 2 vs. MPT

LLM Comparison

Llama 2

Overview

Llama 2 is Meta AI's open source LLM available for both research and commercial use cases (assuming you're not one of the top consumer companies in the world).

The successor to LLaMA (henceforce "Llama 1"), Llama 2 was trained on 40% more data, has double the context length, and was tuned on a large dataset of human preferences (over 1 million such annotations) to ensure helpfulness and safety. It outperforms other open source models on both natural language understanding datasets as well as in head-to-head face-offs.

Initial release: 2023-07-18

Reference

https://ai.meta.com/llama/

MPT

Overview

MPT-7B and MPT-30B are a set of models that are part of MosaicML's Foundation Series. Trained on 1T tokens, the developers state that MPT-7B matches the performance of LLaMA while also being open source, while MPT-30B outperforms the original GPT-3. In addition to the base model, the developers also offer MPT-Instruct, MPT-Chat, and MPT-7B-StoryWriter-65k+, the last of which is trained on a context length of 65K tokens.

Initial release: 2023-05-05

Reference

https://www.mosaicml.com/blog/mpt-30b

	Llama 2	MPT
Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License	Custom (Commercial OK)	Apache 2.0
Model Sizes	7B, 13B, 70B	7B, 30B

Llama 2 vs. MPT

LLM Comparison

Llama 2

Overview

Reference

Further Reading

MPT

Overview

Reference

Further Reading

Llama 2

MPT

Other Llama 2 Comparisons

Other MPT Comparisons

Undecided?