DeepSeek vs. MPT: Which LLM is Better?

DeepSeek vs. MPT

LLM Comparison

DeepSeek

Overview

DeepSeek currently offers V3 and R1 models, both of which are highly efficient and performant. V3 is comparable to models such as Anthropic's Sonnet 3.5, while R1 is comparable to models such as OpenAI's o1.

DeepSeek is a Chinese startup that began releasing LLMs in 2023 with DeepSeek Coder. In rapid succession, DeepSeek has since released more powerful models, most notably releasing DeepSeek V3 at the end of 2024 and DeepSeek R1 at the beginning of 2025. DeepSeek V3 and R1 set the frontier in terms of efficiency while maintaining high performance. The release of V3 and R1 sent shockwaves through the US technology sector, especially given the low cost with which V3 and R1 were trained (orders of magnitude less than the cost of training equivalent US models.)

Initial release: 2023-11-29

Reference

https://www.deepseek.com/

MPT

Overview

MPT-7B and MPT-30B are a set of models that are part of MosaicML's Foundation Series. Trained on 1T tokens, the developers state that MPT-7B matches the performance of LLaMA while also being open source, while MPT-30B outperforms the original GPT-3. In addition to the base model, the developers also offer MPT-Instruct, MPT-Chat, and MPT-7B-StoryWriter-65k+, the last of which is trained on a context length of 65K tokens.

Initial release: 2023-05-05

Reference

https://www.mosaicml.com/blog/mpt-30b

	DeepSeek	MPT
Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License	MIT	Apache 2.0
Model Sizes	67B, 671B	7B, 30B

DeepSeek vs. MPT

LLM Comparison

DeepSeek

Overview

Reference

Further Reading

MPT

Overview

Reference

Further Reading

DeepSeek

MPT

Other DeepSeek Comparisons

Other MPT Comparisons

Undecided?