Sapling Logo

DeepSeek vs. MPT

LLM Comparison


DeepSeek

DeepSeek

Overview

DeepSeek currently offers V3 and R1 models, both of which are highly efficient and performant. V3 is comparable to models such as Anthropic's Sonnet 3.5, while R1 is comparable to models such as OpenAI's o1.


DeepSeek is a Chinese startup that began releasing LLMs in 2023 with DeepSeek Coder. In rapid succession, DeepSeek has since released more powerful models, most notably releasing DeepSeek V3 at the end of 2024 and DeepSeek R1 at the beginning of 2025. DeepSeek V3 and R1 set the frontier in terms of efficiency while maintaining high performance. The release of V3 and R1 sent shockwaves through the US technology sector, especially given the low cost with which V3 and R1 were trained (orders of magnitude less than the cost of training equivalent US models.)


Initial release: 2023-11-29

Further Reading

MPT

MPT

Overview

MPT-7B and MPT-30B are a set of models that are part of MosaicML's Foundation Series. Trained on 1T tokens, the developers state that MPT-7B matches the performance of LLaMA while also being open source, while MPT-30B outperforms the original GPT-3. In addition to the base model, the developers also offer MPT-Instruct, MPT-Chat, and MPT-7B-StoryWriter-65k+, the last of which is trained on a context length of 65K tokens.



Initial release: 2023-05-05

Looking for an LLM API/SDK that works out of the box? No prompts or ad hoc guardrails.

Sapling API
More Comparisons

DeepSeek

MPT

Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License MIT Apache 2.0
Model Sizes 67B, 671B 7B, 30B