MPT-7B is a family of models in MosaicML's Foundation Series. Trained on 1T tokens, the developers state that the base model matches the quality of LLaMA-7B while also being open source. In addition to the base model, the developers offer MPT-7B-Instruct, MPT-7B-Chat, and MPT-7B-StoryWriter-65k+, the last of which is fine-tuned to handle a context length of 65K tokens.
Initial release: 2023-05-05
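For reference, a minimal sketch of loading an MPT-7B checkpoint with Hugging Face `transformers`, assuming the published MosaicML model IDs. MPT ships custom modeling code, so `trust_remote_code=True` is required, and the model cards pair the checkpoints with the `EleutherAI/gpt-neox-20b` tokenizer:

```python
# Minimal sketch: load an MPT-7B checkpoint via Hugging Face transformers.
# Swap in mosaicml/mpt-7b-instruct, mpt-7b-chat, or mpt-7b-storywriter
# for the fine-tuned variants.
from transformers import AutoModelForCausalLM, AutoTokenizer

# MPT uses custom modeling code, so trust_remote_code must be enabled.
model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b", trust_remote_code=True
)
# The MPT model cards use the GPT-NeoX-20B tokenizer.
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

inputs = tokenizer("MosaicML's MPT-7B is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```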
Open Pre-trained Transformer Language Models (OPT) is Meta AI's family of open-source, decoder-only models designed to replicate GPT-3. It has since been surpassed by models such as LLaMA, GPT-J, and Pythia.
Initial release: 2022-05-03
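OPT checkpoints are supported natively in Hugging Face `transformers`, so no custom-code flag is needed. A minimal sketch using the 1.3B size, assuming the published `facebook/opt-*` model IDs:

```python
# Minimal sketch: load an OPT checkpoint via Hugging Face transformers.
# OPT is supported natively, so no trust_remote_code flag is needed.
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("facebook/opt-1.3b")
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-1.3b")

inputs = tokenizer("OPT is a family of", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```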
| Products & Features | MPT-7B | OPT |
|---|---|---|
| Model Sizes | 7B | 1.3B, 2.7B, 6.7B, 13B, 30B, 66B, 175B |