MPT-7B and MPT-30B are a set of models that are part of MosaicML's Foundation Series. Trained on 1T tokens, the developers state that MPT-7B matches the performance of LLaMA while also being open source, while MPT-30B outperforms the original GPT-3. In addition to the base model, the developers also offer MPT-Instruct, MPT-Chat, and MPT-7B-StoryWriter-65k+, the last of which is trained on a context length of 65K tokens.
Initial release: 2023-05-05
Released alongside Koala, Vicuna is one of many descendants of the Meta LLaMA model trained on dialogue data collected from the ShareGPT website. According to the authors, Vicuna achieves more than 90% of ChatGPT's quality in user preference tests, while vastly outperforming Alpaca. As of May 2023, Vicuna seems to be the heir apparent of the instruct-finetuned LLaMA model family, though it is also restricted from commercial use.
Initial release: 2023-03-30
|Products & Features|
|Model Sizes||7B, 30B||13B|