Llama 2 is Meta AI's open source LLM available both research and commercial use case.
The successor to LLaMA (henceforce "Llama 1"), Llama 2 was trained on 40% more data, has double the context length, and was tuned on a large dataset of human preferences (over 1 million such annotations) to ensure helpfulness and safety. It outperforms other open source models on both natural language understanding datasets as well as in head-to-head face-offs.
Initial release: 2023-07-18
MPT-7B and MPT-30B are a set of models that are part of MosaicML's Foundation Series. Trained on 1T tokens, the developers state that MPT-7B matches the performance of LLaMA while also being open source, while MPT-30B outperforms the original GPT-3. In addition to the base model, the developers also offer MPT-Instruct, MPT-Chat, and MPT-7B-StoryWriter-65k+, the last of which is trained on a context length of 65K tokens.
Initial release: 2023-05-05
|Products & Features|
|License||Custom (Commercial OK)||Apache 2.0|
|Model Sizes||7B, 13B, 70B||7B, 30B|