FLAN-T5 is an instruction-finetuned version of Google's popular T5 model. As stated in the model repository's introduction, compared to T5, FLAN-T5 is "just better at everything." With its permissive license, FLAN-T5 has become a popular choice as a starting instruct model.
Initial release: 2022-12-06
GPTNeo is a model released by EleutherAI to provide an open-source model with capabilities similar to OpenAI's GPT-3. One of the earliest such models, GPTNeo was trained on The Pile, EleutherAI's large-scale text corpus.
Initial release: 2021-03-21
| Products & Features | FLAN-T5 | GPTNeo |
|---|---|---|
| License | Apache 2.0 | Apache 2.0 |
| Model Sizes | 3B, 11B | 1.5B |