Similar to FLAN-T5, FLAN-UL2 is a model based on Google's popular T5 architecture with an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
The most recent (as of May 2023) effort from EleutherAI, Pythia is a set of LLMs trained on The Pile. While it appears to outperform OPT and GPTNeo, its performance against GPT-J is unclear. Versions of Pythia have also been instruct-tuned by the team at Together.
Initial release: 2023-02-13
Looking for an LLM API/SDK that works out of the box?Sapling API
|Products & Features|
|License||Apache 2.0||Apache 2.0|
|Model Sizes||20B||1B, 1.4B, 2.8B, 6.9B, 12B|