Similar to FLAN-T5, FLAN-UL2 is a model based on Google's popular T5 architecture with an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
GPTNeo is a model released by EleutherAI as an open-source alternative with capabilities similar to OpenAI's GPT-3. One of the earliest such models, GPTNeo was trained on The Pile, EleutherAI's large, diverse text corpus.
Initial release: 2021-03-21
|Products & Features|FLAN-UL2|GPTNeo|
|---|---|---|
|License|Apache 2.0|Apache 2.0|
|Model Sizes|20B|1.3B, 2.7B|