Similar to FLAN-T5, FLAN-UL2 is a model based on Google's popular T5 architecture with an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
RedPajama-INCITE is the first family of models trained on the RedPajama base dataset. The goal of the RedPajama-INCITE models is to replicate the LLaMA recipe but make the model fully open source under the Apache license. As of the initial release, the 3B parameter model is best-in-class, with the 7B parameter model in progress. Update as of June 6, 2023: the 7B parameter model was made available, outperforming other models of the same size.
Initial release: 2023-05-05
|Products & Features|
|License||Apache 2.0||Apache 2.0|
|Model Sizes||20B||3B, 7B|