Similar to FLAN-T5, FLAN-UL2 is a model based on Google's popular T5 architecture with an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
OpenLLaMA is an effort from OpenLM Research to offer a non-gated version of LLaMA that can be used for both research and commercial applications. As of June 2023, training is still in progress, with 3B, 7B, and 13B parameter models available.
Initial release: 2023-04-28
|Products & Features|FLAN-UL2|OpenLLaMA|
|---|---|---|
|License|Apache 2.0|Apache 2.0|
|Model Sizes|20B|3B, 7B, 13B|