Like FLAN-T5, FLAN-UL2 is based on Google's popular T5 architecture, but it uses an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
OpenLLaMA is an effort from OpenLM Research to offer a non-gated version of LLaMA that can be used for both research and commercial applications. As of 5 May 2023, the model is still being trained.
Initial release: 2023-04-28
|Products & Features|
|License|Apache 2.0|Apache 2.0|