Similar to FLAN-T5, FLAN-UL2 is a model based on Google's popular T5 architecture with an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
LLaMA was previously Meta AI's most performant LLM available for researchers and noncommercial use cases. It has since been succeeded by Llama 2.
The model that launched a frenzy in open-source instruct-finetuned models, LLaMA is Meta AI's more parameter-efficient, open alternative to large commercial LLMs. Despite being smaller than many commercial models, LLaMA outperformed the gold standard GPT-3 on many benchmarks. Its primary drawback is that access remains gated to researchers, with restrictions on commercial use.
Initial release: 2023-02-24
|Products & Features|FLAN-UL2|LLaMA|
|---|---|---|
|Model Sizes|20B|7B, 13B, 33B, 65B|