Similar to FLAN-T5, FLAN-UL2 is a model based on Google's popular T5 architecture with an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
The model that launched a frenzy in open-source instruct-finetuned models, LLaMA is Meta AI's more parameter-efficient, open-source alternative to large commercial LLMs. Despite being smaller than many commercial models, LLaMA outperformed the gold standard GPT-3 on many benchmarks, with the primary drawback being that its access remains gated to researchers with restrictions on commercial use.
Initial release: 2023-02-24
Looking for an LLM API/SDK that works out of the box?Sapling API
|Products & Features|
|Model Sizes||20B||7B, 13B, 33B, 65B|