Similar to FLAN-T5, FLAN-UL2 is a model based on Google's popular T5 architecture with an upgraded pre-training procedure dubbed UL2. On most NLU benchmarks, FLAN-UL2 outperforms FLAN-T5 by a significant margin.
Initial release: 2023-03-03
Released alongside Koala, Vicuna is one of many descendants of the Meta LLaMA model trained on dialogue data collected from the ShareGPT website. According to the authors, Vicuna achieves more than 90% of ChatGPT's quality in user preference tests, while vastly outperforming Alpaca. As of May 2023, Vicuna seems to be the heir apparent of the instruct-finetuned LLaMA model family, though it is also restricted from commercial use.
Initial release: 2023-03-30
Looking for an LLM API/SDK that works out of the box?Sapling API
|Products & Features|