FLAN-T5 vs. Vicuna: Which LLM is Better?

FLAN-T5 vs. Vicuna

LLM Comparison

FLAN-T5

Overview

FLAN-T5 is a finetuned version of Google's popular T5 model with instruct-finetuning. As stated in the model repository's introduction, compared to T5, FLAN-T5 is "just better at everything." With its permissive license, FLAN-T5 has become a popular option for a starting instruct model.

Initial release: 2022-12-06

Reference

https://huggingface.co/docs/transformers/model_doc/flan-t5

Vicuna

Overview

Released alongside Koala, Vicuna is one of many descendants of the Meta LLaMA model trained on dialogue data collected from the ShareGPT website. According to the authors, Vicuna achieves more than 90% of ChatGPT's quality in user preference tests, while vastly outperforming Alpaca. As of May 2023, Vicuna seems to be the heir apparent of the instruct-finetuned LLaMA model family, though it is also restricted from commercial use.

Initial release: 2023-03-30

Reference

https://vicuna.lmsys.org/

	FLAN-T5	Vicuna
Products & Features
Instruct Models
Coding Capability
Customization
Finetuning
Open Source
License	Apache 2.0	Noncommercial
Model Sizes	3B, 11B	13B

FLAN-T5 vs. Vicuna

LLM Comparison

FLAN-T5

Overview

Reference

Further Reading

Vicuna

Overview

Reference

Further Reading

FLAN-T5

Vicuna

Other FLAN-T5 Comparisons

Other Vicuna Comparisons

Undecided?