Guanaco vs. Qwen

LLM Comparison


Guanaco

Overview

Guanaco is an LLM family finetuned with QLoRA, a 4-bit finetuning method developed by Tim Dettmers et al. in the UW NLP group. Guanaco reaches about 99% of ChatGPT's performance on the Vicuna benchmark.


Guanaco is an LLM family that uses a finetuning method called QLoRA, developed by Tim Dettmers et al. in the UW NLP group. With QLoRA, it becomes possible to finetune a model of up to 65B parameters on a single 48GB GPU without loss of performance relative to 16-bit finetuning. The Guanaco model family outperforms all previous openly released models on the Vicuna benchmark. However, because the models are based on the LLaMA model family, commercial use is not permitted.
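
A rough back-of-the-envelope sketch of why 4-bit quantization matters for the 48GB figure above. It counts only the memory needed to store the base model weights. It is an illustration under simplifying assumptions, not the QLoRA authors' accounting: it ignores the LoRA adapter weights, optimizer state, activations, and quantization constants, and the helper name weight_memory_gb is hypothetical.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at the same bit width, with no
    overhead for quantization constants or adapter weights.
    """
    return num_params * bits_per_param / 8 / 1e9


params_65b = 65e9  # 65B parameters, the largest Guanaco size

mem_16bit = weight_memory_gb(params_65b, 16)  # weights stored at 16 bits
mem_4bit = weight_memory_gb(params_65b, 4)    # weights stored at 4 bits

print(f"16-bit weights: {mem_16bit:.1f} GB")
print(f"4-bit weights:  {mem_4bit:.1f} GB")
print(f"Fits in 48 GB at 16-bit? {mem_16bit < 48}")
print(f"Fits in 48 GB at 4-bit?  {mem_4bit < 48}")
```

Under these assumptions the 16-bit weights alone exceed a 48GB GPU, while the 4-bit weights leave headroom for the other parts of training that this estimate leaves out.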


Initial release: 2023-05-23

Qwen

Overview

Qwen is Alibaba Cloud's family of large language models, including chat models, code models, and multimodal variants. Qwen 3 and QwQ are the latest releases, with strong reasoning capabilities.


Qwen (formerly Tongyi Qianwen) is a comprehensive family of large language models developed by Alibaba Cloud. The series includes base models, chat models, code-specific variants (CodeQwen), and multimodal models (Qwen-VL). Qwen models excel at multilingual tasks and have become particularly popular in Asia. The latest releases include Qwen 3 and QwQ (focused on reasoning), competing effectively with Western counterparts while offering strong Chinese language support.


Initial release: 2023-09-13


                       Guanaco              Qwen

Products & Features
Instruct Models
Coding Capability

Customization
Finetuning
Open Source
License                Noncommercial        Custom
Model Sizes            7B, 13B, 33B, 65B    0.5B, 1.5B, 3B, 7B, 14B, 32B, 72B