Open-source 33B parameter chatbot, finetuned LLaMA using 4-bit QLoRA.
The Guanaco-33B is an open-source, high-quality chatbot model developed by finetuning the 33B parameter LLaMA mode using 4-bit QLoRA. It is competitive with commercial chatbots like ChatGPT on benchmarks.
The Guanaco-33B model is intended for research purposes and may produce problematic outputs. It is available under the Apache 2 license, but requires access to the LLaMA model weights which have additional licensing requirements.
The Guanaco-33B model supports multiple languages, with best performance in high-resource languages due to the composition of the OASST1 dataset used for finetuning.
The Guanaco-33B model is based on the LLaMA architecture, a Transformer-based language model. LoRA adapters with $r=64$ are added to all layers of the base LLaMA model.
The model is finetuned on the OASST1 dataset, a multilingual dataset of open-source assistant conversations. The size and diversity of the dataset allow the model to engage in open-ended conversations on a wide range of topics.
The OASST1 dataset used for finetuning contains over 100,000 conversations in multiple languages. The exact size and composition of the dataset are not publicly disclosed.
The knowledge cutoff date for the Guanaco-33B model is not publicly available. As an open-source model, it may be updated and improved over time.
The OASST1 dataset used for finetuning is multilingual, which helps to reduce bias and improve the model's ability to handle diverse inputs. However, the dataset composition and potential biases are not fully disclosed.
The Guanaco-33B model has been evaluated on several benchmarks, including the Anthropic Chatbot Leaderboard, where it performs competitively with commercial chatbots like ChatGPT and BARD. However, its performance may vary across different languages and tasks not covered by the benchmarks used in its evaluation.
Anthropic has published ethical guidelines for the development and use of the Guanaco-33B model. These guidelines include considerations around transparency, accountability, and the potential for misuse.
The Guanaco-33B model is available under the Apache 2 license, which allows for commercial and non-commercial use.