

Dolly v2 (3B)

Instruction-following model by Databricks, fine-tuned for diverse language tasks.

Model Overview

Basic Information
  • Model Name: Dolly v2 (3B)
  • Developer/Creator: Databricks, Inc.
  • Release Date: April 12, 2023
  • Version: dolly-v2-3b
  • Model Type: Instruction-following Large Language Model
Description

Overview:
Dolly v2 (3B) is an instruction-following large language model created by Databricks. It is derived from EleutherAI's Pythia-2.8b and fine-tuned on approximately 15,000 instruction/response pairs to improve the quality and reliability of its responses to natural-language prompts.
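For reference, the instruction/response pairs are wrapped in a fixed prompt template during both training and generation. The wording below follows the format used in the databricks/dolly training code; {instruction} is a placeholder for the user's request:

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{instruction}

### Response:
```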

Key Features:
  • Fine-tuned on ~15k instruction/response pairs
  • Capable of performing tasks such as brainstorming, classification, closed QA, generation, information extraction, open QA, and summarization
  • Licensed for commercial use
  • Available in larger sizes (dolly-v2-7b and dolly-v2-12b)
Intended Use:

Dolly v2 (3B) is designed for natural language processing tasks including brainstorming, classification, closed and open question answering, generation, information extraction, and summarization. It is suitable for applications that need reliable instruction-following behavior, though it is not a state-of-the-art model.

Language Support:

English. Because both the fine-tuning data and the underlying pretraining corpus are predominantly English, output quality in other languages is likely to be noticeably lower.

Technical Details
Architecture:

Dolly v2 (3B) is based on EleutherAI's Pythia-2.8b, a decoder-only Transformer (GPT-NeoX) language model with approximately 2.8 billion parameters.
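The architecture can be checked directly from the model configuration on the Hugging Face Hub; a minimal sketch using the transformers library (only the config file is downloaded, no weights):

```python
from transformers import AutoConfig

# Fetch only the configuration for the 3B checkpoint.
config = AutoConfig.from_pretrained("databricks/dolly-v2-3b")

print(config.model_type)       # "gpt_neox" -- the Pythia / GPT-NeoX architecture
print(config.num_hidden_layers, config.hidden_size)  # layer count and hidden width
```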

Training Data:

The model was fine-tuned on approximately 15,000 instruction/response pairs written by Databricks employees. This dataset, named databricks-dolly-15k, covers the behavioral categories described in the InstructGPT paper, including brainstorming, classification, QA, and summarization.

Data Source and Size:
  • Source: Instruction/response pairs authored by Databricks employees; for some categories (e.g., closed QA and summarization), contributors drew reference passages from Wikipedia.
  • Size: Approximately 15,000 instruction/response pairs.
  • Knowledge Cutoff: The fine-tuning data was collected in early 2023, but the base model's pretraining corpus (The Pile) was assembled before 2021, so the model's factual knowledge largely predates 2021.
  • Diversity and Bias: The dataset reflects the interests and biases of Databricks employees, which may limit its diversity; the reference passages inherit Wikipedia's biases, and the base model inherits biases from its public-internet pretraining data.
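The databricks-dolly-15k dataset is public on the Hugging Face Hub, so the figures above are easy to verify; a minimal sketch using the datasets library:

```python
from collections import Counter
from datasets import load_dataset

# The open fine-tuning dataset ships as a single "train" split.
ds = load_dataset("databricks/databricks-dolly-15k", split="train")

print(len(ds))          # ~15,000 records
print(ds.column_names)  # ['instruction', 'context', 'response', 'category']

# Records per behavioral category (brainstorming, classification, closed QA, ...).
print(Counter(ds["category"]))
```
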
Performance Metrics:
  • Comparison to Other Models:
    Dolly v2 (3B) outperforms its foundation model, Pythia-2.8b, and is competitive with models of similar parameter count, but it falls short of state-of-the-art models such as GPT-4 and Llama 3.
  • Accuracy: Demonstrates strong instruction-following behavior, but may struggle with syntactically complex prompts, programming problems, mathematical operations, factual accuracy, and handling dates and times.
  • Speed: Optimized for inference on GPUs; performance varies based on hardware.
  • Robustness: Handles a wide range of instructions but may produce errors in specific complex or ambiguous tasks.
Usage
Code Samples/SDK
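A minimal usage sketch with the Hugging Face transformers library, following the pattern documented on the model's Hugging Face page. trust_remote_code=True is required because generation runs through a custom instruction pipeline shipped in the model repository:

```python
import torch
from transformers import pipeline

# Load Dolly v2 (3B) with its bundled instruction-following pipeline.
generate_text = pipeline(
    model="databricks/dolly-v2-3b",
    torch_dtype=torch.bfloat16,  # halves memory on GPUs with bfloat16 support
    trust_remote_code=True,      # enables the custom pipeline from the model repo
    device_map="auto",           # place weights on available GPU(s), else CPU
)

# Pass a plain-English instruction; the pipeline applies the prompt template
# and returns the generated response text.
result = generate_text("Explain the difference between nuclear fission and fusion.")
print(result[0]["generated_text"])
```

On CPU-only machines, dropping the torch_dtype and device_map arguments (and expecting much slower generation) is a reasonable fallback.
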
Ethical Considerations

Databricks is committed to developing AI technologies that are helpful, honest, and harmless. The model has limitations and may produce biased or harmful outputs, reflecting the biases present in the training data.

Licensing

The dolly-v2-3b model weights are released under the MIT license, and the databricks-dolly-15k training dataset under CC BY-SA 3.0; both permit commercial use.
