Replicate - Vicuna 13B

Setup

If you're opening this Notebook on colab, you will probably need to install LlamaIndex 🦙.

python

%pip install llama-index-llms-replicate

python

!pip install llama-index

Make sure you have the REPLICATE_API_TOKEN environment variable set.
If you don't have one yet, go to https://replicate.com/ to obtain one.

python

import os

python

os.environ["REPLICATE_API_TOKEN"] = "<your API key>"

Basic Usage

We showcase the "vicuna-13b" model, which you can play with directly at: https://replicate.com/replicate/vicuna-13b

python

from llama_index.llms.replicate import Replicate

llm = Replicate(
    model="replicate/vicuna-13b:6282abe6a492de4145d7bb601023762212f9ddbbe78278bd6771c8b3b2f2a13b"
)

Call `complete` with a prompt

python

resp = llm.complete("Who is Paul Graham?")

python

print(resp)

Call `chat` with a list of messages

python

from llama_index.core.llms import ChatMessage

messages = [
    ChatMessage(
        role="system", content="You are a pirate with a colorful personality"
    ),
    ChatMessage(role="user", content="What is your name"),
]
resp = llm.chat(messages)

python

print(resp)

Streaming

Using stream_complete endpoint

python

response = llm.stream_complete("Who is Paul Graham?")

python

for r in response:
    print(r.delta, end="")

Using stream_chat endpoint

python

from llama_index.core.llms import ChatMessage

messages = [
    ChatMessage(
        role="system", content="You are a pirate with a colorful personality"
    ),
    ChatMessage(role="user", content="What is your name"),
]
resp = llm.stream_chat(messages)

python

for r in resp:
    print(r.delta, end="")

Configure Model

python

from llama_index.llms.replicate import Replicate

llm = Replicate(
    model="replicate/vicuna-13b:6282abe6a492de4145d7bb601023762212f9ddbbe78278bd6771c8b3b2f2a13b",
    temperature=0.9,
    max_tokens=32,
)

python

resp = llm.complete("Who is Paul Graham?")

python

print(resp)

Replicate - Vicuna 13B

Replicate - Vicuna 13B

Setup

Basic Usage

Call complete with a prompt

Call chat with a list of messages

Streaming

Configure Model

Call `complete` with a prompt

Call `chat` with a list of messages