llama-index-integrations/llms/llama-index-llms-konko/README.md
Install the required Python packages:
%pip install llama-index-llms-konko
!pip install llama-index
Set the API keys as environment variables:
export KONKO_API_KEY=<your-api-key>
export OPENAI_API_KEY=<your-api-key>
import os
from llama_index.llms.konko import Konko
from llama_index.core.llms import ChatMessage
To chat with a Konko model:
os.environ["KONKO_API_KEY"] = "<your-api-key>"
llm = Konko(model="meta-llama/llama-2-13b-chat")
messages = ChatMessage(role="user", content="Explain Big Bang Theory briefly")
resp = llm.chat([messages])
print(resp)
To chat with an OpenAI model:
os.environ["OPENAI_API_KEY"] = "<your-api-key>"
llm = Konko(model="gpt-3.5-turbo")
message = ChatMessage(role="user", content="Explain Big Bang Theory briefly")
resp = llm.chat([message])
print(resp)
To stream a response for longer messages:
message = ChatMessage(role="user", content="Tell me a story in 250 words")
resp = llm.stream_chat([message], max_tokens=1000)
for r in resp:
print(r.delta, end="")
To generate a completion based on a system prompt:
llm = Konko(model="phind/phind-codellama-34b-v2", max_tokens=100)
text = """### System Prompt
You are an intelligent programming assistant.
### User Message
Implement a linked list in C++
### Assistant
..."""
resp = llm.stream_complete(text, max_tokens=1000)
for r in resp:
print(r.delta, end="")