Back to Supermemory

Personal AI Assistant

apps/docs/cookbook/personal-assistant.mdx

latest25.1 KB
Original Source

Build a personal AI assistant that learns and remembers everything about the user - their preferences, habits, work context, and conversation history.

What You'll Build

A personal AI assistant that:

  • Remembers user preferences (dietary restrictions, work schedule, communication style)
  • Maintains context across multiple chat sessions
  • Provides personalized recommendations based on user history
  • Handles multiple conversation topics while maintaining context

Choose Your Implementation

<CardGroup cols={2}> <Card title="Python + FastAPI" icon="python" href="#python-implementation"> Thoroughly tested, production-ready. Uses FastAPI + Streamlit + OpenAI. </Card> <Card title="TypeScript + AI SDK" icon="triangle" href="#typescript-implementation"> Modern React approach. Uses Next.js + Vercel AI SDK + Supermemory tools. </Card> </CardGroup>

Prerequisites

<Warning> Never hardcode API keys in your code. Use environment variables. </Warning>

Python Implementation

Step 1: Project Setup

bash
mkdir personal-ai && cd personal-ai
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install supermemory openai fastapi uvicorn python-dotenv streamlit requests

Create a .env file:

bash
SUPERMEMORY_API_KEY=your_supermemory_key_here
OPENAI_API_KEY=your_openai_key_here

Step 2: Backend (FastAPI)

Create main.py. Let's build it step by step:

Import Dependencies

python
from fastapi import FastAPI, HTTPException
from fastapi.responses import StreamingResponse
from openai import AsyncOpenAI
from supermemory import Supermemory
import json
import os
import uuid
from dotenv import load_dotenv
  • FastAPI: Web framework for building the API endpoint
  • StreamingResponse: Enables real-time response streaming (words appear as they're generated)
  • AsyncOpenAI: OpenAI client that supports async/await for non-blocking operations
  • Supermemory: Client for storing and retrieving long-term memories
  • uuid: Creates stable, deterministic user IDs from emails

Initialize Application and Clients

python
load_dotenv()
app = FastAPI()

openai_client = AsyncOpenAI(api_key=os.getenv("OPENAI_API_KEY"))
supermemory_client = Supermemory(api_key=os.getenv("SUPERMEMORY_API_KEY"))

load_dotenv() loads API keys from your .env file into environment variables. We create two clients:

  • OpenAI client: Handles conversations and generates responses
  • Supermemory client: Stores and retrieves user-specific memories

These are separate because you can swap providers independently (e.g., switch from OpenAI to Anthropic without changing memory logic).

Define System Prompt

python
SYSTEM_PROMPT = """You are a highly personalized AI assistant.

MEMORY MANAGEMENT:
1. When users share personal information, store it immediately
2. Search for relevant context before responding
3. Use past conversations to inform current responses

Always be helpful while respecting privacy."""

This prompt guides the assistant's behavior. It tells the AI to:

  • Be proactive about learning user preferences
  • Always search memory before responding
  • Respect privacy boundaries

The system prompt is injected at the start of every conversation, so the AI consistently follows these rules.

Create Identity Helpers

python
def normalize_email(email: str) -> str:
    return (email or "").strip().lower()

def stable_user_id_from_email(email: str) -> str:
    norm = normalize_email(email)
    if not norm:
        raise ValueError("Email is required")
    return uuid.uuid5(uuid.NAMESPACE_DNS, norm).hex

Why normalize? "[email protected]" and " [email protected] " should map to the same person. We trim whitespace and lowercase to ensure consistency.

Why UUIDv5? It's deterministic—same email always produces the same ID. This means:

  • User memories persist across sessions
  • No raw emails in logs or database tags
  • Privacy-preserving yet stable identity

We use uuid.NAMESPACE_DNS as the namespace to ensure uniqueness.

Memory Search Function

python
async def search_user_memories(query: str, container_tag: str) -> str:
    try:
        results = supermemory_client.search.memories(
            q=query,
            container_tag=container_tag,
            limit=5
        )
        if results.results:
            context = "\n".join([r.memory for r in results.results])
            return f"Relevant memories:\n{context}"
        return "No relevant memories found."
    except Exception as e:
        return f"Error searching memories: {e}"

This searches the user's memory store for context relevant to their current message.

Parameters:

  • q: The search query (usually the user's latest message)
  • container_tag: Isolates memories per user (e.g., user_abc123)
  • limit=5: Returns top 5 most relevant memories

Why search before responding? The AI can provide personalized answers based on what it knows about the user (e.g., dietary preferences, work context, communication style).

Error handling: If memory search fails, we return a fallback message instead of crashing. The conversation continues even if memory has a hiccup.

Memory Storage Function

python
async def add_user_memory(content: str, container_tag: str, email: str = None):
    try:
        supermemory_client.add(
            content=content,
            container_tag=container_tag,
            metadata={"type": "personal_info", "email": normalize_email(email) if email else None}
        )
    except Exception as e:
        print(f"Error adding memory: {e}")

Stores new information about the user.

Parameters:

  • content: The text to remember
  • container_tag: User isolation tag
  • metadata: Additional context (type of info, associated email)

Why metadata? Makes it easier to filter and organize memories later (e.g., "show me all personal_info memories").

Error handling: We log errors but don't crash. Failing to save one memory shouldn't break the entire conversation.

Main Chat Endpoint

python
@app.post("/chat")
async def chat_endpoint(data: dict):
    messages = data.get("messages", [])
    email = data.get("email")

    if not messages:
        raise HTTPException(status_code=400, detail="No messages provided")
    if not email:
        raise HTTPException(status_code=400, detail="Email required")

This endpoint receives the chat request. It expects:

  • messages: Full conversation history [{role: "user", content: "..."}]
  • email: User's email for identity

Why require email? Without it, we can't create a stable user ID, meaning no persistent personalization.

Derive User Identity

python
    try:
        user_id = stable_user_id_from_email(email)
    except ValueError as e:
        raise HTTPException(status_code=400, detail=str(e))

    container_tag = f"user_{user_id}"

Convert email → stable user ID → container tag.

The container tag (user_abc123) isolates this user's memories from everyone else's. Each user has their own "memory box."

Search and Inject Memories

python
    user_message = messages[-1]["content"]
    memory_context = await search_user_memories(user_message, container_tag)

    enhanced_messages = [
        {"role": "system", "content": f"{SYSTEM_PROMPT}\n\n{memory_context}"}
    ] + messages

We take the user's latest message, search for relevant memories, then inject them into the system prompt.

Example:

Original: "What should I eat for breakfast?"

Enhanced system message:
"You are a helpful assistant... [system prompt]

Relevant memories:
- User is vegetarian
- User works out at 6 AM
- User prefers quick meals"

Now the AI can answer: "Try overnight oats with plant-based protein—perfect for post-workout!"

Stream OpenAI Response

python
    try:
        response = await openai_client.chat.completions.create(
            model="gpt-5",
            messages=enhanced_messages,
            temperature=0.7,
            stream=True
        )

Key parameters:

  • model="gpt-5": Fast, capable model
  • messages: Full conversation + memory context
  • temperature=0.7: Balanced creativity (0=deterministic, 1=creative)
  • stream=True: Enables word-by-word streaming

Why stream? Users see responses appear in real-time instead of waiting for the complete answer. Much better UX.

Handle Streaming

python
        async def generate():
            try:
                async for chunk in response:
                    if chunk.choices[0].delta.content:
                        content = chunk.choices[0].delta.content
                        yield f"data: {json.dumps({'content': content})}\n\n"
            except Exception as e:
                yield f"data: {json.dumps({'error': str(e)})}\n\n"

This async generator:

  1. Receives chunks from OpenAI as they're generated
  2. Extracts the text content from each chunk
  3. Formats it as Server-Sent Events (SSE): data: {...}\n\n
  4. Yields it to the client

SSE format is a web standard for server→client streaming. The frontend can process each chunk as it arrives.

Optional Memory Storage

python
            if "remember this" in user_message.lower():
                await add_user_memory(user_message, container_tag, email=email)

After streaming completes, check if the user explicitly asked to remember something. If yes, store it.

Why opt-in? Gives users control over what gets remembered. You could also make this automatic based on content analysis.

Return Streaming Response

python
        return StreamingResponse(generate(), media_type="text/plain")

    except Exception as e:
        raise HTTPException(status_code=500, detail=str(e))

StreamingResponse keeps the HTTP connection open and sends chunks as they're generated. The frontend receives them in real-time.

Local Development Server

python
if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=8000)

Run with python main.py and the server starts on port 8000. 0.0.0.0 means it accepts connections from any IP (useful for testing from other devices).

Step 3: Frontend (Streamlit)

Create streamlit_app.py:

<Accordion title="Complete Frontend Code" defaultOpen>
python
import streamlit as st
import requests
import json
import uuid

st.set_page_config(page_title="Personal AI Assistant", page_icon="🤖", layout="wide")

def normalize_email(email: str) -> str:
    return (email or "").strip().lower()

def stable_user_id_from_email(email: str) -> str:
    return uuid.uuid5(uuid.NAMESPACE_DNS, normalize_email(email)).hex

# Session state
if 'messages' not in st.session_state:
    st.session_state.messages = []
if 'user_name' not in st.session_state:
    st.session_state.user_name = None
if 'email' not in st.session_state:
    st.session_state.email = None
if 'user_id' not in st.session_state:
    st.session_state.user_id = None

st.title("🤖 Personal AI Assistant")
st.markdown("*Your AI that learns and remembers*")

with st.sidebar:
    st.header("👤 User Profile")

    if not st.session_state.user_name or not st.session_state.email:
        name = st.text_input("What should I call you?")
        email = st.text_input("Email", placeholder="[email protected]")

        if st.button("Get Started"):
            if name and email:
                st.session_state.user_name = name
                st.session_state.email = normalize_email(email)
                st.session_state.user_id = stable_user_id_from_email(st.session_state.email)
                st.session_state.messages.append({
                    "role": "user",
                    "content": f"Hi! My name is {name}."
                })
                st.rerun()
            else:
                st.warning("Please enter both fields.")
    else:
        st.write(f"**Name:** {st.session_state.user_name}")
        st.write(f"**Email:** {st.session_state.email}")
        if st.button("Reset Conversation"):
            st.session_state.messages = []
            st.rerun()

if st.session_state.user_name and st.session_state.email:
    for message in st.session_state.messages:
        with st.chat_message(message["role"]):
            st.markdown(message["content"])

    if prompt := st.chat_input("Message..."):
        st.session_state.messages.append({"role": "user", "content": prompt})
        with st.chat_message("user"):
            st.markdown(prompt)

        with st.chat_message("assistant"):
            try:
                response = requests.post(
                    "http://localhost:8000/chat",
                    json={
                        "messages": st.session_state.messages,
                        "email": st.session_state.email
                    },
                    stream=True,
                    timeout=30
                )

                if response.status_code == 200:
                    full_response = ""
                    for line in response.iter_lines():
                        if line:
                            try:
                                data = json.loads(line.decode('utf-8').replace('data: ', ''))
                                if 'content' in data:
                                    full_response += data['content']
                            except:
                                continue

                    st.markdown(full_response)
                    st.session_state.messages.append({"role": "assistant", "content": full_response})
                else:
                    st.error(f"Error: {response.status_code}")
            except Exception as e:
                st.error(f"Error: {e}")
else:
    st.info("Please enter your profile in the sidebar")
</Accordion>

Step 4: Run It

Terminal 1 - Start backend:

bash
python main.py

Terminal 2 - Start frontend:

bash
streamlit run streamlit_app.py

Open http://localhost:8501 in your browser.


TypeScript Implementation

Step 1: Project Setup

bash
npx create-next-app@latest personal-ai --typescript --tailwind --app
cd personal-ai
npm install @supermemory/tools ai @ai-sdk/openai

Create .env.local:

bash
SUPERMEMORY_API_KEY=your_supermemory_key_here
OPENAI_API_KEY=your_openai_key_here

Step 2: API Route

Create app/api/chat/route.ts. Let's break it down:

Import Dependencies

typescript
import { streamText } from 'ai'
import { createOpenAI } from '@ai-sdk/openai'
import { supermemoryTools } from '@supermemory/tools/ai-sdk'
  • streamText: Vercel AI SDK function that handles streaming responses and tool calling
  • createOpenAI: Factory function to create an OpenAI provider
  • supermemoryTools: Pre-built tools for memory search and storage

Initialize OpenAI Provider

typescript
const openai = createOpenAI({
  apiKey: process.env.OPENAI_API_KEY!
})

Creates an OpenAI provider configured with your API key. The ! tells TypeScript "this definitely exists" (because we set it in .env.local).

This provider object will be passed to streamText to specify which AI model to use.

Define System Prompt

typescript
const SYSTEM_PROMPT = `You are a highly personalized AI assistant.

When users share personal information, remember it using the addMemory tool.
Before responding, search your memories using searchMemories to provide personalized help.
Always be helpful while respecting privacy.`

This guides the AI's behavior and tells it:

  • When to use tools: Search memories before responding, add memories when users share info
  • Personality: Be helpful and personalized
  • Boundaries: Respect privacy

The AI SDK uses this to decide when to call searchMemories and addMemory tools automatically.

Create POST Handler

typescript
export async function POST(req: Request) {
  try {
    const { messages, email } = await req.json()

Next.js App Router convention: export an async function named after the HTTP method. This handles POST requests to /api/chat.

We extract:

  • messages: Chat history array [{role, content}]
  • email: User identifier

Validate Input

typescript
    if (!messages?.length) {
      return new Response('No messages provided', { status: 400 })
    }
    if (!email) {
      return new Response('Email required', { status: 400 })
    }

Why validate? Prevents crashes from malformed requests. We need:

  • At least one message to respond to
  • An email to isolate user memories

Without email, we can't maintain personalization across sessions.

Create Container Tag

typescript
    const containerTag = `user_${email.toLowerCase().trim()}`

Convert email to a container tag for memory isolation.

Simpler than Python: We skip UUID generation here for simplicity. In production, you might want to hash the email for privacy:

typescript
// Optional: More privacy-preserving approach
import crypto from 'crypto'
const containerTag = `user_${crypto.createHash('sha256').update(email).digest('hex').slice(0, 16)}`

Call streamText with Tools

typescript
    const result = streamText({
      model: openai('gpt-5'),
      messages,
      tools: supermemoryTools(process.env.SUPERMEMORY_API_KEY!, {
        containerTags: [containerTag]
      }),
      system: SYSTEM_PROMPT
    })

This is where the magic happens! Let's break down each parameter:

model: openai('gpt-5')

  • Specifies which AI model to use
  • The AI SDK handles the API calls

messages

  • Full conversation history
  • Format: [{role: "user"|"assistant", content: "..."}]

tools: supermemoryTools(...)

  • Gives the AI access to memory operations
  • The AI SDK automatically:
    • Decides when to call tools based on the conversation
    • Calls searchMemories when it needs context
    • Calls addMemory when users share information
    • Handles tool execution and error handling

containerTags: [containerTag]

  • Scopes all memory operations to this specific user
  • Ensures User A can't access User B's memories

system: SYSTEM_PROMPT

  • Guides the AI's behavior and tool usage

How tools work:

  1. User: "Remember that I'm vegetarian"
  2. AI SDK detects this is memory-worthy
  3. Automatically calls addMemory("User is vegetarian")
  4. Stores in Supermemory with the user's container tag
  5. Responds: "Got it, I'll remember that!"

Later:

  1. User: "What should I eat?"
  2. AI SDK calls searchMemories("food preferences")
  3. Retrieves: "User is vegetarian"
  4. Responds: "How about a delicious veggie stir-fry?"

No manual tool handling needed! The AI SDK manages the entire flow.

Return Streaming Response

typescript
    return result.toAIStreamResponse()

toAIStreamResponse() converts the streaming result into a format the frontend can consume. It:

  • Sets appropriate headers for streaming
  • Formats data for the useChat hook
  • Handles errors gracefully

This returns immediately (doesn't wait for completion), and chunks stream to the client as they're generated.

Error Handling

typescript
  } catch (error: any) {
    console.error('Chat error:', error)
    return new Response(error.message, { status: 500 })
  }
}

Catches any errors (API failures, tool errors, etc.) and returns a clean error response.

Why log to console? In production, you'd send this to a monitoring service (Sentry, DataDog, etc.) to track issues.


Key Differences from Python:

AspectPythonTypeScript
Memory SearchManual search_user_memories() callAI SDK calls searchMemories tool automatically
Memory AddManual add_user_memory() callAI SDK calls addMemory tool automatically
Tool DecisionYou decide when to search/addAI decides based on conversation context
StreamingManual SSE formattingtoAIStreamResponse() handles it
Error HandlingTry/catch in each functionAI SDK handles tool errors

Python = Manual Control You explicitly search and add memories. More control, more code.

TypeScript = AI-Driven The AI decides when to use tools. Less code, more "magic."

Step 3: Chat UI

Replace app/page.tsx:

<Accordion title="Complete Frontend Code" defaultOpen>
typescript
'use client'
import { useChat } from 'ai/react'
import { useState } from 'react'

export default function ChatPage() {
  const [email, setEmail] = useState('')
  const [userName, setUserName] = useState('')
  const [tempEmail, setTempEmail] = useState('')
  const [tempName, setTempName] = useState('')

  const { messages, input, handleInputChange, handleSubmit } = useChat({
    api: '/api/chat',
    body: { email }
  })

  if (!email) {
    return (
      <div className="flex items-center justify-center min-h-screen p-4">
        <div className="w-full max-w-md space-y-4 p-6 bg-white rounded-lg shadow-lg">
          <h1 className="text-2xl font-bold text-center">🤖 Personal AI Assistant</h1>
          <input
            type="text"
            placeholder="Your name"
            value={tempName}
            onChange={(e) => setTempName(e.target.value)}
            className="w-full px-4 py-2 border rounded-lg"
          />
          <input
            type="email"
            placeholder="[email protected]"
            value={tempEmail}
            onChange={(e) => setTempEmail(e.target.value)}
            className="w-full px-4 py-2 border rounded-lg"
          />
          <button
            onClick={() => {
              if (tempName && tempEmail) {
                setUserName(tempName)
                setEmail(tempEmail.toLowerCase().trim())
              }
            }}
            className="w-full px-4 py-2 bg-blue-600 text-white rounded-lg hover:bg-blue-700"
          >
            Get Started
          </button>
        </div>
      </div>
    )
  }

  return (
    <div className="flex flex-col h-screen max-w-4xl mx-auto p-4">
      <div className="flex-1 overflow-y-auto space-y-4 mb-4">
        {messages.map((message) => (
          <div
            key={message.id}
            className={`p-4 rounded-lg ${
              message.role === 'user'
                ? 'bg-blue-100 ml-auto max-w-[80%]'
                : 'bg-gray-100 mr-auto max-w-[80%]'
            }`}
          >
            <p className="whitespace-pre-wrap">{message.content}</p>
          </div>
        ))}
      </div>

      <form onSubmit={handleSubmit} className="flex gap-2">
        <input
          value={input}
          onChange={handleInputChange}
          placeholder="Tell me about yourself..."
          className="flex-1 p-3 border rounded-lg"
        />
        <button
          type="submit"
          className="px-6 py-3 bg-blue-600 text-white rounded-lg hover:bg-blue-700"
        >
          Send
        </button>
      </form>
    </div>
  )
}
</Accordion>

Step 4: Run It

bash
npm run dev

Open http://localhost:3000


Testing Your Assistant

Try these conversations to test memory:

Personal Preferences:

User: "I'm Sarah, a product manager. I prefer brief responses."
[Later]
User: "What's a good way to prioritize features?"
Assistant: [Should reference PM role and brevity preference]

Dietary & Lifestyle:

User: "Remember I'm vegan and work out at 6 AM."
[Later]
User: "Suggest a quick breakfast."
Assistant: [Should suggest vegan options for pre/post workout]

Work Context:

User: "I'm working on a React project with TypeScript."
[Later]
User: "Help me with state management."
Assistant: [Should suggest TypeScript-specific solutions]

Verify Memory Storage

Python

Create check_memories.py:

python
from supermemory import Supermemory
import os
from dotenv import load_dotenv

load_dotenv()
client = Supermemory(api_key=os.getenv("SUPERMEMORY_API_KEY"))

# Replace with your user_id from console logs
user_id = "your_user_id_here"
container_tag = f"user_{user_id}"

memories = client.documents.list(
    container_tags=[container_tag],
    limit=20,
    sort="updatedAt",
    order="desc"
)

print(f"Found {len(memories.memories)} memories:")
for i, memory in enumerate(memories.memories):
    full = client.documents.get(id=memory.id)
    print(f"\n{i + 1}. {full.content}")

TypeScript

Create scripts/check-memories.ts:

typescript
const userId = "your_user_id_here"
const containerTag = `user_${userId}`

const response = await fetch('https://api.supermemory.ai/v3/memories', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${process.env.SUPERMEMORY_API_KEY}`,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    containerTags: [containerTag],
    limit: 20,
    sort: 'updatedAt',
    order: 'desc'
  })
})

const data = await response.json()
console.log(`Found ${data.memories?.length || 0} memories`)

Troubleshooting

Memory not persisting?

  • Verify container tags are consistent
  • Check API key has write permissions
  • Ensure email is properly normalized

Responses not personalized?

  • Increase search limit to find more memories
  • Check that memories are being added
  • Verify system prompt guides tool usage

Performance issues?

  • Reduce search limits
  • Implement caching for frequent queries
  • Use appropriate thresholds

Built with Supermemory. Customize based on your needs.