ceg-day3-workshop-starter

v0.1.0

5 days ago

A retrieval-augmented generation (RAG) agent that can store and retrieve information from its own knowledge base.

A retrieval-augmented generation (RAG) agent that can store and retrieve information from its own knowledge base.

Tech Stack: Next.js 14, AI SDK, Groq (chat), HuggingFace (embeddings), Drizzle ORM, Postgres + pgvector

Created a new embeddings table to store text chunks and their vector representations
Step 1a - Foreign key (resourceId) linking each embedding back to its original resource
Step 1b - content column for the plain text chunk
Step 1c - embedding column as a 384-dimension vector (matching HuggingFace all-MiniLM-L6-v2 output)
Step 1d - HNSW index on the embedding column for fast cosine similarity searches

Created all the functions needed to chunk text, generate embeddings, and search for relevant content
Uses HuggingFace Inference API directly for free embeddings
Step 2a - Defined the HuggingFace embedding model (sentence-transformers/all-MiniLM-L6-v2, 384 dimensions, free)
Step 2b - hfEmbed() - helper to call HuggingFace Inference API for embeddings
Step 2c - generateChunks() - splits source material into smaller chunks by splitting on periods
Step 2d - generateEmbeddings() - generates embeddings for all chunks of a text (used when saving resources)
Step 2e - generateEmbedding() - generates a single embedding for a query string (used when searching)
Step 2f - findRelevantContent() - embeds the user's query and finds the top 4 similar chunks (similarity > 0.5)

Updated the createResource server action to also generate and store embeddings
Step 3a - Imported generateEmbeddings and the embeddings table
Step 3b - After saving a resource to the DB, chunk it, embed it, and store all embeddings

Created the /api/chat POST route that powers the AI agent
Step 4a - Uses Groq moonshotai/kimi-k2-instruct-0905 model for chat
Step 4b - stopWhen: stepCountIs(5) allows multi-step calls so the model can call tools and then summarize
Step 4c - System prompt restricts the model to only answer from its knowledge base
Step 4d - addResource tool lets the model save new information to the knowledge base
Step 4e - getInformation tool lets the model retrieve relevant content via semantic search

Built the frontend chat interface using the AI SDK's useChat hook
Step 5a - useChat hook manages chat state and sends messages to /api/chat
Step 5b - Renders message parts: text responses and tool call indicators
Step 5c - Shows tool call status (calling/called) with input details
Step 5d - Input form sends messages on submit via sendMessage

Add your keys to .env:

DATABASE_URL=your-neon-database-url
GROQ_API_KEY=your-groq-api-key
HUGGINGFACE_API_KEY=your-huggingface-api-key