A privacy-first RAG assistant that grounds Bedrock models (Claude/Titan) in your own documents with per-user isolation.
RAG Vault is a secure, full-stack Retrieval-Augmented Generation application. Users authenticate, upload documents (PDF, DOCX, TXT), and run multi-turn chats grounded in their private knowledge. Retrieval can be toggled on/off per question. The system emphasizes privacy, scalability, and a clean UX.
Traditional e-commerce platforms struggle with:
JWT login/signup with access & refresh tokens, bcrypt password hashing, refresh revocation, per-user chat sessions.
Uploads to S3 with user-level isolation. Extract text from PDF/DOCX/TXT; index metadata in PostgreSQL.
LangChain RecursiveCharacterTextSplitter; Titan Embeddings v2; vectors stored in Pinecone per user namespace.
Toggleable RAG: top-k chunks retrieved and injected into prompts; prompts formatted for Claude/Titan via Bedrock.
Session-scoped history persisted in PostgreSQL; load/resume any session.
Frontend: Vite + React + Tailwind · Backend: FastAPI + PostgreSQL · Infra: S3, Pinecone, Amazon Bedrock.
pytest suite (>80% coverage) covering auth flows, uploads, embedding/indexing, Bedrock prompts, and retrieval. Includes edge cases for token refresh expiry and upload errors.