Serverless RAG Architecture & Vector Search
Designed a production retrieval-augmented generation (RAG) architecture that indexes enterprise knowledge in OpenSearch and Pinecone. Containerized the services with Docker for consistent deployment, provisioned infrastructure with Terraform, and tuned vector queries to achieve sub-second latency for semantic retrieval.
- Sub-second vector search latency in production
- Infrastructure provisioned as code via Terraform across environments
- Containerized with Docker for portable deployment
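
For readers unfamiliar with the semantic retrieval step, the following is a minimal sketch of what a vector query does conceptually: rank stored embeddings by cosine similarity to a query embedding and return the top k. It is an illustration only, not the project's code. The production system delegates this to OpenSearch and Pinecone, whose APIs are not shown here, and the document IDs, three-dimensional vectors, and function names below are all hypothetical.

```python
import heapq
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(
    query: Sequence[float],
    index: Dict[str, Sequence[float]],
    k: int = 3,
) -> List[Tuple[str, float]]:
    """Brute-force search: score every stored vector and keep the k most similar."""
    scored = ((doc_id, cosine_similarity(query, vec)) for doc_id, vec in index.items())
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Hypothetical toy embeddings; real embeddings have hundreds of dimensions.
toy_index = {
    "vpn-setup": [0.9, 0.1, 0.0],
    "expense-policy": [0.0, 0.2, 0.9],
    "wifi-troubleshooting": [0.8, 0.3, 0.1],
}

results = top_k([1.0, 0.2, 0.0], toy_index, k=2)
for doc_id, score in results:
    print(f"{doc_id}: {score:.3f}")
```

A brute-force scan like this is linear in the number of documents. Managed vector stores typically rely on approximate nearest-neighbor indexes instead, which is one reason sub-second latency is feasible at enterprise scale.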