Create a clear AI/LLM system design document (PDF) for a production-ready chat app with RAG, guardrails, and monitoring. This is a documentation-only job, and the document must be practical enough for engineers to implement from it.
Requirements to cover in the PDF:
High-level architecture + data flows (ingestion → indexing → chat)
Components (API, orchestration, vector DB, storage, cache, auth/RBAC)
Cost controls (caching, model routing, token limits)
Security/guardrails (prompt injection + PII handling)
Deployment outline (AWS preferred) + key KPIs/monitoring
Failure modes + mitigations
To apply (answer in your proposal):
5 key components you’d include
3 top security risks in LLM apps
Confirm you can deliver the PDF within 48 hours