CLOUD RUN

The serving layer. Used to deploy scalable, serverless, and secure RAG API endpoints.

Cloud Run provides serverless, container-based deployment that scales from zero to millions of requests, ideal for cost-efficient, production-grade RAG serving.