CLOUD RUN
The serving layer: deploys scalable, secure, serverless RAG API endpoints.
Cloud Run runs containers without server management and scales instances automatically with request volume, down to zero when idle, so cost tracks actual traffic. This makes it a good fit for cost-efficient, production-grade RAG serving.
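
As a rough illustration of what such an endpoint could look like inside the container, here is a minimal sketch using only the Python standard library. The `/query` route, the `DOCUMENTS` list, and the `retrieve` and `answer_query` helpers are hypothetical placeholders: a real service would query a vector store and call a language model. The one piece taken from Cloud Run itself is that the container listens on the port given in the `PORT` environment variable.

```python
import json
import os
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer

# Hypothetical in-memory corpus standing in for a real vector store.
DOCUMENTS = [
    "Cloud Run runs stateless containers and scales them with request load.",
    "Retrieval-augmented generation grounds model answers in retrieved text.",
]


def retrieve(query, k=1):
    """Rank documents by naive word overlap (placeholder for vector search)."""
    terms = set(query.lower().split())
    ranked = sorted(
        DOCUMENTS,
        key=lambda doc: len(terms & set(doc.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def answer_query(query):
    """Return the retrieved context; a real service would also call an LLM here."""
    return {"query": query, "context": retrieve(query)}


class RagHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Assumed route name; only POST /query is handled in this sketch.
        if self.path != "/query":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        try:
            payload = json.loads(self.rfile.read(length) or b"{}")
            query = payload["query"]
            if not isinstance(query, str):
                raise TypeError("query must be a string")
        except (json.JSONDecodeError, KeyError, TypeError):
            self.send_error(400, "expected JSON body with a string 'query' field")
            return
        body = json.dumps(answer_query(query)).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def main():
    # Cloud Run passes the listening port in PORT; 8080 is the usual default.
    port = int(os.environ.get("PORT", "8080"))
    ThreadingHTTPServer(("0.0.0.0", port), RagHandler).serve_forever()
```

The container's entrypoint would call `main()`. Keeping the handler stateless, with no per-instance session data, is what lets instances be added or removed freely as traffic changes.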