A Service represents a persistent and continuously running applications that typically provides a set of APIs for interaction. As Services run continuously, costs are incurred based on resource utilization. Services can be dynamically scaled based on incoming traffic or resource demands.
Services are ideal when requests can arrive at any time and prompt responses are crucial. Some typical use cases for services include:
- Hosting Realtime Model Inference (e.g., Flask, FastAPI)
- Performing Model Inference on Incoming Data Streams (e.g., Kafka messages)
- Powering Dynamic Backend for Websites
- Demonstrating Models (e.g., Streamlit, Gradio)
A Service typically exposes APIs on designated ports. These ports can be mapped to domains, enabling external calls to the Service. Multiple ports can be exposed by a Service, each possibly mapped to a different URL.
Updated 20 days ago