A TrueFoundry Service is a continuously running application that typically exposes a set of APIs. Services can be scaled dynamically based on incoming traffic or resource demands.

Services are perfect for scenarios where real-time responses are essential, such as:

  • Hosting Real-time Model Inference (e.g., Flask, FastAPI)
  • Powering Dynamic Website Backends
  • Creating Model Demos (e.g., Streamlit, Gradio)
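
To make the idea of a continuously running, API-serving application concrete, here is a minimal sketch of a real-time inference service. It uses only the Python standard library so that it runs anywhere; in practice you would more likely use a framework such as Flask or FastAPI. The `predict` function, the `/predict` and `/health` paths, and port 8000 are all illustrative assumptions, and nothing here is TrueFoundry-specific.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def predict(features):
    """Hypothetical stand-in for a model: returns the mean of the inputs."""
    if not features:
        raise ValueError("features must be a non-empty list of numbers")
    return sum(features) / len(features)


class InferenceHandler(BaseHTTPRequestHandler):
    def _send_json(self, status, payload):
        # Serialize the payload and write a complete JSON response.
        body = json.dumps(payload).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def do_GET(self):
        # Health check endpoint (assumed path).
        if self.path == "/health":
            self._send_json(200, {"status": "ok"})
        else:
            self._send_json(404, {"error": "not found"})

    def do_POST(self):
        # Inference endpoint (assumed path); expects {"features": [...]}.
        if self.path != "/predict":
            self._send_json(404, {"error": "not found"})
            return
        length = int(self.headers.get("Content-Length", 0))
        try:
            request = json.loads(self.rfile.read(length) or b"{}")
            result = predict(request["features"])
        except (ValueError, KeyError, TypeError) as exc:
            # Malformed JSON, missing key, or non-numeric input.
            self._send_json(400, {"error": str(exc)})
            return
        self._send_json(200, {"prediction": result})


def run(host="0.0.0.0", port=8000):
    # Blocks and serves requests until the process is stopped.
    HTTPServer((host, port), InferenceHandler).serve_forever()
```

Calling `run()` starts the server. It then stays up and answers each request as it arrives, which is the behavior that distinguishes a service from a one-off script.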

TrueFoundry makes it easy to configure various aspects of your service deployment.