A TrueFoundry Service represents a continuously running application that typically provides a set of APIs for interaction. Services can be dynamically scaled based on incoming traffic or resource demands. Services are perfect for scenarios where real-time responses are essential, such as:
  • Hosting Real-time Model Inference (e.g., Flask, FastAPI)
  • Fueling Dynamic Website Backends
  • Creating Model Demos (e.g., Streamlit, Gradio)
TrueFoundry makes it easy to configure various aspects of your service deployment.