By default, the TrueFoundry Helm chart comes with basic monitoring. We expose critical metrics using Prometheus ServiceMonitors and PodMonitors, and we also provide a Grafana dashboard to visualize them easily.

Enable Monitoring

To enable monitoring in the TrueFoundry Helm chart, set the following in the values of your installation:
monitoring:
  enabled: true
  alertManager:
    secret:
      create: true
      data:
        API_URL: <SLACK_WEBHOOK_URL>
By default, this setup comes with a Slack alerting channel. If you want to set up a different notification channel, feel free to customize the configuration and disable the default channel by adding the following to your values:
monitoring:
  alertManager:
    slackConfigs:
      enabled: false
If you want the TrueFoundry team to be paged for any incident in your hosted Control Plane, please request the SLACK_WEBHOOK_URL from the TrueFoundry team.

Access Grafana Dashboard

To access the Grafana dashboard, you can either install a Grafana instance managed by TrueFoundry or import the dashboard into your existing Grafana.
  • To install a new Grafana instance managed by TrueFoundry, navigate to Platform > Clusters, click on the control plane cluster, and install Grafana from the Add-Ons tab.
  • To import the dashboard into an existing Grafana, please find the dashboard.json here.
The dashboard can be accessed at Dashboards > General > TrueFoundry Control Plane.

Set Up Additional Alert Channels

To set up alert channels for any incident in the TrueFoundry control plane, provide the alert channel configuration in the Prometheus alert receiver format (ref) in the Helm values as follows:
monitoring:
  alertManager:
    additionalReceivers: []
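As an illustration, a populated receiver entry might look like the sketch below. It assumes that entries in additionalReceivers follow the Prometheus Operator AlertmanagerConfig receiver schema (camelCase keys, consistent with the slackConfigs key used earlier); this is an assumption, so confirm the exact schema against the linked reference. The receiver name and webhook URL are hypothetical placeholders.

```yaml
monitoring:
  alertManager:
    additionalReceivers:
      # Hypothetical receiver; the name and URL are placeholders, not chart defaults.
      - name: oncall-webhook
        webhookConfigs:
          # Assumed AlertmanagerConfig-style webhook fields.
          - url: https://alerts.example.com/hooks/truefoundry
            sendResolved: true
```

Depending on the chart, routing alerts to a new receiver may need additional route configuration that is not shown here.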
To customize anything further, please refer to this.