This documentation provides instructions for setting up grafana dashboards to see metrics emitted from the inference extension and model servers.
Please follow metrics page to configure the proxy to enable all metrics.
Please follow grafana instructions to load the dashboard json.
If you run the inferece gateway with Google Managed Prometheus, please follow the instructions to configure Google Managed Prometheus as data source for the grafana dashboard.
Please configure the scrape_interval
of your prometheus configuration to lower than 15s
, rate
function returns empty string if data falls too apart. See https://www.robustperception.io/what-range-should-i-use-with-rate/ for more details.
Example:
global:
scrape_interval: 5s