Configure weight-based load balancing for AI models
weight
valuewhen
block specifies the conditions under which this rule applies: it matches requests for the gpt-4
model, coming from members of the engineering-team
, and only in the production
environment. All conditions in the when
block must be satisfied for the rule to take effect.