Comment by jiggawatts
Comment by jiggawatts 3 days ago
A standard trick is to only turn on detailed telemetry from a subset of identical worker VMs or container instances.
Sampling is almost always sufficient for most issues, and when it’s not, you can turn on telemetry on all nodes for selected error levels or critical sections.