Operational Performance Thresholds

From The Foundation for Best Practices in Machine Learning

Control

Define metrics and set tolerance intervals for the operational performance of Models and Products, such as latency, memory footprint, and CPU and GPU usage.
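
As an illustration of what such a definition could look like in practice, the following is a minimal Python sketch, not a prescribed implementation. The metric names, units, and numeric bounds are hypothetical placeholders, and the names ToleranceInterval and find_breaches are invented for this example. Real values would come from the Product's own requirements and from measurements collected by whatever monitoring system is in use.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ToleranceInterval:
    """Acceptable range for one operational metric (bounds are inclusive)."""
    metric: str
    unit: str
    lower: Optional[float] = None  # None means no lower bound
    upper: Optional[float] = None  # None means no upper bound

    def contains(self, value: float) -> bool:
        if self.lower is not None and value < self.lower:
            return False
        if self.upper is not None and value > self.upper:
            return False
        return True


# Hypothetical thresholds, for illustration only.
THRESHOLDS = [
    ToleranceInterval("latency_p95", "ms", upper=200.0),
    ToleranceInterval("memory_rss", "MiB", upper=2048.0),
    ToleranceInterval("cpu_utilisation", "%", upper=85.0),
    ToleranceInterval("gpu_utilisation", "%", lower=5.0, upper=95.0),
]


def find_breaches(observed: Dict[str, float],
                  thresholds: List[ToleranceInterval]) -> List[str]:
    """Return one message per metric that is missing or outside its interval."""
    breaches = []
    for t in thresholds:
        if t.metric not in observed:
            # A metric that stops reporting is itself an operational signal.
            breaches.append(f"{t.metric}: no measurement reported")
        elif not t.contains(observed[t.metric]):
            breaches.append(
                f"{t.metric}: {observed[t.metric]} {t.unit} "
                f"outside [{t.lower}, {t.upper}]"
            )
    return breaches


if __name__ == "__main__":
    # Made-up sample measurements; GPU usage is deliberately missing.
    sample = {"latency_p95": 240.0, "memory_rss": 1500.0, "cpu_utilisation": 60.0}
    for message in find_breaches(sample, THRESHOLDS):
        print(message)
```

Keeping the intervals as data, separate from the checking logic, is one possible design choice: it lets the thresholds be reviewed and revised without touching the code that evaluates them.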


Aim

To (a) prevent service unavailability and unreliability; (b) enable quick detection of bugs in the code; (c) ensure smooth integration of the Model with the surrounding systems; and (d) highlight associated risks that might arise during the Product Lifecycle.


Additional Information