In this paper we discuss the implementation of various controller performance monitoring criteria on a multivariable benchmark example. These criteria can be based on either time-domain or frequency-domain concepts and have different requirements for information content and algorithm complexity. The simulation results from the benchmark example are analyzed to derive conclusions on the consistency of each criterion to identify poorly tuned controllers. Typically, all methods generate consistent results with suitably designed tests and when the only source of performance deterioration is the controller tuning. However, in the presence of arbitrary external disturbances (e.g., large amplitude, deterministic) and varying levels of excitation, not all techniques yield consistent results. In this respect, criteria based on controller unfalsification concepts appear to be the more promising to yield reliable performance monitoring for operation in industrial environments.