Confidence calibration

Project: 20219 IML4E
Type: New service
Description: Methodology in the field of uncertainty estimation in machine learning, focusing on confidence scores and their calibration. Application of the method to alleviate the uncertainty of an in-production machine learning model operating in the Basware case. The solution produces confidence estimates to increase the trustworthiness and dependability of the ML-based system, thus, improving quality and decreasing manual work
Contact: Juhani Kivimäki
Email: juhani.kivimaki@helsinki.fi
Research area(s): Uncertainty estimation, uncertainty calibration, model monitoring, machine learning
Technical features: The confidence estimator used in the case study is a hybrid system using convolutional neural networks and statistical analysis to derive comparative features from a stack of heatmaps, which originate from the Basware distillation pipeline. These features are fed to an XGBoost model, which learns to assign a confidence score indicating whether the ensemble of models within the Basware pipeline made a correct prediction or not. Finally, these confidence scores are calibrated using a Beta calibration mapping to match with empirical probabilities.
Integration constraints: The confidence estimator used in the case study is implemented in Python 3 using the TensorFlow, XGBoost, and betacal libraries.
Targeted customer(s): The method presented in the case study can be adjusted to work with most machine learning models. The produced confidence scores can be used in failure prediction, model monitoring, and for other quality assurance needs.
Conditions for reuse: Open access
Confidentiality: Public
Publication date: 23-09-2024
Involved partners: University of Helsinki (FIN); Basware Oy (FIN)

Confidence calibration

Images