ITEA is the Eureka Cluster on software innovation
ITEA is the Eureka Cluster on software innovation
ITEA 4 page header azure circular

Confidence calibration

Project
20219 IML4E
Type
New service
Description

Methodology in the field of uncertainty estimation in machine learning, focusing on confidence scores and their calibration. Application of the method to alleviate the uncertainty of an in-production machine learning model operating in the Basware case. The solution produces confidence estimates to increase the trustworthiness and dependability of the ML-based system, thus, improving quality and decreasing manual work

Contact
Juhani Kivimäki
Email
juhani.kivimaki@helsinki.fi
Research area(s)
Uncertainty estimation, uncertainty calibration, model monitoring, machine learning
Technical features

The confidence estimator used in the case study is a hybrid system using convolutional neural networks and statistical analysis to derive comparative features from a stack of heatmaps, which originate from the Basware distillation pipeline. These features are fed to an XGBoost model, which learns to assign a confidence score indicating whether the ensemble of models within the Basware pipeline made a correct prediction or not. Finally, these confidence scores are calibrated using a Beta calibration mapping to match with empirical probabilities.

Integration constraints

The confidence estimator used in the case study is implemented in Python 3 using the TensorFlow, XGBoost, and betacal libraries.

Targeted customer(s)

The method presented in the case study can be adjusted to work with most machine learning models. The produced confidence scores can be used in failure prediction, model monitoring, and for other quality assurance needs.

Conditions for reuse

Open access

Confidentiality
Public
Publication date
23-09-2024
Involved partners
University of Helsinki (FIN)
Basware Oy (FIN)

Images