A Principled Approach to Failure Analysis and Model Repairment: Demonstration in Medical Imaging

Henn, Thomas; Sakamoto, Yasukazu; Jacquet, Clément; Yoshizawa, Shunsuke; Andou, Masamichi; Tchen, Stephen; Saga, Ryosuke; Ishihara, Hiroyuki; Shimizu, Katsuhiko; Li, Yingzhen; Tanno, Ryutaro

doi:10.1007/978-3-030-87199-4_48

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 12903)

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

Abstract

Machine learning models commonly exhibit unexpected failures post-deployment due to either data shifts or uncommon situations in the training environment. Domain experts typically go through the tedious process of inspecting the failure cases manually, identifying failure modes and then attempting to fix the model. In this work, we aim to standardise and bring principles to this process by answering two critical questions: (i) how do we know that we have identified meaningful and distinct failure types? and (ii) how can we validate that a model has, indeed, been repaired? We suggest that the quality of the identified failure types can be validated by measuring the intra- and inter-type generalisation after fine-tuning, and we introduce metrics to compare different subtyping methods. Furthermore, we argue that a model can be considered repaired if it achieves high accuracy on the failure types while retaining performance on the previously correct data. We combine these two ideas into a principled framework for evaluating the quality of both the identified failure subtypes and model repairment. We evaluate its utility on a classification task and an object detection task. Our code is available at https://github.com/Rokken-lab6/Failure-Analysis-and-Model-Repairment.
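The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not give the exact metric definitions, so everything here is an assumption made for illustration: the accuracy matrix `acc` (entry `acc[i][j]` is the accuracy on held-out failures of type `j` after fine-tuning on failures of type `i`), the function names, and the thresholds `min_failure_acc` and `min_retained_acc`.

```python
from statistics import mean


def intra_inter_generalisation(acc):
    """Summarise a square accuracy matrix (hypothetical layout).

    acc[i][j] is assumed to be the accuracy on held-out failures of
    type j after fine-tuning on failures of type i.
    Returns (intra, inter): the mean of the diagonal entries and the
    mean of the off-diagonal entries.
    """
    n = len(acc)
    if n < 2 or any(len(row) != n for row in acc):
        raise ValueError("acc must be a square matrix with at least 2 types")
    intra = mean(acc[i][i] for i in range(n))
    inter = mean(acc[i][j] for i in range(n) for j in range(n) if i != j)
    return intra, inter


def is_repaired(acc_on_failures, acc_on_previously_correct,
                min_failure_acc=0.9, min_retained_acc=0.95):
    """Illustrative repair check with assumed thresholds.

    A model counts as repaired only if it is accurate on the failure
    cases AND keeps its accuracy on data it previously got right.
    """
    return (acc_on_failures >= min_failure_acc
            and acc_on_previously_correct >= min_retained_acc)


# Toy numbers, invented purely for illustration.
toy_acc = [[0.9, 0.2],
           [0.3, 0.8]]
intra, inter = intra_inter_generalisation(toy_acc)
```

Under this reading, a subtyping in which fine-tuning on one type generalises well to held-out cases of the same type (high `intra`) but poorly to other types (low `inter`) would suggest that the types are both coherent and distinct. The repair check deliberately requires both conditions, so a model that fixes the failures by forgetting previously correct cases does not pass.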

Author information

Authors and Affiliations

Rokken Inc., Osaka, Japan
Thomas Henn, Clément Jacquet, Stephen Tchen & Ryosuke Saga
Corporate R&D Center, Terumo Corporation, Tokyo, Japan
Yasukazu Sakamoto, Shunsuke Yoshizawa, Masamichi Andou, Hiroyuki Ishihara & Katsuhiko Shimizu
Department of Computing, Imperial College London, London, UK
Yingzhen Li
Department of Computing, University College London, London, UK
Ryutaro Tanno

Corresponding author

Correspondence to Thomas Henn.

Editor information

Editors and Affiliations

Erasmus MC - University Medical Center Rotterdam, Rotterdam, The Netherlands
Marleen de Bruijne
University of Basel, Allschwil, Switzerland
Philippe C. Cattin
Inria Nancy Grand Est, Villers-lès-Nancy, France
Stéphane Cotin
ICube, Université de Strasbourg, CNRS, Strasbourg, France
Nicolas Padoy
National Center for Tumor Diseases (NCT/UCC), Dresden, Germany
Stefanie Speidel
Tencent Jarvis Lab, Shenzhen, China
Yefeng Zheng
ICube, Université de Strasbourg, CNRS, Strasbourg, France
Caroline Essert

About this paper

Cite this paper

Henn, T. et al. (2021). A Principled Approach to Failure Analysis and Model Repairment: Demonstration in Medical Imaging. In: de Bruijne, M., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2021. MICCAI 2021. Lecture Notes in Computer Science, vol 12903. Springer, Cham. https://doi.org/10.1007/978-3-030-87199-4_48

DOI: https://doi.org/10.1007/978-3-030-87199-4_48
Published: 21 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87198-7
Online ISBN: 978-3-030-87199-4
eBook Packages: Computer Science, Computer Science (R0)

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society
