The Role of Publicly Available Data in MICCAI Papers from 2014 to 2018

Heller, Nicholas; Rickman, Jack; Weight, Christopher; Papanikolopoulos, Nikolaos

doi:10.1007/978-3-030-33642-4_8

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11851))

Included in the following conference series:

International Workshop on Large-scale Annotation of Biomedical data and Expert Label Synthesis
International Workshop on Hardware Aware Learning for Medical Imaging and Computer Assisted Intervention
MICCAI Challenge on Correction of Brainshift with Intra-Operative Ultrasound

677 Accesses

Abstract

Widely-used public benchmarks are of huge importance to computer vision and machine learning research, especially with the computational resources required to reproduce state of the art results quickly becoming untenable. In medical image computing, the wide variety of image modalities and problem formulations yields a huge task-space for benchmarks to cover, and thus the widespread adoption of standard benchmarks has been slow, and barriers to releasing medical data exacerbate this issue. In this paper, we examine the role that publicly available data has played in MICCAI papers from the past five years. We find that more than half of these papers are based on private data alone, although this proportion seems to be decreasing over time. Additionally, we observed that after controlling for open access publication and the release of code, papers based on public data were cited over 60% more per year than their private-data counterparts. Further, we found that more than 20% of papers using public data did not provide a citation to the dataset or associated manuscript, highlighting the “second-rate” status that data contributions often take compared to theoretical ones. We conclude by making recommendations for MICCAI policies which could help to better incentivise data sharing and move the field toward more efficient and reproducible science.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

€32.70 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: EUR 29.95; Price includes VAT (France)

eBook: EUR 42.79; Price includes VAT (France)

Softcover Book: EUR 52.74; Price includes VAT (France)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Medical Image Data and Datasets in the Era of Machine Learning—Whitepaper from the 2016 C-MIMI Meeting Dataset Session

Article Open access 17 May 2017

Methods and open-source toolkit for analyzing and visualizing challenge results

Article Open access 27 January 2021

Code-free deep learning for multi-modality medical image classification

Article Open access 01 March 2021

References

Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T., Ronneberger, O.: 3D U-net: learning dense volumetric segmentation from sparse annotation. In: Ourselin, S., Joskowicz, L., Sabuncu, M.R., Unal, G., Wells, W. (eds.) MICCAI 2016. LNCS, vol. 9901, pp. 424–432. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46723-8_49
Chapter Google Scholar
Clark, K., et al.: The cancer imaging archive (tcia): maintaining and operating a public information repository. J. Digit. Imaging 26(6), 1045–1057 (2013)
Article Google Scholar
Colavizza, G., Hrynaszkiewicz, I., Staden, I., Whitaker, K., McGillivray, B.: The citation advantage of linking publications to research data. arXiv preprint arXiv:1907.02565 (2019)
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Google Scholar
Dixon, W.J., Yuen, K.K.: Trimming and winsorization: a review. Statistische Hefte 15(2–3), 157–170 (1974)
Article MathSciNet Google Scholar
Drachen, T., Ellegaard, O., Larsen, A., Dorch, S.: Sharing data increases citations. Liber Q. 26(2) (2016)
Google Scholar
Efron, B., Tibshirani, R.J.: An Introduction to the Bootstrap. CRC Press, Boca Raton (1994)
Book Google Scholar
Erickson, B.J., Korfiatis, P., Akkus, Z., Kline, T.L.: Machine learning for medical imaging. Radiographics 37(2), 505–515 (2017)
Article Google Scholar
Eysenbach, G.: Citation advantage of open access articles. PLoS Biol. 4(5), e157 (2006)
Article Google Scholar
Goldberger, A.L., Amaral, L.A., Glass, L., Hausdorff, J.M., Ivanov, P.C., Mark, R.G., Mietus, J.E., Moody, G.B., Peng, C.K., Stanley, H.E.: Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals. Circulation 101(23), e215–e220 (2000)
Article Google Scholar
Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images. Technical report, Citeseer (2009)
Google Scholar
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Chapter Google Scholar
Piwowar, H.A., Vision, T.J.: Data reuse and the open data citation advantage. PeerJ 1, e175 (2013)
Article Google Scholar
Roth, H.R., et al.: DeepOrgan: multi-level deep convolutional networks for automated pancreas segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9349, pp. 556–564. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24553-9_68
Chapter Google Scholar
Sekara, V., Deville, P., Ahnert, S.E., Barabási, A.L., Sinatra, R., Lehmann, S.: The chaperone effect in scientific publishing. Proc. Natl. Acad. Sci. 115(50), 12603–12607 (2018)
Article Google Scholar
Thelwall, M., Wilson, P.: Regression for citation data: an evaluation of different methods. J. Informetrics 8(4), 963–971 (2014)
Article Google Scholar
Vandewalle, P.: Code sharing is associated with research impact in image processing. Comput. Sci. Eng. 14(4), 42–47 (2012)
Article Google Scholar
Wilkinson, M.D., et al.: The fair guiding principles for scientific data management and stewardship. Sci. Data 3 (2016)
Google Scholar

Download references

Acknowledgements

Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under Award Number R01CA225435. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and Affiliations

University of Minnesota – Twin Cities, Minneapolis, USA
Nicholas Heller, Jack Rickman, Christopher Weight & Nikolaos Papanikolopoulos

Authors

Nicholas Heller
View author publications
You can also search for this author in PubMed Google Scholar
Jack Rickman
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Weight
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaos Papanikolopoulos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nicholas Heller .

Editor information

Editors and Affiliations

University of Sydney, Sydney, NSW, Australia
Luping Zhou
University of Minnesota, Minneapolis, MN, USA
Nicholas Heller
University of Notre Dame, Notre Dame, IN, USA
Yiyu Shi
Western University, London, ON, Canada
Yiming Xiao
University of Bern, Bern, Switzerland
Raphael Sznitman
Eindhoven University of Technology, Eindhoven, The Netherlands
Veronika Cheplygina
École Centrale de Nantes, Nantes, France
Diana Mateus
University of Dundee, Dundee, UK
Emanuele Trucco
University of Notre Dame, Notre Dame, IN, USA
X. Sharon Hu
University of Notre Dame, Notre Dame, IN, USA
Danny Chen
University of Grenoble Alpes, Grenoble, France
Matthieu Chabanas
Concordia University, Montréal, QC, Canada
Hassan Rivaz
Health Research, SINTEF Digital, Trondheim, Norway
Ingerid Reinertsen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Heller, N., Rickman, J., Weight, C., Papanikolopoulos, N. (2019). The Role of Publicly Available Data in MICCAI Papers from 2014 to 2018. In: Zhou, L., et al. Large-Scale Annotation of Biomedical Data and Expert Label Synthesis and Hardware Aware Learning for Medical Imaging and Computer Assisted Intervention. LABELS HAL-MICCAI CuRIOUS 2019 2019 2019. Lecture Notes in Computer Science(), vol 11851. Springer, Cham. https://doi.org/10.1007/978-3-030-33642-4_8

Download citation

DOI: https://doi.org/10.1007/978-3-030-33642-4_8
Published: 24 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33641-7
Online ISBN: 978-3-030-33642-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

The Role of Publicly Available Data in MICCAI Papers from 2014 to 2018

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Medical Image Data and Datasets in the Era of Machine Learning—Whitepaper from the 2016 C-MIMI Meeting Dataset Session

Methods and open-source toolkit for analyzing and visualizing challenge results

Code-free deep learning for multi-modality medical image classification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

The Role of Publicly Available Data in MICCAI Papers from 2014 to 2018

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Medical Image Data and Datasets in the Era of Machine Learning—Whitepaper from the 2016 C-MIMI Meeting Dataset Session

Methods and open-source toolkit for analyzing and visualizing challenge results

Code-free deep learning for multi-modality medical image classification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation