Improving classification results on a small medical dataset using a GAN; An outlook for dealing with rare disease datasets

Röglin, Julia and Ziegeler, Katharina and Kube, Jana and König, Franziska and Hermann, Kay-Geert and Ortmann, Steffen (2022) Improving classification results on a small medical dataset using a GAN; An outlook for dealing with rare disease datasets. Frontiers in Computer Science, 4. ISSN 2624-9898

[thumbnail of pubmed-zip/versions/1/package-entries/fcomp-04-858874/fcomp-04-858874.pdf] Text
pubmed-zip/versions/1/package-entries/fcomp-04-858874/fcomp-04-858874.pdf - Published Version

Download (788kB)

Abstract

For clinical decision support systems, automated classification algorithms on medical image data have become more important in the past. For such computer vision problems, deep convolutional neural networks (DCNNs) have made breakthroughs. These often require large, annotated, and privacy-cleared datasets as a prerequisite for gaining high-quality results. This proves to be difficult with rare diseases due to limited incidences. Therefore, it is hard to sensitize clinical decision support systems to identify these diseases at an early stage. It has been shown several times, that synthetic data can improve the results of clinical decision support systems. At the same time, the greatest problem for the generation of these synthetic images is the data basis. In this paper, we present four different methods to generate synthetic data from a small dataset. The images are from 2D magnetic resonance tomography of the spine. The annotation resulted in 540 healthy, 47 conspicuously non-pathological, and 106 conspicuously pathological vertebrae. Four methods are presented to obtain optimal generation results in each of these classes. The obtained generation results are then evaluated with a classification net. With this procedure, we showed that adding synthetic annotated data has a positive impact on the classification results of the original data. In addition, one of our methods is appropriate to generate synthetic image data from <50 images. Thus, we found a general approach for dealing with small datasets in rare diseases, which can be used to build sensitized clinical decision support systems to detect and treat these diseases at an early stage.

Item Type: Article
Subjects: Oalibrary Press > Computer Science
Depositing User: Managing Editor
Date Deposited: 27 Jan 2023 05:48
Last Modified: 20 Sep 2023 07:09
URI: http://asian.go4publish.com/id/eprint/510

Actions (login required)

View Item
View Item