TY - JOUR
T1 - Evaluation of two semi-supervised learning methods and their combination for automatic classification of bone marrow cells
AU - Nakamura, Iori
AU - Ida, Haruhi
AU - Yabuta, Mayu
AU - Kashiwa, Wataru
AU - Tsukamoto, Maho
AU - Sato, Shigeki
AU - Ota, Syuichi
AU - Kobayashi, Naoki
AU - Masauzi, Hiromi
AU - Okada, Kazunori
AU - Kaga, Sanae
AU - Miwa, Keiko
AU - Kanai, Hiroshi
AU - Masauzi, Nobuo
N1 - Funding Information:
This work was carried out within the framework of a state contract at the Tuvinian Institute for Complex Exploration of Natural Resources, Siberian Branch, Russian Academy of Sciences (state registration no. 121031500511-0, topic no. FUFS-2021-0008).
Funding Information:
This research was partly supported by a research grant from AMED, Japan Agency for Medical Research and Development (Grant no. JP20lm0203001j0003). We would like to thank Editage (www.editage.com) for English language editing.
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - Differential bone marrow (BM) cell counting is an important test for the diagnosis of various hematological diseases. However, it is difficult to accurately classify BM cells due to non-uniformity and the lack of reproducibility of differential counting. Therefore, automatic classification systems have been developed in which deep learning is used. These systems requires large and accurately labeled datasets for training. To overcome this, we used semi-supervised learning (SSL), in which learning proceeds while labeling. We used three methods: self-training (ST), active learning (AL), and a combination of these methods, and attempted to automatically classify 16 types of BM cell images. ST involves data verification, as in AL, before adding them to the training dataset (confirmed self-training: CST). After 25 rounds of CST, AL, and CST + AL, the initial number of training data increased from 425 to 40,518; 3682; and 47,843, respectively. Accuracies for the test data of 50 images for each cell type were 0.944, 0.941, and 0.976, respectively. Data added with CST or AL showed some imbalances between classes, while CST + AL exhibited fewer imbalances. We suggest that CST + AL, when combined with two SSL methods, is efficient in increasing training data for the development of automatic BM cells classification systems.
AB - Differential bone marrow (BM) cell counting is an important test for the diagnosis of various hematological diseases. However, it is difficult to accurately classify BM cells due to non-uniformity and the lack of reproducibility of differential counting. Therefore, automatic classification systems have been developed in which deep learning is used. These systems requires large and accurately labeled datasets for training. To overcome this, we used semi-supervised learning (SSL), in which learning proceeds while labeling. We used three methods: self-training (ST), active learning (AL), and a combination of these methods, and attempted to automatically classify 16 types of BM cell images. ST involves data verification, as in AL, before adding them to the training dataset (confirmed self-training: CST). After 25 rounds of CST, AL, and CST + AL, the initial number of training data increased from 425 to 40,518; 3682; and 47,843, respectively. Accuracies for the test data of 50 images for each cell type were 0.944, 0.941, and 0.976, respectively. Data added with CST or AL showed some imbalances between classes, while CST + AL exhibited fewer imbalances. We suggest that CST + AL, when combined with two SSL methods, is efficient in increasing training data for the development of automatic BM cells classification systems.
UR - http://www.scopus.com/inward/record.url?scp=85139360033&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85139360033&partnerID=8YFLogxK
U2 - 10.1038/s41598-022-20651-4
DO - 10.1038/s41598-022-20651-4
M3 - Article
C2 - 36202847
AN - SCOPUS:85139360033
SN - 2045-2322
VL - 12
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 16736
ER -