AC-MLP: Axial Convolution-MLP Mixer for nuclei segmentation in histopathological images
##plugins.themes.academic_pro.article.main##
Author
-
Nguyen Thanh ThuSchool of Electrical and Electronic Engineering, Hanoi University of Science and Technology, VietnamDinh Binh DuongSchool of Electrical and Electronic Engineering, Hanoi University of Science and Technology, VietnamTran Thi ThaoĐại học Bách khoa Hà NộiPham Van TruongSchool of Electrical and Electronic Engineering, Hanoi University of Science and Technology, Vietnam
Từ khóa:
Tóm tắt
Recent MLP-Mixer has a good ability to handle long-range dependencies, however, to have a good performance, one requires huge data and expensive infrastructures for the pre-training process. In this study, we proposed a novel model for nuclei image segmentation namely Axial Convolutional-MLP Mixer, by replacing the token mixer of MLP-Mixer with a new operator, Axial Convolutional Token Mix. Specifically, in the Axial Convolutional Token Mix, we inherited the idea of axial depthwise convolution to create a flexible receptive field. We also proposed a Long-range Attention module that uses dilated convolution to extend the convolutional kernel size, thereby addressing the issue of long-range dependencies. Experiments demonstrate that our model can achieve high results on small medical datasets, with Dice scores of 90.20% on the GlaS dataset, 80.43% on the MoNuSeg dataset, and without pre-training. The code will be available at https://github.com/thanhthu152/AC-MLP.
Tài liệu tham khảo
-
[1] Ronneberger, P. Fischer, and T. Brox, “U-net: Convolutional networks for biomedical image segmentation”, Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, Proceedings, Part III 18. Springer International Publishing, 2015.
[2] Jha, M. A. Riegler, D. Johansen, P. Halvorsen, and H.D. Johansen, “Doubleu-net: A deep convolutional neural network for medical image segmentation”, 2020 IEEE 33rd International symposium on computer-based medical systems (CBMS), 2020.
[3] Oktay et al., “Attention u-net: Learning where to look for the pancreas”, arXiv preprint arXiv:1804.03999, 2018.
[4] Zhou, M.M.R. Siddiquee, N. Tajbakhsh, and J. Liang, “Unet++: A nested u-net architecture for medical image segmentation”, Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings 4. Springer International Publishing, 2018.
[5] Jha et al., “Resunet++: An advanced architecture for medical image segmentation”, 2019 IEEE International Symposium on Multimedia (ISM). IEEE, 2019.
[6] Shan, J. Yan, X. Cui, and L. Xie, “DSCA-Net: A depthwise separable convolutional neural network with attention mechanism for medical image segmentation”, Math Biosci Eng., vol. 20, pp. 365-382, 2022.
[7] Howard et al., “Mobilenets: Efficient convolutional neural networks for mobile vision applications”, arXiv preprint arXiv:1704.04861, 2017.
[8] B. Dinh, T.T Nguyen, T.T. Tran, and V.T Pham, “1M parameters are enough? A lightweight CNN-based model for medical image segmentation”, 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC). IEEE, 2023.
[9] Dosovitskiy et al., “An image is worth 16x16 words: Transformers for image recognition at scale”, arXiv preprint arXiv:2010.11929, 2020.
[10] Liu et al., “Swin transformer: Hierarchical vision transformer using shifted windows”, Proceedings of the IEEE/CVF international conference on computer vision, 2021.
[11] Tolstikhin et al. “Mlp-mixer: An all-mlp architecture for vision”, Advances in neural information processing systems 34, 2021, pp.24261-24272
[12] Hou, Z. Jiang, L. Yuan, M.M. Cheng, S. Yan, and J. Feng, “Vision permutator: A permutable mlp-like architecture for visual recognition”, IEEE Transactions on Pattern Analysis and Machine Intelligence. vol.45, no.1, pp. 1328-1334, 2022.
[13] Chen, E. Xie, C. Ge, R. Chen, D. Liang, and P. Luo, “Cyclemlp: A mlp-like architecture for dense prediction”, arXiv preprint arXiv:2107.10224, 2017.
[14] Lian, Z. Yu, X. Sun, and S. Gao, “As-mlp: An axial shifted mlp architecture for vision”, arXiv preprint arXiv:2107.08391, 2021.
[15] Yan et al., “COVID-19 chest CT image segmentation--a deep convolutional neural network solution”, arXiv preprint arXiv:2004.10987, 2020.
[16] C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, “Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs”, IEEE transactions on pattern analysis and machine intelligence. Vol. 40, no. 4, pp. 834-848, 2017.
[17] Hu, L. Shen, S. Albanie, G. Sun, and E. Wu, “Squeeze-and-excitation networks”, Proceedings of the IEEE conference on computer vision and pattern recognition, 2018.
[18] Woo, J. Park, J. Y. Lee, and I. S. Kweon, “Cbam: Convolutional block attention module”, Proceedings of the European conference on computer vision (ECCV), 2018.
[19] P. Lai, T.T Tran, and V.T Pham, “Axial attention mlp-mixer: A new architecture for image segmentation”, 2022 IEEE Ninth International Conference on Communications and Electronics (ICCE). IEEE, 2022.
[20] Sirinukunwattanan et al. “Gland segmentation in colon histology images: The glas challenge contest”, Medical image analysis, vol. 35, pp. 489-502, 2017.
[21] Kumar et al., “A multi-organ nucleus segmentation challenge”, IEEE transactions on medical imaging, vol. 39, no. 5, pp. 1380-1391, 2019.
[22] Kingma and J. Ba, “Adam: A method for stochastic optimization”, ArXiv Preprint ArXiv:1412.6980, 2014.
[23] Han, M. Jian, and G.G. Wang, “ConvUNeXt: An efficient convolution neural network for medical image segmentation”. Knowledge-Based Systems, vol. 253, no. C, 2022.
[24] J. M. Valanarasu, and V. M. Patel, “Unext: Mlp-based rapid medical image segmentation network”, Medical Image Computing and Computer Assisted Intervention–MICCAI 2022: 25th International Conference, Singapore, September 18–22, 2022.
[25] Valanarasu, P. Oza, I. Hacihaliloglu, and V. Patel, “Medical transformer: Gated axial-attention for medical image segmentation”. International Conference on Medical Image Computing and Computer-Assisted Intervention, 2021. pp. 36-46.
[26] Cao et al., “Swin-unet: Unet-like pure transformer for medical image segmentation”, European conference on computer vision. Cham: Springer Nature Switzerland, 2022.
[27] Z. Liu, H. Mao, C. Y. Wu, C. Feichtenhofer, T. Darrell, and S. Xie, “A convnet for the 2020s”, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022.