To read this content please select one of the options below:

MD-LDA: a supervised LDA topic model for identifying mechanism of disease in TCM

Meiwen Li (School of Information Engineering, Henan University of Science and Technology, Luoyang, China)
Liye Xia (The First Affiliated Hospital, Henan University of Science and Technology, Luoyang, China)
Qingtao Wu (School of Information Engineering, Henan University of Science and Technology, Luoyang, China)
Lin Wang (School of Information Engineering, Henan University of Science and Technology, Luoyang, China)
Junlong Zhu (School of Information Engineering, Henan University of Science and Technology, Luoyang, China)
Mingchuan Zhang (School of Information Engineering, Henan University of Science and Technology, Luoyang, China)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 22 July 2024

Issue publication date: 14 January 2025

107

Abstract

Purpose

In traditional Chinese medicine (TCM), the mechanism of disease (MD) constitutes an essential element of syndrome differentiation and treatment, elucidating the mechanisms underlying the occurrence, progression, alterations and outcomes of diseases. However, there is a dearth of research in the field of intelligent diagnosis concerning the analysis of MD.

Design/methodology/approach

In this paper, we propose a supervised Latent Dirichlet Allocation (LDA) topic model, termed MD-LDA, which elucidates the process of MDs identification. We leverage the label information inherent in the data as prior knowledge and incorporate it into the model’s training. Additionally, we devise two parallel parameter estimation algorithms for efficient training. Furthermore, we introduce a benchmark MD identification dataset, named TMD, for training MD-LDA. Finally, we validate the performance of MD-LDA through comprehensive experiments.

Findings

The results show that MD-LDA is effective and efficient. Moreover, MD-LDA outperforms the state-of-the-art topic models on perplexity, Kullback–Leibler (KL) and classification performance.

Originality/value

The proposed MD-LDA can be applied for the MD discovery and analysis of TCM clinical diagnosis, so as to improve the interpretability and reliability of intelligent diagnosis and treatment.

Keywords

Acknowledgements

This work was funded in part by the National Natural Science Foundation of China (NSFC) under (No. 62002102), the Scientific and Technological Innovation Teams and Talents of Colleges and Universities in Henan Province of China (Nos. 24IRTSTHN022 and 22HASTIT014) and the Key Technologies R&D Program of Henan Province (Nos. 241111210700, 232102211008 and 232102210028).

Citation

Li, M., Xia, L., Wu, Q., Wang, L., Zhu, J. and Zhang, M. (2025), "MD-LDA: a supervised LDA topic model for identifying mechanism of disease in TCM", Data Technologies and Applications, Vol. 59 No. 1, pp. 1-18. https://doi.org/10.1108/DTA-12-2023-0868

Publisher

:

Emerald Publishing Limited

Copyright © 2024, Emerald Publishing Limited

Related articles