TY - GEN
T1 - Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints
AU - Keshi, Ikuo
AU - Daimon, Ryota
AU - Takaoka, Yutaka
AU - Hayashi, Atsushi
N1 - Publisher Copyright:
© 2024 by SCITEPRESS – Science and Technology Publications, Lda.
PY - 2024
Y1 - 2024
N2 - This study compared semantic representation learning + machine learning, BERT, and GPT-4 to estimate disease names from chief complaints and evaluate their accuracy. Semantic representation learning + machine learning showed high accuracy for chief complaints of at least 10 characters in the International Classification of Diseases 10th Revision (ICD-10) codes middle categories, slightly surpassing BERT. For GPT-4, the Retrieval Augmented Generation (RAG) method achieved the best performance, with a Top-5 accuracy of 84.5% when all chief complaints, including the evaluation data, were used. Additionally, the latest GPT-4o model further improved the Top-5 accuracy to 90.0%. These results suggest the potential of these methods as diagnostic support tools. Future work aims to enhance disease name estimation through more extensive evaluations by experienced physicians.
AB - This study compared semantic representation learning + machine learning, BERT, and GPT-4 to estimate disease names from chief complaints and evaluate their accuracy. Semantic representation learning + machine learning showed high accuracy for chief complaints of at least 10 characters in the International Classification of Diseases 10th Revision (ICD-10) codes middle categories, slightly surpassing BERT. For GPT-4, the Retrieval Augmented Generation (RAG) method achieved the best performance, with a Top-5 accuracy of 84.5% when all chief complaints, including the evaluation data, were used. Additionally, the latest GPT-4o model further improved the Top-5 accuracy to 90.0%. These results suggest the potential of these methods as diagnostic support tools. Future work aims to enhance disease name estimation through more extensive evaluations by experienced physicians.
KW - BERT
KW - Chief Complaints
KW - Disease Name Estimation
KW - Electronic Medical Record (EMR)
KW - GPT-4
KW - Generative AI
KW - Medical AI
KW - Medical Diagnostic Support Tool
KW - Semantic Representation Learning
UR - http://www.scopus.com/inward/record.url?scp=85215262993&partnerID=8YFLogxK
U2 - 10.5220/0012927100003838
DO - 10.5220/0012927100003838
M3 - 会議への寄与
AN - SCOPUS:85215262993
T3 - International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings
SP - 294
EP - 301
BT - 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
A2 - Coenen, Frans
A2 - Fred, Ana
A2 - Bernardino, Jorge
PB - Science and Technology Publications, Lda
T2 - 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024
Y2 - 17 November 2024 through 19 November 2024
ER -