Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints

Ikuo Keshi; Ryota Daimon; Yutaka Takaoka; Atsushi Hayashi

doi:10.5220/0012927100003838

Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints

Ikuo Keshi, Ryota Daimon, Yutaka Takaoka, Atsushi Hayashi

研究成果: 書籍の章/レポート/会議録 › 会議への寄与 › 査読

抄録

This study compared semantic representation learning + machine learning, BERT, and GPT-4 to estimate disease names from chief complaints and evaluate their accuracy. Semantic representation learning + machine learning showed high accuracy for chief complaints of at least 10 characters in the International Classification of Diseases 10th Revision (ICD-10) codes middle categories, slightly surpassing BERT. For GPT-4, the Retrieval Augmented Generation (RAG) method achieved the best performance, with a Top-5 accuracy of 84.5% when all chief complaints, including the evaluation data, were used. Additionally, the latest GPT-4o model further improved the Top-5 accuracy to 90.0%. These results suggest the potential of these methods as diagnostic support tools. Future work aims to enhance disease name estimation through more extensive evaluations by experienced physicians.

本文言語	英語
ホスト出版物のタイトル	16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
編集者	Frans Coenen, Ana Fred, Jorge Bernardino
出版社	Science and Technology Publications, Lda
ページ	294-301
ページ数	8
ISBN（電子版）	9789897587160
DOI	https://doi.org/10.5220/0012927100003838
出版ステータス	出版済み - 2024
イベント	16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024 - Porto, ポルトガル継続期間: 2024/11/17 → 2024/11/19

出版物シリーズ

名前	International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings
巻	1
ISSN（電子版）	2184-3228

学会

学会	16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024
国/地域	ポルトガル
City	Porto
Period	2024/11/17 → 2024/11/19

ASJC Scopus 主題領域

ソフトウェア
技術マネージメントおよび技術革新管理
戦略と経営

文献へのアクセス

10.5220/0012927100003838

フィンガープリント

「Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル

Keshi, I., Daimon, R., Takaoka, Y., & Hayashi, A. (2024). Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints. In F. Coenen, A. Fred, & J. Bernardino (Eds.), 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (pp. 294-301). (International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings; Vol. 1). Science and Technology Publications, Lda. https://doi.org/10.5220/0012927100003838

Keshi, Ikuo ; Daimon, Ryota ; Takaoka, Yutaka その他. / Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints. 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. editor / Frans Coenen ; Ana Fred ; Jorge Bernardino. Science and Technology Publications, Lda, 2024. pp. 294-301 (International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings).

@inproceedings{a5a4a45c7e13455d975be3e5f0b51af2,

title = "Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints",

abstract = "This study compared semantic representation learning + machine learning, BERT, and GPT-4 to estimate disease names from chief complaints and evaluate their accuracy. Semantic representation learning + machine learning showed high accuracy for chief complaints of at least 10 characters in the International Classification of Diseases 10th Revision (ICD-10) codes middle categories, slightly surpassing BERT. For GPT-4, the Retrieval Augmented Generation (RAG) method achieved the best performance, with a Top-5 accuracy of 84.5% when all chief complaints, including the evaluation data, were used. Additionally, the latest GPT-4o model further improved the Top-5 accuracy to 90.0%. These results suggest the potential of these methods as diagnostic support tools. Future work aims to enhance disease name estimation through more extensive evaluations by experienced physicians.",

keywords = "BERT, Chief Complaints, Disease Name Estimation, Electronic Medical Record (EMR), GPT-4, Generative AI, Medical AI, Medical Diagnostic Support Tool, Semantic Representation Learning",

author = "Ikuo Keshi and Ryota Daimon and Yutaka Takaoka and Atsushi Hayashi",

note = "Publisher Copyright: {\textcopyright} 2024 by SCITEPRESS – Science and Technology Publications, Lda.; 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024 ; Conference date: 17-11-2024 Through 19-11-2024",

year = "2024",

doi = "10.5220/0012927100003838",

language = "英語",

series = "International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings",

publisher = "Science and Technology Publications, Lda",

pages = "294--301",

editor = "Frans Coenen and Ana Fred and Jorge Bernardino",

booktitle = "16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management",

}

Keshi, I, Daimon, R, Takaoka, Y & Hayashi, A 2024, Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints. in F Coenen, A Fred & J Bernardino (eds), 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings, vol. 1, Science and Technology Publications, Lda, pp. 294-301, 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024, Porto, ポルトガル, 2024/11/17. https://doi.org/10.5220/0012927100003838

Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints. / Keshi, Ikuo; Daimon, Ryota; Takaoka, Yutaka その他.
16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. ed. / Frans Coenen; Ana Fred; Jorge Bernardino. Science and Technology Publications, Lda, 2024. p. 294-301 (International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings; Vol. 1).

研究成果: 書籍の章/レポート/会議録 › 会議への寄与 › 査読

TY - GEN

T1 - Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints

AU - Keshi, Ikuo

AU - Daimon, Ryota

AU - Takaoka, Yutaka

AU - Hayashi, Atsushi

PY - 2024

Y1 - 2024

N2 - This study compared semantic representation learning + machine learning, BERT, and GPT-4 to estimate disease names from chief complaints and evaluate their accuracy. Semantic representation learning + machine learning showed high accuracy for chief complaints of at least 10 characters in the International Classification of Diseases 10th Revision (ICD-10) codes middle categories, slightly surpassing BERT. For GPT-4, the Retrieval Augmented Generation (RAG) method achieved the best performance, with a Top-5 accuracy of 84.5% when all chief complaints, including the evaluation data, were used. Additionally, the latest GPT-4o model further improved the Top-5 accuracy to 90.0%. These results suggest the potential of these methods as diagnostic support tools. Future work aims to enhance disease name estimation through more extensive evaluations by experienced physicians.

AB - This study compared semantic representation learning + machine learning, BERT, and GPT-4 to estimate disease names from chief complaints and evaluate their accuracy. Semantic representation learning + machine learning showed high accuracy for chief complaints of at least 10 characters in the International Classification of Diseases 10th Revision (ICD-10) codes middle categories, slightly surpassing BERT. For GPT-4, the Retrieval Augmented Generation (RAG) method achieved the best performance, with a Top-5 accuracy of 84.5% when all chief complaints, including the evaluation data, were used. Additionally, the latest GPT-4o model further improved the Top-5 accuracy to 90.0%. These results suggest the potential of these methods as diagnostic support tools. Future work aims to enhance disease name estimation through more extensive evaluations by experienced physicians.

KW - BERT

KW - Chief Complaints

KW - Disease Name Estimation

KW - Electronic Medical Record (EMR)

KW - GPT-4

KW - Generative AI

KW - Medical AI

KW - Medical Diagnostic Support Tool

KW - Semantic Representation Learning

UR - http://www.scopus.com/inward/record.url?scp=85215262993&partnerID=8YFLogxK

U2 - 10.5220/0012927100003838

DO - 10.5220/0012927100003838

M3 - 会議への寄与

AN - SCOPUS:85215262993

T3 - International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings

SP - 294

EP - 301

BT - 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management

A2 - Coenen, Frans

A2 - Fred, Ana

A2 - Bernardino, Jorge

PB - Science and Technology Publications, Lda

T2 - 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024

Y2 - 17 November 2024 through 19 November 2024

ER -

Keshi I, Daimon R, Takaoka Y , Hayashi A. Integrated Evaluation of Semantic Representation Learning, BERT, and Generative AI for Disease Name Estimation Based on Chief Complaints. In Coenen F, Fred A, Bernardino J, editors, 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management. Science and Technology Publications, Lda. 2024. p. 294-301. (International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings). doi: 10.5220/0012927100003838