TY - JOUR
T1 - Application of machine learning to a material library for modeling of relationships between material properties and tablet properties
AU - Hayashi, Yoshihiro
AU - Nakano, Yuri
AU - Marumo, Yuki
AU - Kumada, Shungo
AU - Okada, Kotaro
AU - Onuki, Yoshinori
N1 - Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2021/11/20
Y1 - 2021/11/20
N2 - This study investigates the usefulness of machine learning for modeling complex relationships in a material library. We tested 81 types of active pharmaceutical ingredients (APIs) and their tablets to construct the library, which included the following variables: 20 types of API material properties, one type of process parameter (three levels of compression pressure), and two types of tablet properties (tensile strength (TS) and disintegration time (DT)). The machine learning algorithms boosted tree (BT) and random forest (RF) were applied to analysis of our material library to model the relationships between input variables (material properties and compression pressure) and output variables (TS and DT). The calculated BT and RF models achieved higher performance statistics compared with a conventional modeling method (i.e., partial least squares regression), and revealed the material properties that strongly influence TS and DT. For TS, true density, the tenth percentile of the cumulative percentage size distribution, loss on drying, and compression pressure were of high relative importance. For DT, total surface energy, water absorption rate, polar surface energy, and hygroscopicity had significant effects. Thus, we demonstrate that BT and RF can be used to model complex relationships and clarify important material properties in a material library.
AB - This study investigates the usefulness of machine learning for modeling complex relationships in a material library. We tested 81 types of active pharmaceutical ingredients (APIs) and their tablets to construct the library, which included the following variables: 20 types of API material properties, one type of process parameter (three levels of compression pressure), and two types of tablet properties (tensile strength (TS) and disintegration time (DT)). The machine learning algorithms boosted tree (BT) and random forest (RF) were applied to analysis of our material library to model the relationships between input variables (material properties and compression pressure) and output variables (TS and DT). The calculated BT and RF models achieved higher performance statistics compared with a conventional modeling method (i.e., partial least squares regression), and revealed the material properties that strongly influence TS and DT. For TS, true density, the tenth percentile of the cumulative percentage size distribution, loss on drying, and compression pressure were of high relative importance. For DT, total surface energy, water absorption rate, polar surface energy, and hygroscopicity had significant effects. Thus, we demonstrate that BT and RF can be used to model complex relationships and clarify important material properties in a material library.
KW - Boosted tree
KW - Machine learning
KW - Material library
KW - Quality by design
KW - Random forest
KW - Tablet
UR - http://www.scopus.com/inward/record.url?scp=85116893155&partnerID=8YFLogxK
U2 - 10.1016/j.ijpharm.2021.121158
DO - 10.1016/j.ijpharm.2021.121158
M3 - 学術論文
C2 - 34624447
AN - SCOPUS:85116893155
SN - 0378-5173
VL - 609
JO - International Journal of Pharmaceutics
JF - International Journal of Pharmaceutics
M1 - 121158
ER -