TY - JOUR
T1 - Explainable machine learning for materials discovery
T2 - Predicting the potentially formable Nd-Fe-B crystal structures and extracting the structure-stability relationship
AU - Pham, Tien Lam
AU - Nguyen, Duong Nguyen
AU - Ha, Minh Quyet
AU - Kino, Hiori
AU - Miyakeb, Takashi
AU - Dam, Hieu Chi
N1 - Funding Information:
Funding for this research was provided by the Ministry of Education, Culture, Sports, Science and Technology (MEXT) (award No. ESICMM12016013 to HCD, TLP, DNN, HK, TM); MEXT (JST) Precursory Research for Embryonic Science and Technology (PRESTO) (award to HCD); MEXT, Japan Society for the Promotion of Science (JSPS) (KAKENHI grant Nos. JP19H05815 to HCD; 20K05301) (Grant-in-Aid for Scientific Research on Innovative Areas ‘Interface Ionics’); Materials research was carried out by the Information Integration Initiative (MI2I) project of the Support Program for Starting Up Innovation Hub from JST, and MEXT as a social and scientific priority issue employing the post-K computer (creation of new functional devices and high-performance materials to support next-generation industries; CDMSI) (awarded to HCD, HK and TM).
Publisher Copyright:
© 2020.
PY - 2020/11/1
Y1 - 2020/11/1
N2 - New Nd-Fe-B crystal structures can be formed via the elemental substitution of LA-T-X host structures, including lanthanides (LA), transition metals (T) and light elements, X = B, C, N and O. The 5967 samples of ternary LA-T-X materials that are collected are then used as the host structures. For each host crystal structure, a substituted crystal structure is created by substituting all lanthanide sites with Nd, all transition metal sites with Fe and all light-element sites with B. High-throughput first-principles calculations are applied to evaluate the phase stability of the newly created crystal structures, and 20 of them are found to be potentially formable. A data-driven approach based on supervised and unsupervised learning techniques is applied to estimate the stability and analyze the structure-stability relationship of the newly created Nd-Fe-B crystal structures. For predicting the stability for the newly created Nd-Fe-B structures, three supervised learning models: kernel ridge regression, logistic classification and decision tree model, are learned from the LA-T-X host crystal structures; the models achieved maximum accuracy and recall scores of 70.4 and 68.7%, respectively. On the other hand, our proposed unsupervised learning model based on the integration of descriptor-relevance analysis and a Gaussian mixture model achieved an accuracy and recall score of 72.9 and 82.1%, respectively, which are significantly better than those of the supervised models. While capturing and interpreting the structure-stability relationship of the Nd-Fe-B crystal structures, the unsupervised learning model indicates that the average atomic coordination number and coordination number of the Fe sites are the most important factors in determining the phase stability of the new substituted Nd-Fe-B crystal structures.
AB - New Nd-Fe-B crystal structures can be formed via the elemental substitution of LA-T-X host structures, including lanthanides (LA), transition metals (T) and light elements, X = B, C, N and O. The 5967 samples of ternary LA-T-X materials that are collected are then used as the host structures. For each host crystal structure, a substituted crystal structure is created by substituting all lanthanide sites with Nd, all transition metal sites with Fe and all light-element sites with B. High-throughput first-principles calculations are applied to evaluate the phase stability of the newly created crystal structures, and 20 of them are found to be potentially formable. A data-driven approach based on supervised and unsupervised learning techniques is applied to estimate the stability and analyze the structure-stability relationship of the newly created Nd-Fe-B crystal structures. For predicting the stability for the newly created Nd-Fe-B structures, three supervised learning models: kernel ridge regression, logistic classification and decision tree model, are learned from the LA-T-X host crystal structures; the models achieved maximum accuracy and recall scores of 70.4 and 68.7%, respectively. On the other hand, our proposed unsupervised learning model based on the integration of descriptor-relevance analysis and a Gaussian mixture model achieved an accuracy and recall score of 72.9 and 82.1%, respectively, which are significantly better than those of the supervised models. While capturing and interpreting the structure-stability relationship of the Nd-Fe-B crystal structures, the unsupervised learning model indicates that the average atomic coordination number and coordination number of the Fe sites are the most important factors in determining the phase stability of the new substituted Nd-Fe-B crystal structures.
KW - data mining
KW - first-principles calculations
KW - machine learning
KW - materials informatics
KW - new magnets
UR - http://www.scopus.com/inward/record.url?scp=85095841236&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095841236&partnerID=8YFLogxK
U2 - 10.1107/S2052252520010088
DO - 10.1107/S2052252520010088
M3 - Article
AN - SCOPUS:85095841236
SN - 2052-2525
VL - 7
SP - 1036
EP - 1047
JO - IUCrJ
JF - IUCrJ
ER -