中国物理B ›› 2023, Vol. 32 ›› Issue (11): 118104-118104.doi: 10.1088/1674-1056/ad04cb

所属专题: Featured Column — COMPUTATIONAL PROGRAMS FOR PHYSICS

• • 上一篇    下一篇

MatChat: A large language model and application service platform for materials science

Zi-Yi Chen(陈子逸)1,2,†, Fan-Kai Xie(谢帆恺)3,4,†, Meng Wan(万萌)1,†, Yang Yuan(袁扬)1,2, Miao Liu(刘淼)3,5,6,‡, Zong-Guo Wang(王宗国)1,2,§, Sheng Meng(孟胜)3,5, and Yan-Gang Wang(王彦棡)1,2   

  1. 1 Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China;
    2 University of Chinese Academy of Sciences, Beijing 100049, China;
    3 Institute of Physics, Chinese Academy of Sciences, Beijing 100190, China;
    4 School of Physical Sciences, University of Chinese Academy of Sciences, Beijing 100190, China;
    5 Songshan Lake Materials Laboratory, Dongguan 523808, China;
    6 Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
  • 收稿日期:2023-10-11 修回日期:2023-10-18 接受日期:2023-10-19 出版日期:2023-10-16 发布日期:2023-11-03
  • 通讯作者: Miao Liu, Zong-Guo Wang E-mail:mliu@iphy.ac.cn;wangzg@cnic.cn
  • 基金资助:
    This work was supported by the Informatization Plan of the Chinese Academy of Sciences (Grant No. CASWX2023SF-0101), the Key Research Program of Frontier Sciences, CAS (Grant No. ZDBS-LY-7025), the Youth Innovation Promotion Association CAS (Grant No. 2021167), and the Strategic Priority Research Program of Chinese Academy of Sciences (Grant No. XDB33020000).

MatChat: A large language model and application service platform for materials science

Zi-Yi Chen(陈子逸)1,2,†, Fan-Kai Xie(谢帆恺)3,4,†, Meng Wan(万萌)1,†, Yang Yuan(袁扬)1,2, Miao Liu(刘淼)3,5,6,‡, Zong-Guo Wang(王宗国)1,2,§, Sheng Meng(孟胜)3,5, and Yan-Gang Wang(王彦棡)1,2   

  1. 1 Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China;
    2 University of Chinese Academy of Sciences, Beijing 100049, China;
    3 Institute of Physics, Chinese Academy of Sciences, Beijing 100190, China;
    4 School of Physical Sciences, University of Chinese Academy of Sciences, Beijing 100190, China;
    5 Songshan Lake Materials Laboratory, Dongguan 523808, China;
    6 Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2023-10-11 Revised:2023-10-18 Accepted:2023-10-19 Online:2023-10-16 Published:2023-11-03
  • Contact: Miao Liu, Zong-Guo Wang E-mail:mliu@iphy.ac.cn;wangzg@cnic.cn
  • Supported by:
    This work was supported by the Informatization Plan of the Chinese Academy of Sciences (Grant No. CASWX2023SF-0101), the Key Research Program of Frontier Sciences, CAS (Grant No. ZDBS-LY-7025), the Youth Innovation Promotion Association CAS (Grant No. 2021167), and the Strategic Priority Research Program of Chinese Academy of Sciences (Grant No. XDB33020000).

摘要: The prediction of chemical synthesis pathways plays a pivotal role in materials science research. Challenges, such as the complexity of synthesis pathways and the lack of comprehensive datasets, currently hinder our ability to predict these chemical processes accurately. However, recent advancements in generative artificial intelligence (GAI), including automated text generation and question-answering systems, coupled with fine-tuning techniques, have facilitated the deployment of large-scale AI models tailored to specific domains. In this study, we harness the power of the LLaMA2-7B model and enhance it through a learning process that incorporates 13878 pieces of structured material knowledge data. This specialized AI model, named MatChat, focuses on predicting inorganic material synthesis pathways. MatChat exhibits remarkable proficiency in generating and reasoning with knowledge in materials science. Although MatChat requires further refinement to meet the diverse material design needs, this research undeniably highlights its impressive reasoning capabilities and innovative potential in materials science. MatChat is now accessible online and open for use, with both the model and its application framework available as open source. This study establishes a robust foundation for collaborative innovation in the integration of generative AI in materials science.

关键词: MatChat, materials science, generative artificial intelligence

Abstract: The prediction of chemical synthesis pathways plays a pivotal role in materials science research. Challenges, such as the complexity of synthesis pathways and the lack of comprehensive datasets, currently hinder our ability to predict these chemical processes accurately. However, recent advancements in generative artificial intelligence (GAI), including automated text generation and question-answering systems, coupled with fine-tuning techniques, have facilitated the deployment of large-scale AI models tailored to specific domains. In this study, we harness the power of the LLaMA2-7B model and enhance it through a learning process that incorporates 13878 pieces of structured material knowledge data. This specialized AI model, named MatChat, focuses on predicting inorganic material synthesis pathways. MatChat exhibits remarkable proficiency in generating and reasoning with knowledge in materials science. Although MatChat requires further refinement to meet the diverse material design needs, this research undeniably highlights its impressive reasoning capabilities and innovative potential in materials science. MatChat is now accessible online and open for use, with both the model and its application framework available as open source. This study establishes a robust foundation for collaborative innovation in the integration of generative AI in materials science.

Key words: MatChat, materials science, generative artificial intelligence

中图分类号:  (New materials: theory, design, and fabrication)

  • 81.05.Zx
01.50.hv (Computer software and software reviews) 81.16.Be (Chemical synthesis methods)