Role of multifidelity data in sequential active learning materials discovery campaigns: case study of electronic bandgap

被引:3
|
作者
Jacobs, Ryan [1 ]
Goins, Philip E. [2 ]
Morgan, Dane [1 ]
机构
[1] Univ Wisconsin, Dept Mat Sci & Engn, Madison, WI 53706 USA
[2] US Army, CCDC, Res Lab, 6300 Rodman Rd, Aberdeen, MD 21005 USA
来源
关键词
machine learning; multifidelity data; active learning; materials discovery;
D O I
10.1088/2632-2153/ad1627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Materials discovery and design typically proceeds through iterative evaluation (both experimental and computational) to obtain data, generally targeting improvement of one or more properties under one or more constraints (e.g. time or budget). However, there can be great variation in the quality and cost of different data, and when they are mixed together in what we here call multifidelity data, the optimal approaches to their utilization are not established. It is therefore important to develop strategies to acquire and use multifidelity data to realize the most efficient iterative materials exploration. In this work, we assess the impact of using multifidelity data through mock demonstration of designing solar cell materials, using the electronic bandgap as the target property. We propose a new approach of using multifidelity data through leveraging machine learning models of both low- and high-fidelity data, where using predicted low-fidelity data as an input feature in the high-fidelity model can improve the impact of a multifidelity data approach. We show how tradeoffs of low- versus high-fidelity measurement cost and acquisition can impact the materials discovery process. We find that the use of multifidelity data has maximal impact on the materials discovery campaign when approximately five low-fidelity measurements per high-fidelity measurement are performed, and when the cost of low-fidelity measurements is approximately 5% or less than that of high-fidelity measurements. This work provides practical guidance and useful qualitative measures for improving materials discovery campaigns that involve multifidelity data.
引用
收藏
页数:13
相关论文
共 24 条
  • [1] Accelerating active learning materials discovery with FAIR data and workflows: A case study for alloy melting temperatures
    Harwani, Mohnish
    Verduzco, Juan C.
    Lee, Brian H.
    Strachan, Alejandro
    COMPUTATIONAL MATERIALS SCIENCE, 2025, 249
  • [2] Active Learning with Realistic Data - A Case Study
    Calma, Adrian
    Stolz, Moritz
    Kottke, Daniel
    Tomforde, Sven
    Sick, Bernhard
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [3] Revolutionizing Energetic Materials Discovery and Design: The Role of Data Science and Machine Learning
    Gee, Richard
    Lindsey, Rebecca
    PROPELLANTS EXPLOSIVES PYROTECHNICS, 2023, 48 (04)
  • [4] A study of the role of data and model uncertainty in active learning
    Li, Yahao
    Jiang, Errui
    Ni, Ziqi
    Li, Wudi
    Huang, Ming
    Zhao, Fengyuan
    Liu, Fengqi
    Ye, Yicong
    Bai, Shuxin
    COMPUTATIONAL MATERIALS SCIENCE, 2025, 247
  • [5] A DESCRIPTIVE STUDY OF STUDENTS' ACTIVE ROLE THROUGH LESSON STUDY-BASED DISCOVERY LEARNING
    Gunawan
    Setyaningsih, Eka
    PROCEEDINGS OF THE 4TH ASIA PACIFIC EDUCATION CONFERENCE (AECON 2017), 2017, 109 : 103 - 107
  • [6] Data mining and knowledge discovery in materials science and engineering: A polymer nanocomposites case study
    AbuOmar, O.
    Nouranian, S.
    King, R.
    Bouvard, J. L.
    Toghiani, H.
    Lacy, T. E.
    Pittman, C. U., Jr.
    ADVANCED ENGINEERING INFORMATICS, 2013, 27 (04) : 615 - 624
  • [7] Learning Active Implementation Frameworks: the role of implementation teams in a case study from Pakistan
    Hamid, Saima
    Mureed, Sheh
    Kayani, Aasia
    Javed, Kiran
    Khan, Adnan
    Awais, Sayema
    Khan, Neelam
    Tus-Salam, Fakiha
    Fixsen, Dean L.
    GLOBAL HEALTH ACTION, 2020, 13 (01)
  • [8] A computational learning paradigm to targeted discovery of biocatalysts from metagenomic data: A case study of lipase identification
    Shahraki, Mehdi F.
    Atanaki, Fereshteh F.
    Ariaeenejad, Shohreh
    Ghaffari, Mohammad R.
    Norouzi-Beirami, Mohammad H.
    Maleki, Morteza
    Salekdeh, Ghasem H.
    Kavousi, Kaveh
    BIOTECHNOLOGY AND BIOENGINEERING, 2022, 119 (04) : 1115 - 1128
  • [9] Bridging the Gap Between Theory and Active Learning: A Case Study of Project-Based Learning in Introduction to Materials Science and Engineering
    Lopera, Henry A. Colorado
    Gutierrez-Velasquez, Elkin
    Ballesteros, Nancy
    Revista Iberoamericana de Tecnologias del Aprendizaje, 2022, 17 (02): : 160 - 169
  • [10] Bridging the Gap Between Theory and Active Learning: A Case Study of Project-Based Learning in Introduction to Materials Science and Engineering
    Colorado Lopera, Henry A.
    Gutierrez-Velasquez, Elkin
    Ballesteros, Nancy
    IEEE REVISTA IBEROAMERICANA DE TECNOLOGIAS DEL APRENDIZAJE-IEEE RITA, 2022, 17 (02): : 160 - 169