Leveraging low-fidelity data to improve machine learning of sparse high-fidelity thermal conductivity data via transfer learning

被引:6
|
作者
Liu, Z. [1 ,2 ]
Jiang, M. [3 ]
Luo, T. [1 ,4 ]
机构
[1] Univ Notre Dame, Dept Aerosp & Mech Engn, Notre Dame, IN 46556 USA
[2] Hunan Univ, Sch Phys & Elect, Dept Appl Phys, Changsha 410082, Peoples R China
[3] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[4] Univ Notre Dame, Dept Chem & Biomol Engn, Notre Dame, IN 46556 USA
关键词
Thermal conductivity; Machine learning; Transfer learning; First-principles simulation; TRANSPORT; CRYSTALS; PHONONS; MODEL;
D O I
10.1016/j.mtphys.2022.100868
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Lattice thermal conductivity (TC) of semiconductors is crucial for various applications, ranging from micro-electronics to thermoelectrics. Data-driven approach can potentially establish the critical composition-property relationship needed for fast screening of candidates with desirable TC, but the small number of available data remains the main challenge. TC can be efficiently calculated using empirical models, but they have inferior accuracy compared to the more resource-demanding first-principles calculations. Here, we demonstrate the use of transfer learning (TL) to improve the machine learning models trained on small but high-fidelity TC data from experiments and first-principles calculations, by leveraging a large but low-fidelity data generated from empirical TC models, where the trainings on high-and low-fidelity TC data are treated as different but related tasks. TL improves the model accuracy by as much as 23% in R2 and reduces the average factor difference by as much as 30%. Using the TL model, a large semiconductor database is screened, and several candidates with room tem-perature TC > 350 W/mK are identified and further verified using first-principles simulations. This study demonstrates that TL can leverage big low-fidelity data as a proxy task to improve models for the target task with high-fidelity but small data. Such a capability of TL may have important implications to materials informatics in general.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Low-Fidelity Gradient Updates for High-Fidelity Reprogrammable Iterative Learning Control
    Tseng, Kuan-Yu
    Shamma, Jeff S.
    Dullerud, Geir E.
    [J]. 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 4772 - 4777
  • [2] Machine Learning and Modeling of Ultrasonic Signals for High-Fidelity Data Compression
    Saniie, Jafar
    Govindan, Pramod
    Wang, Boyang
    Zhang, Xin
    Lu, Yufeng
    Oruklu, Erdal
    [J]. 2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
  • [3] High-fidelity data supported ship trajectory prediction via an ensemble machine learning framework
    Zhao, Jiansen
    Lu, Jinquan
    Chen, Xinqiang
    Yan, Zhongwei
    Yan, Ying
    Sun, Yang
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 586
  • [4] Comparison of low-fidelity TRANSIMS and high-fidelity CORSIM highway simulation models with intelligent transportation system data
    Rilett, L.R.
    Kim, K.-O.
    Raney, B.
    [J]. Transportation Research Record, 2000, (1739) : 1 - 8
  • [5] Comparison of low-fidelity TRANSIMS and high-fidelity CORSIM highway simulation models with intelligent transportation system data
    Rilett, LR
    Kim, KO
    Raney, B
    [J]. EVALUATING INTELLIGENT TRANSPORTATION SYSTEMS, ADVANCED TRAVELER INFORMATION SYSTEMS, AND OTHER ARTIFICIAL INTELLIGENCE APPLICATIONS: PLANNING AND ADMINISTRATION, 2000, (1739): : 1 - 8
  • [6] Generating high-fidelity synthetic patient data for assessing machine learning healthcare software
    Allan Tucker
    Zhenchen Wang
    Ylenia Rotalinti
    Puja Myles
    [J]. npj Digital Medicine, 3
  • [7] Generating high-fidelity synthetic patient data for assessing machine learning healthcare software
    Tucker, Allan
    Wang, Zhenchen
    Rotalinti, Ylenia
    Myles, Puja
    [J]. NPJ DIGITAL MEDICINE, 2020, 3 (01)
  • [8] Generating high-fidelity synthetic patient data for assessing machine learning healthcare software
    Tucker A.
    Wang Z.
    Rotalinti Y.
    Myles P.
    [J]. Tucker, Allan (allan.tucker@brunel.ac.uk), 1600, Nature Research (03):
  • [9] Architectures and Patterns for Leveraging High-Frequency, Low-Fidelity Data in the Healthcare Domain
    Zhang, Peng
    White, Jules
    Schmidt, Douglas
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2018, : 463 - 464
  • [10] Data-driven learning of nonlocal physics from high-fidelity synthetic data
    You, Huaiqian
    Yu, Yue
    Trask, Nathaniel
    Gulian, Mamikon
    D'Elia, Marta
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 374