Data-centric machine learning in quantum information science

被引:5
|
作者
Lohani, Sanjaya [1 ]
Lukens, Joseph M. [2 ]
Glasser, Ryan T. [3 ]
Searles, Thomas A. [1 ]
Kirby, Brian T. [3 ,4 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
[2] Oak Ridge Natl Lab, Quantum Informat Sci Sect, POB 2009, Oak Ridge, TN 37831 USA
[3] Tulane Univ, New Orleans, LA 70118 USA
[4] DEVCOM Army Res Lab, Adelphi, MD 20783 USA
来源
关键词
machine learning; quantum noise and quantum operations; quantum tomography; SEPARABILITY; STATES;
D O I
10.1088/2632-2153/ac9036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a series of data-centric heuristics for improving the performance of machine learning systems when applied to problems in quantum information science. In particular, we consider how systematic engineering of training sets can significantly enhance the accuracy of pre-trained neural networks used for quantum state reconstruction without altering the underlying architecture. We find that it is not always optimal to engineer training sets to exactly match the expected distribution of a target scenario, and instead, performance can be further improved by biasing the training set to be slightly more mixed than the target. This is due to the heterogeneity in the number of free variables required to describe states of different purity, and as a result, overall accuracy of the network improves when training sets of a fixed size focus on states with the least constrained free variables. For further clarity, we also include a 'toy model' demonstration of how spurious correlations can inadvertently enter synthetic data sets used for training, how the performance of systems trained with these correlations can degrade dramatically, and how the inclusion of even relatively few counterexamples can effectively remedy such problems.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Quantum computing for data-centric engineering and science
    Herbert, Steven
    [J]. DATA-CENTRIC ENGINEERING, 2022, 3 (04):
  • [2] Machine learning for data-centric epidemic forecasting
    Rodriguez, Alexander
    Kamarthi, Harshavardhan
    Agarwal, Pulak
    Ho, Javen
    Patel, Mira
    Sapre, Suchet
    Prakash, B. Aditya
    [J]. NATURE MACHINE INTELLIGENCE, 2024,
  • [3] A Data-Centric Optimization Framework for Machine Learning
    Rausch, Oliver
    Ben-Nun, Tal
    Dryden, Nikoli
    Ivanov, Andrei
    Li, Shigang
    Hoefler, Torsten
    [J]. PROCEEDINGS OF THE 36TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ICS 2022, 2022,
  • [4] Data-Centric Approaches to Radio Frequency Machine Learning
    Kuzdeba, Scott
    Robinson, Josh
    [J]. 2022 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2022,
  • [5] Data-Centric Machine Learning Pipeline for Hardware Verification
    Shin, Hongsup
    [J]. 2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 11 - 12
  • [6] Taxonomy of machine learning paradigms: A data-centric perspective
    Emmert-Streib, Frank
    Dehmer, Matthias
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (05)
  • [7] Data-centric science for materials innovation
    Tanaka, Isao
    Rajan, Krishna
    Wolverton, Christopher
    [J]. MRS BULLETIN, 2018, 43 (09) : 659 - 663
  • [8] Roadmap on data-centric materials science
    Bauer, Stefan
    Benner, Peter
    Bereau, Tristan
    Blum, Volker
    Boley, Mario
    Carbogno, Christian
    Catlow, C. Richard A.
    Dehm, Gerhard
    Eibl, Sebastian
    Ernstorfer, Ralph
    Fekete, Adam
    Foppa, Lucas
    Fratzl, Peter
    Freysoldt, Christoph
    Gault, Baptiste
    Ghiringhelli, Luca M.
    Giri, Sajal K.
    Gladyshev, Anton
    Goyal, Pawan
    Hattrick-Simpers, Jason
    Kabalan, Lara
    Karpov, Petr
    Khorrami, Mohammad S.
    Koch, Christoph T.
    Kokott, Sebastian
    Kosch, Thomas
    Kowalec, Igor
    Kremer, Kurt
    Leitherer, Andreas
    Li, Yue
    Liebscher, Christian H.
    Logsdail, Andrew J.
    Lu, Zhongwei
    Luong, Felix
    Marek, Andreas
    Merz, Florian
    Mianroodi, Jaber R.
    Neugebauer, Joerg
    Pei, Zongrui
    Purcell, Thomas A. R.
    Raabe, Dierk
    Rampp, Markus
    Rossi, Mariana
    Rost, Jan-Michael
    Saal, James
    Saalmann, Ulf
    Sasidhar, Kasturi Narasimha
    Saxena, Alaukik
    Sbailo, Luigi
    Scheidgen, Markus
    [J]. MODELLING AND SIMULATION IN MATERIALS SCIENCE AND ENGINEERING, 2024, 32 (06)
  • [9] Data-centric science for materials innovation
    Isao Tanaka
    Krishna Rajan
    Christopher Wolverton
    [J]. MRS Bulletin, 2018, 43 : 659 - 663
  • [10] Data-centric approach to improve machine learning models for inorganic materials
    Bartel, Christopher J.
    [J]. PATTERNS, 2021, 2 (11):