Data Column Prediction: Experiment in Automated Column Tagging Using Machine Learning

Cited by: 0
Authors
McCabe, S. [1]
Cropp, B. [1]
Coles, J. [1]
Del Vecchio, J. [1]
Ekstrum, J. [1]
Affiliations
[1] CUBRC Inc, 4455 Genesee St, Buffalo, NY 14226 USA
Keywords
machine learning; column prediction; ontology; intelligence analysis
DOI
10.1117/12.2519305
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Department of Defense critically needs tools that rapidly identify and align data from different sources, especially for automated ingestion. In the current open-source Karma Mapping Tool, a Steiner tree optimization algorithm suggests semantic types during data alignment. We hypothesize that machine learning (ML) may perform better than the Steiner approach on a subset of column types, or "labels", where (1) the data is extremely similar in structure and content and (2) inferring the column type correctly depends heavily on the interrelated components of the dataset. In this session we discuss the experimental design, our initial results, and a path toward future work in broader applications, beginning with intelligence analysis in the maritime domain. The initial results show promise for using ML to do column prediction in analysis environments that contain much similar or overlapping data.
Pages: 8