TraViQuA: Natural Language Driven Traffic Video Querying Using Deep Learning

被引:0
|
作者
Yuksel, Asim Sinan [1 ]
Karabiyik, Muhammed Abdulhamid [2 ]
机构
[1] Suleyman Demirel Univ, Dept Comp Engn, TR-32100 Isparta, Turkiye
[2] Nigde Omer Halisdemir Univ, Bor Vocat Sch, TR-51100 Nigde, Turkiye
关键词
natural language processing (NLP) you; only look once (YOLO) long short-term; memory (LSTM) video query deep learning;
D O I
10.18280/ts.400213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video cameras are widely utilized and have ingrained themselves into many aspects of our daily life. Analysis of video contents is more challenging as the size of the data collected from the cameras increases. The fundamental cause of this challenge is because certain data, like the videos, cannot be queried. Our research focuses on converting traffic videos into a structure that can be queried. Specifically, an application called TraViQuA was suggested f or natural language-based car search and localization in traffic videos. To query and identify cars, data including color, brand, and appearance time are used as features. The query is initiated in real time on live traffic feed, as the user enters the search term on the application interface. Our text to SQL conversion algorithm enables the mapping of a search term into a SQL query. Based on the response to the natural language query, TraViQuA can start the video from the relevant time. Deep neural networks were employed in our application for text to SQL conversion and feature extraction. Our research reveals that color and brand models had mean average precision of 98.714% and 91.742%, respectively. The text to SQL conversion had an 80% accuracy rate. To the best of our knowledge, TraViQuA is the first application that enables police officers to input a natural language description of a car and discover the car of interest that matches the description, bridging the gap in traffic video surveillance. Moreover, TraViQuA can be incorporated into other intelligent transportation systems to support law enforcement officials in urgent situations like hit-and-run incidents and amber alerts.
引用
收藏
页码:543 / 553
页数:11
相关论文
共 50 条
  • [1] Querying NoSQL with Deep Learning to Answer Natural Language Questions
    Blank, Sebastian
    Wilhelm, Florian
    Zorn, Hans-Peter
    Rettinger, Achim
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9416 - 9421
  • [2] Natural language querying for video databases
    Erozel, Guzen
    Cicekli, Nihan Kesim
    Cicekli, Ilyas
    INFORMATION SCIENCES, 2008, 178 (12) : 2534 - 2552
  • [3] NADAQ: Natural Language Database Querying Based on Deep Learning
    Xu, Boyan
    Cai, Ruichu
    Zhang, Zhenjie
    Yang, Xiaoyan
    Hao, Zhifeng
    Li, Zijian
    Liang, Zhihao
    IEEE ACCESS, 2019, 7 : 35012 - 35017
  • [4] A deep learning model for natural language querying in Cyber-Physical Systems
    Llopis, Juan Alberto
    Fernandez-Garcia, Antonio Jesus
    Criado, Javier
    Iribarne, Luis
    Ayala, Rosa
    Wang, James Z.
    INTERNET OF THINGS, 2023, 24
  • [5] Querying an Ontology Using Natural Language
    Salgueiro, Ana Marisa
    Alves, Catarina Bile
    Balsa, Joao
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 164 - 169
  • [6] A natural language-based interface for querying a video database
    Kuecuektunc, Onur
    Gueduekbay, Ugur
    Ulusoy, Oezgur
    IEEE MULTIMEDIA, 2007, 14 (01) : 83 - 89
  • [7] Querying Database using a universal Natural Language Interface Based on Machine Learning
    Bais, Hanane
    Machkour, Mustapha
    Koutti, Lahcen
    2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD), 2016,
  • [8] VOICEQUERYSYSTEM: A Voice-driven Database Querying System Using Natural Language Questions
    Song, Yuanfeng
    Wong, Raymond Chi-Wing
    Zhao, Xuefang
    Jiang, Di
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2385 - 2388
  • [9] Natural language driven video sequencer
    Terebijon Gakkaishi, 10 (1585):
  • [10] Encrypted malicious traffic detection based on natural language processing and deep learning
    Zang X.
    Wang T.
    Zhang X.
    Gong J.
    Gao P.
    Zhang G.
    Computer Networks, 2024, 250