Text Matching with Monte Carlo Tree Search

被引:1
|
作者
He, Yixuan [1 ]
Tao, Shuchang [1 ]
Xu, Jun [1 ]
Guo, Jiafeng [1 ]
Lan, YanYan [1 ]
Cheng, Xueqi [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, CAS Key Lab Network Data Sci & Technol, Beijing, Peoples R China
来源
关键词
Monte Carlo tree search; Markov decision process; Text matching; GAME; GO;
D O I
10.1007/978-3-030-01012-6_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we address a novel reinforcement learning based model for text matching, referred to as MM-Match. Inspired by the success and methodology of the AlphaGo Zero, MM-Match formalizes the problem of text matching with a Monte Carlo tree search (MCTS) enhanced Markov decision process (MDP) model, where the time steps corresponded to the positions in a match matrix from top left corner to down right corner, and each action corresponds to a movement in a direction. Two long short-term memory networks (LSTM) are used to summarize the words in current path and one more LSTM is used to summarize future words. Based on the outputs of LSTMs, the policy guides the move direction and the value predicts the correctness of the whole sentence are produced. The policy and value are then strengthened with MCTS, which takes the produced raw policy and value as inputs, simulates and evaluates the possible direction assignments at the subsequent positions, and outputs a better search policy for assigning directions. A reinforcement learning algorithm is proposed to train the model parameters. Our work innovatively applies an MDP model to the text matching task. MM-Match can accurately predict the directions thanks to the exploratory decision making mechanism introduced by MCTS. Experimental results showed that MM-Match performs similar to the classical text matching models including MatchPyramid and MatchSRNN.
引用
收藏
页码:41 / 52
页数:12
相关论文
共 50 条
  • [1] Geometric Graph Matching Using Monte Carlo Tree Search
    Pinheiro, Miguel Amavel
    Kybic, Jan
    Fua, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (11) : 2171 - 2185
  • [2] GEOMETRICAL GRAPH MATCHING USING MONTE CARLO TREE SEARCH
    Pinheiro, Miguel Amavel
    Kybic, Jan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3145 - 3149
  • [3] Extracting Knowledge from Web Text with Monte Carlo Tree Search
    Liu, Guiliang
    Li, Xu
    Wang, Jiakang
    Sun, Mingming
    Li, Ping
    [J]. WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2585 - 2591
  • [4] Multiagent Monte Carlo Tree Search
    Zerbel, Nicholas
    Yliniemi, Logan
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2309 - 2311
  • [5] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144
  • [6] Elastic Monte Carlo Tree Search
    Xu, Linjie
    Dockhorn, Alexander
    Perez-Liebana, Diego
    [J]. IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 527 - 537
  • [7] Monte Carlo Tree Search in Hex
    Arneson, Broderick
    Hayward, Ryan B.
    Henderson, Philip
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) : 251 - 258
  • [8] Monte Carlo tree search in Kriegspiel
    Ciancarini, Paolo
    Favini, Gian Piero
    [J]. ARTIFICIAL INTELLIGENCE, 2010, 174 (11) : 670 - 684
  • [9] Monte Carlo Tree Search for Quoridor
    Respall, Victor Massague
    Brown, Joseph Alexander
    Aslam, Hamna
    [J]. 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION (GAME-ON(R) 2018), 2018, : 5 - 9
  • [10] An Analysis of Monte Carlo Tree Search
    James, Steven
    Konidaris, George
    Rosman, Benjamin
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3576 - 3582