Semi-Structured Object Sequence Encoders

被引:0
|
作者
Murthy, Rudra, V [1 ]
Bhat, Riyaz [1 ]
Gunasekara, Chulaka [1 ]
Patel, Siva Sankalp [1 ]
Wan, Hui [1 ]
Dhamecha, Tejas Indulal [2 ]
Contractor, Danish [1 ]
Danilevsky, Marina [1 ]
机构
[1] IBM Res AI, Armonk, NY 10504 USA
[2] Microsoft India Dev Ctr, Hyderabad, India
关键词
COMPLEMENTARY LEARNING-SYSTEMS; CONNECTIONIST MODELS; MEMORY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we explore the task of modeling semi-structured object sequences; in particular, we focus our attention on the problem of developing a structure-aware input representation for such sequences. Examples of such data include user activity on websites, machine logs, and many others. This type of data is often represented as a sequence of sets of keyvalue pairs over time and can present modeling challenges due to an ever-increasing sequence length thereby affecting the quality of the representation. We propose a two-part approach, which first considers each key independently and encodes a representation of its values over time; we then self-attend over these value-aware key representations to accomplish a downstream task. This allows us to learn better representation while being able to operate on longer object sequences than existing methods. We introduce a novel shared-attentionhead architecture between the two modules and present an innovative training schedule that interleaves the training of both modules with shared weights for some attention heads.1 Our experiments on multiple prediction tasks using real-world data demonstrate that our approach outperforms a unified network with hierarchical encoding, as well as other methods including a record-centric representation and a flattened representation of the sequence.
引用
收藏
页码:9026 / 9039
页数:14
相关论文
共 50 条
  • [21] WICCAO: From semi-structured data to structured data
    Li, Z
    Ng, WK
    11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOP ON THE ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 2004, : 86 - 93
  • [22] Semantic annotation of semi-structured documents
    Ranganathan, Girish R.
    Biletskiy, Yevgen
    Kaltchenko, Alexey
    2008 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-4, 2008, : 877 - +
  • [23] On the information content of semi-structured databases
    Levene, Mark
    Acta Cybernetica, 1998, 13 (03): : 257 - 275
  • [24] Survey on Mining in Semi-Structured Data
    Shettar, Rajashree
    Shobha, G.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (08): : 226 - 231
  • [25] Mastering the Semi-Structured Interview and Beyond
    Shirani, Fiona
    QUALITATIVE RESEARCH, 2015, 15 (01) : 136 - 137
  • [26] Conducting semi-structured interviews with the Vietnamese
    Thi Quynh Trang Nguyen
    QUALITATIVE RESEARCH JOURNAL, 2015, 15 (01) : 35 - 46
  • [27] Multigrid Methods on Semi-Structured Grids
    Rodrigo, Carmen
    Gaspar, Francisco J.
    Lisbona, Francisco J.
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2012, 19 (04) : 499 - 538
  • [28] A Survey on the Semi-Structured Data Models
    Chakraborty, Supriya
    Chaki, Nabendu
    COMPUTER INFORMATION SYSTEMS - ANALYSIS AND TECHNOLOGIES, 2011, 245 : 257 - +
  • [29] Adaptive retrieval of semi-structured data
    Ben-Asher, Yosi
    Berkovsky, Shlomo
    Busetta, Paolo
    Eytani, Yaniv
    Jbara, Sadek
    Kuflik, Tsvi
    ADAPTIVE HYPERMEDIA AND ADAPTIVE WEB-BASED SYSTEMS, 2008, 5149 : 32 - +
  • [30] AUSMS: An environment for frequent sub-structures extraction in a semi-structured object collection
    Laur, PA
    Teisseire, M
    Poncelet, P
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, 2736 : 38 - 45