Probabilistic Document Model for Automated Document Composition

被引:0
|
作者
Damera-Venkata, Niranjan [1 ]
Bento, Jose
O'Brien-Strain, Eamonn [1 ]
机构
[1] Hewlett Packard Labs, Palo Alto, CA 94304 USA
关键词
automated publishing; layout synthesis; variable templates;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a new paradigm for automated document composition based on a generative, unified probabilistic document model (PDM) that models document composition. The model formally incorporates key design variables such as content pagination, relative arrangement possibilities for page elements and possible page edits. These design choices are modeled jointly as coupled random variables (a Bayesian Network) with uncertainty modeled by their probability distributions. The overall joint probability distribution for the network assigns higher probability to good design choices. Given this model, we show that the general document layout problem can be reduced to probabilistic inference over the Bayesian network. We show that the inference task may be accomplished efficiently, scaling linearly with the content in the best case. We provide a useful specialization of the general model and use it to illustrate the advantages of soft probabilistic encodings over hard one-way constraints in specifying design aesthetics.
引用
收藏
页码:3 / 12
页数:10
相关论文
共 50 条
  • [1] Probabilistic document correlation model
    Jia, Xiping
    Peng, Hong
    [J]. CIS WORKSHOPS 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY WORKSHOPS, 2007, : 433 - 436
  • [2] DOCUMENT RETRIEVAL USING A PROBABILISTIC KNOWLEDGE MODEL
    Wang, Shuguang
    Visweswaran, Shyam
    Hauskrecht, Milos
    [J]. KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, : 26 - +
  • [3] A Probabilistic model for compact document topic representation
    Berenyi, Zsolt
    Vajk, Istvan
    [J]. PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON SIMULATION, MODELLING AND OPTIMIZATION, 2009, : 322 - +
  • [4] Analysis of Probabilistic model for Document Retrieval in Information Retrieval
    Tamrakar, Astha
    Vishwakarma, Santosh K.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 760 - 765
  • [5] AUTOMATED DOCUMENT SEGMENTATION
    ZLATOPOLSKY, AA
    [J]. PATTERN RECOGNITION LETTERS, 1994, 15 (07) : 699 - 704
  • [6] Document aging effects and automated security document authentication
    Stolc, Svorad
    Daubner, Franz
    Huber-Moerk, Reinhold
    [J]. 2015 9TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON), 2015, : 347 - 352
  • [7] The missing link - A probabilistic model of document content and hypertext connectivity
    Cohn, D
    Hofmann, T
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 430 - 436
  • [8] Color document image segmentation for automated document entry systems
    Suen, HM
    Wang, JF
    [J]. 1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 131 - 136
  • [9] Automated document content characterization for a multimedia document retrieval system
    Koivusaari, M
    Sauvola, J
    Pietikainen, M
    [J]. MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS II, 1997, 3229 : 148 - 159
  • [10] Automated Text Document Categorization
    Yasotha, R.
    Charles, E. Y. A.
    [J]. 2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INFORMATION SYSTEMS (ICICIS), 2015, : 522 - 528