Multi-dimensional interval algebra with symmetry for describing block layouts

被引:0
|
作者
Lahoti, A [1 ]
Singh, R [1 ]
Mukerjee, A [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci, Kanpur 208016, Uttar Pradesh, India
来源
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Describing the relative positions of Rectangular boxes on a page is a fundamental task in document layout processing. Typically, this is achieved by comparing quantitative values of the endpoints of the rectangle. Such a representation expresses a property that is basic for the "interval" as a conjunction of relations for the "point". In this work, we adopt a qualitative interval projection model to describe the relative positions of such blocks using interval algebra, which defines the spatial relation of two points only in terms of precedence, coincidence and post-occurrence. Such relations have not been found very meaningful in document or other media layout contexts since they cannot capture symmetry. In this work, we propose an extension of interval algebra by defining secondary operators (e.g. "centered") which are expressed in terms of basic interval algebra operators. By extending the ordering of intervals to higher dimensions, Multidimensional Interval Algebra can capture the notion of tangency and alignment between blocks while retaining the relative size information. We present several examples from the document domain to show that this information is sufficient to identify the layout of block structured formats. While this representation does not provide any immediate benefit to document analysis per se - the fact that it provides a compact yet complete vocabulary enables its use in abstraction tasks such as learning the grammar of a document sets by studying a series of examples.
引用
收藏
页码:143 / 154
页数:12
相关论文
共 50 条