Index structures;
efficient query processing;
genomic data management;
VARIABLE-LENGTH QUERIES;
D O I:
10.1109/TKDE.2018.2871031
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
One-dimensional intervals incremental inverted index (Di4) is amulti-resolution, single-dimension indexing framework for efficient, scalable, and extensible computation of genomic interval expressions. The framework has a tri-layer architecture: the semantic layer provides orthogonal and genericmeans (including the support of user-defined function) of sense-making and higher-lever reasoning fromregion-based datasets; the logical layer provides building blocks for region calculus and topological relations between intervals; the physical layer abstracts from persistence technology and makes the model adaptable to variety of persistence technologies, spanning from small-scale (e.g., B+tree) to large-scale (e.g., LevelDB). The extensibility of Di4 to application scenarios is shown with an example of comparative evaluation of ChIP-seq and DNase-Seq replicates. Performance of Di4 is benchmarked for small and large scale scenarios under common bioinformatics application scenarios. Di4 is freely available from https://genometric.github.io/Di4.
机构:
Univ Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USAUniv Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USA
Nix, David A.
Di Sera, Tonya L.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USAUniv Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USA
Di Sera, Tonya L.
Dalley, Brian K.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USAUniv Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USA
Dalley, Brian K.
Milash, Brett A.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USAUniv Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USA
Milash, Brett A.
Cundick, Robert M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USAUniv Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USA
Cundick, Robert M.
Quinn, Kevin S.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USAUniv Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USA
Quinn, Kevin S.
Courdy, Samir J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USAUniv Utah, Huntsman Canc Inst, Dept Oncol Sci, Salt Lake City, UT 84112 USA