Auto-Grouping Emails For Faster E-Discovery

被引:0
|
作者
Joshi, Sachindra [1 ]
Contractor, Danish [1 ]
Ng, Kenney [2 ]
Deshpande, Prasad M. [1 ]
Hampp, Thomas [3 ]
机构
[1] IBM Res, New Delhi, India
[2] IBM Software Grp, New York, NY USA
[3] IBM Software Grp, Frankfurt, Germany
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2011年 / 4卷 / 12期
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we examine the application of various grouping techniques to help improve the efficiency and reduce the costs involved in an electronic discovery process. Specifically, we create coherent groups of email documents which characterize either a syntactic theme, a semantic theme or an email thread. All such grouped documents can be reviewed together leading to a faster and more consistent review of documents. Syntactic grouping of emails is based on near duplicate detection whereas semantic grouping is based on identifying concepts in the email content using information extraction. Email thread detection is achieved using a combination of segmentation and near duplicate detection. We present experimental results on the Enron corpus that suggest that these approaches can significantly reduce the review time and show that high precision and recall in identifying the groups can be achieved. We also describe how these techniques are integrated into the IBM eDiscovery Analyzer product offering.
引用
收藏
页码:1284 / 1294
页数:11
相关论文
共 50 条
  • [41] A Query Recommending Scheme for an Efficient Evidence Search in e-Discovery
    Lee, Heon-min
    Han, Su-bin
    Lee, Taerim
    Shin, Sang Uk
    2014 16TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2014, : 1237 - 1241
  • [42] Digital Archiving and e-Discovery: Delivering Evidence in an age of Overload
    van Bussel, Geert-Jan
    Henseler, Hans
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS MANAGEMENT AND EVALUATION (ICIME 2013), 2013, : 281 - 288
  • [43] Conference on electronic discovery - Panel one: Technical aspects of document production and e-discovery
    Feldman, JE
    Socha, GJ
    Withers, KJ
    FORDHAM LAW REVIEW, 2004, 73 (01) : 23 - 31
  • [44] Replication and Automation of Expert Judgments: Information Engineering in Legal E-Discovery
    Hedin, Bruce
    Oard, Douglas W.
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 102 - +
  • [45] Integrating ILM, E-Discovery and DPA 1998 for Effective Information Processing
    Kesarwani, Anshul
    Gupta, Chandani
    Tripathi, Manas Mani
    Gupta, Vishnu
    Gupta, Rahul
    Chaurasiya, Vijay K.
    COMPUTER NETWORKS AND INTELLIGENT COMPUTING, 2011, 157 : 60 - 68
  • [46] E-discovery in New Zealand: The impact of the new High Court Rules
    Garrie, Daniel B.
    Harvey, Judge David
    CIVIL JUSTICE QUARTERLY, 2012, 31 (03): : 305 - 317
  • [47] Discovery-led refinement in e-discovery investigations: sensemaking, cognitive ergonomics and system design
    Attfield, Simon
    Blandford, Ann
    ARTIFICIAL INTELLIGENCE AND LAW, 2010, 18 (04) : 387 - 412
  • [48] E-Discovery in Investment Treaty Arbitration: Practice, Procedures, Challenges and Opportunities
    Shirlow, Esme
    JOURNAL OF INTERNATIONAL DISPUTE SETTLEMENT, 2020, 11 (04): : 549 - 588
  • [49] E-Discovery revisited: the need for artificial intelligence beyond information retrieval
    Conrad, Jack
    ARTIFICIAL INTELLIGENCE AND LAW, 2010, 18 (04) : 321 - 345
  • [50] Network-based filtering for large email collections in E-Discovery
    Henseler, Hans
    ARTIFICIAL INTELLIGENCE AND LAW, 2010, 18 (04) : 413 - 430