共 20 条
- [1] VRDU: A Benchmark for Visually-rich Document Understanding [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5184 - 5193
- [2] XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4573 - 4582
- [3] LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 2579 - 2591
- [4] LayerDoc: Layer-wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3599 - 3609
- [6] MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6078 - 6087