A step towards sequence-to-sequence alignment

Cited by: 0

Authors:

Caspi, Y [1]

Irani, M [1]

Affiliation:

[1] Weizmann Inst Sci, Dept Comp Sci & Appl Math, IL-76100 Rehovot, Israel

Source:

IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, VOL II | 2000

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents an approach for establishing correspondences in time and in space between two different video sequences of the same dynamic scene, recorded by stationary uncalibrated video cameras. The method simultaneously estimates both the spatial alignment and the temporal synchronization (temporal alignment) between the two sequences, using all available spatio-temporal information. Temporal variations between image frames (such as moving objects or changes in scene illumination) are powerful cues for alignment that cannot be exploited by standard image-to-image alignment techniques. We show that by folding spatial and temporal cues into a single alignment framework, situations that are inherently ambiguous for traditional image-to-image alignment methods are often uniquely resolved by sequence-to-sequence alignment. We also present a "direct" method for sequence-to-sequence alignment. The algorithm estimates the spatial and temporal alignment parameters simultaneously and directly from measurable sequence quantities, without requiring prior estimation of point correspondences, frame correspondences, or moving object detection. Results are shown on real image sequences taken by multiple video cameras.
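
To make the idea of joint spatial and temporal alignment concrete, the following toy sketch searches over an integer frame offset and an integer pixel shift at the same time, scoring each candidate by the mean squared intensity difference over the overlapping space-time volume. It is only an illustration under strong assumptions: it uses exhaustive search over pure translations, not the direct estimation method the abstract describes, and every name in it (`scene`, `render`, `mean_sq_error`, `align`) as well as the synthetic intensity pattern is hypothetical.

```python
def scene(t, x, y):
    # Hypothetical synthetic space-time intensity pattern (values 0..10).
    # The x*t term makes the pattern change over time in a way that a
    # purely spatial shift cannot reproduce.
    return (7 * x + 13 * y + 3 * t * t + x * t) % 11


def render(num_frames, height, width, dt=0, dx=0, dy=0):
    # Build a sequence indexed as seq[t][y][x], viewed with a time offset
    # dt and a spatial shift (dx, dy) relative to the reference view.
    return [[[scene(t + dt, x + dx, y + dy) for x in range(width)]
             for y in range(height)]
            for t in range(num_frames)]


def mean_sq_error(seq_a, seq_b, dt, dx, dy):
    # Compare seq_b[t][y][x] with seq_a[t + dt][y + dy][x + dx] over the
    # part of the space-time volume where both samples exist.
    frames, height, width = len(seq_a), len(seq_a[0]), len(seq_a[0][0])
    total, count = 0.0, 0
    for t in range(frames):
        for y in range(height):
            for x in range(width):
                ta, ya, xa = t + dt, y + dy, x + dx
                if 0 <= ta < frames and 0 <= ya < height and 0 <= xa < width:
                    diff = seq_a[ta][ya][xa] - seq_b[t][y][x]
                    total += diff * diff
                    count += 1
    return total / count if count else float("inf")


def align(seq_a, seq_b, max_dt, max_shift):
    # Exhaustive joint search over temporal and spatial offsets.
    best = None
    for dt in range(-max_dt, max_dt + 1):
        for dx in range(-max_shift, max_shift + 1):
            for dy in range(-max_shift, max_shift + 1):
                err = mean_sq_error(seq_a, seq_b, dt, dx, dy)
                if best is None or err < best[0]:
                    best = (err, dt, dx, dy)
    return best


seq_a = render(8, 8, 8)                      # reference sequence
seq_b = render(8, 8, 8, dt=2, dx=1, dy=-1)   # same scene, offset in time and space
err, dt, dx, dy = align(seq_a, seq_b, max_dt=3, max_shift=2)
print(err, dt, dx, dy)
```

Because the error is accumulated over the whole space-time volume rather than a single frame pair, the temporal offset and the spatial shift are recovered together in one search.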

Pages: 682-689

Page count: 8
