共 50 条
- [31] Multimodal understanding for person recognition in video broadcasts 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 607 - 611
- [33] Prosody modeling for automatic speech recognition and understanding MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 105 - 114
- [35] Video Analysis for Human Behavior Understanding EURASIP Journal on Advances in Signal Processing, 2010
- [36] OmniViD: A Generative Framework for Universal Video Understanding 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 18209 - 18220
- [37] Social behavior recognition in continuous video 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1322 - 1329
- [38] Crowd Behavior Recognition for Video Surveillance ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2008, 5259 : 970 - +
- [39] A framework for improved video text detection and recognition Multimedia Tools and Applications, 2014, 69 : 217 - 245
- [40] Video Analytics Framework for Human Action Recognition CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 3841 - 3859