共 50 条
- [1] User interfaces for speech-based retrieval of lecture recordings [J]. ED-MEDIA 2004: World Conference on Educational Multimedia, Hypermedia & Telecommunications, Vols. 1-7, 2004, : 4470 - 4477
- [2] Region-Based Annotation of Digital Photographs [J]. COMPUTATIONAL COLOR IMAGING, 2011, 6626 : 47 - 59
- [3] Temporal Confusion Network for Speech-based Soccer Event Retrieval [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2013, : 549 - 553
- [4] Using catalogue browsing for speech-based interface to a digital library [J]. PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION, 2007, : 130 - +
- [6] Multimodal video search techniques: Late fusion of speech-based retrieval and visual content-based retrieval [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 1048 - 1051
- [7] Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval [J]. INTERSPEECH 2021, 2021, : 2976 - 2980
- [9] Speech-based Class Attendance [J]. 6TH INTERNATIONAL CONFERENCE ON MECHATRONICS (ICOM'17), 2017, 260
- [10] Speech-Based Meaning of Music [J]. PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 385 - 397