共 50 条
- [1] Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15546 - 15555Li, Xin论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaWu, Yunfei论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaJiang, Xinghua论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaGuo, Zhihao论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaGong, Mingming论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaCao, Haoyu论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaLiu, Yinsong论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaJiang, Deqiang论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R ChinaSun, Xing论文数: 0 引用数: 0 h-index: 0机构: Tencent YouTu Lab, Shanghai, Peoples R China Tencent YouTu Lab, Shanghai, Peoples R China
- [2] Visual-language foundation models in medicineVISUAL COMPUTER, 2025, 41 (04): : 2953 - 2972Liu, Chunyu论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai Peoples Hosp 6, Sch Med, Dept Endocrinol & Metab, Shanghai, Peoples R China Shanghai Diabet Inst, Shanghai, Peoples R China Shanghai Clin Ctr Diabet, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Artificial Intelligence Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaJin, Yixiao论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Tsinghua Med, 30 Shuangqing Rd, Beijing, Peoples R China Beijing Tsinghua Changgung Hosp, Sch Clin Med, Beijing, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaGuan, Zhouyu论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai Peoples Hosp 6, Sch Med, Dept Endocrinol & Metab, Shanghai, Peoples R China Shanghai Diabet Inst, Shanghai, Peoples R China Shanghai Clin Ctr Diabet, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaLi, Tingyao论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai Peoples Hosp 6, Sch Med, Dept Endocrinol & Metab, Shanghai, Peoples R China Shanghai Diabet Inst, Shanghai, Peoples R China Shanghai Clin Ctr Diabet, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Artificial Intelligence Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaQin, Yiming论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai Peoples Hosp 6, Sch Med, Dept Endocrinol & Metab, Shanghai, Peoples R China Shanghai Diabet Inst, Shanghai, Peoples R China Shanghai Clin Ctr Diabet, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Artificial Intelligence Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaQian, Bo论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai Peoples Hosp 6, Sch Med, Dept Endocrinol & Metab, Shanghai, Peoples R China Shanghai Diabet Inst, Shanghai, Peoples R China Shanghai Clin Ctr Diabet, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Artificial Intelligence Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaJiang, Zehua论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Tsinghua Med, 30 Shuangqing Rd, Beijing, Peoples R China Beijing Tsinghua Changgung Hosp, Sch Clin Med, Beijing, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaWu, Yilan论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Tsinghua Med, 30 Shuangqing Rd, Beijing, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaWang, Xiangning论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Shanghai Peoples Hosp 6, Sch Med, Dept Ophthalmol, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaZheng, Ying Feng论文数: 0 引用数: 0 h-index: 0机构: Zhongshan Ophthalm Ctr, Guangzhou, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R ChinaZeng, Dian论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Tsinghua Med, 30 Shuangqing Rd, Beijing, Peoples R China Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai Belt & Rd Int Joint Lab Intelligent Preve, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
- [3] VTPL: Visual and text prompt learning for visual-language modelsJOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104Sun, Bo论文数: 0 引用数: 0 h-index: 0机构: Beijing Normal Univ, Sch Artificial Intelligence, 19 Xinjiekouwai Rd, Beijing 100875, Peoples R China Beijing Normal Univ, Coll Educ Future, 18 Jinfeng Rd, Zhuhai 519087, Peoples R China Beijing Normal Univ, Sch Artificial Intelligence, 19 Xinjiekouwai Rd, Beijing 100875, Peoples R ChinaWu, Zhichao论文数: 0 引用数: 0 h-index: 0机构: Beijing Normal Univ, Sch Artificial Intelligence, 19 Xinjiekouwai Rd, Beijing 100875, Peoples R China Beijing Normal Univ, Sch Artificial Intelligence, 19 Xinjiekouwai Rd, Beijing 100875, Peoples R ChinaZhang, Hao论文数: 0 引用数: 0 h-index: 0机构: Yunnan Univ, Sch Informat Sci & Engn, 2 Cuihui Bei Rd, Kunming 650091, Peoples R China Beijing Normal Univ, Sch Artificial Intelligence, 19 Xinjiekouwai Rd, Beijing 100875, Peoples R ChinaHe, Jun论文数: 0 引用数: 0 h-index: 0机构: Beijing Normal Univ, Sch Artificial Intelligence, 19 Xinjiekouwai Rd, Beijing 100875, Peoples R China Beijing Normal Univ, Coll Educ Future, 18 Jinfeng Rd, Zhuhai 519087, Peoples R China Beijing Normal Univ, Sch Artificial Intelligence, 19 Xinjiekouwai Rd, Beijing 100875, Peoples R China
- [4] Prompting Visual-Language Models for Efficient Video UnderstandingCOMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 105 - 124Ju, Chen论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R ChinaHan, Tengda论文数: 0 引用数: 0 h-index: 0机构: Univ Oxford, Visual Geometry Grp, Oxford, England Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R ChinaZheng, Kunhao论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R ChinaZhang, Ya论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R ChinaXie, Weidi论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China Univ Oxford, Visual Geometry Grp, Oxford, England Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China
- [5] Context Compression and Extraction: Efficiency Inference of Large Language ModelsADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 221 - 232Zhou, Junyao论文数: 0 引用数: 0 h-index: 0机构: Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaDu, Ruiqing论文数: 0 引用数: 0 h-index: 0机构: Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaTan, Yushan论文数: 0 引用数: 0 h-index: 0机构: Acad Mil Sci Peoples Liberat Army, Beijing 1000000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaYang, Jintao论文数: 0 引用数: 0 h-index: 0机构: Acad Mil Sci Peoples Liberat Army, Beijing 1000000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaYang, Zonghao论文数: 0 引用数: 0 h-index: 0机构: Acad Mil Sci Peoples Liberat Army, Beijing 1000000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaLuo, Wei论文数: 0 引用数: 0 h-index: 0机构: Acad Mil Sci Peoples Liberat Army, Beijing 1000000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaLuo, Zhunchen论文数: 0 引用数: 0 h-index: 0机构: Acad Mil Sci Peoples Liberat Army, Beijing 1000000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaZhou, Xian论文数: 0 引用数: 0 h-index: 0机构: Acad Mil Sci Peoples Liberat Army, Beijing 1000000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R ChinaHu, Wenpeng论文数: 0 引用数: 0 h-index: 0机构: Acad Mil Sci Peoples Liberat Army, Beijing 1000000, Peoples R China Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056000, Peoples R China
- [6] Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 11375 - 11385Huang, Chaoqin论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Shanghai, Peoples R China Natl Univ Singapore, Singapore, Singapore Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai, Peoples R ChinaHan, Aofan论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Shanghai, Peoples R China Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai, Peoples R ChinaFeng, Jinghao论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Shanghai, Peoples R China Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai, Peoples R ChinaZhang, Ya论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Shanghai, Peoples R China Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai, Peoples R ChinaWan, Xinchao论文数: 0 引用数: 0 h-index: 0机构: Natl Univ Singapore, Singapore, Singapore Shanghai Jiao Tong Univ, Shanghai, Peoples R ChinaWang, Yanfeng论文数: 0 引用数: 0 h-index: 0机构: Shanghai Jiao Tong Univ, Shanghai, Peoples R China Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China Shanghai Jiao Tong Univ, Shanghai, Peoples R China
- [7] Geometry-sensitive semantic modeling in visual and visual-language domains for image captioningENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 147Zhu, Wencai论文数: 0 引用数: 0 h-index: 0机构: Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R China Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R ChinaJiang, Zetao论文数: 0 引用数: 0 h-index: 0机构: Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R China Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R ChinaHe, Yuting论文数: 0 引用数: 0 h-index: 0机构: Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R China Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R China
- [8] ViLEM: Visual-Language Error Modeling for Image-Text Retrieval2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11018 - 11027Chen, Yuxin论文数: 0 引用数: 0 h-index: 0机构: Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China ARC Lab, Singapore, Singapore Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaZhang, Zongyang论文数: 0 引用数: 0 h-index: 0机构: Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China ARC Lab, Singapore, Singapore Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaZhang, Ziqi论文数: 0 引用数: 0 h-index: 0机构: Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaQi, Zhongang论文数: 0 引用数: 0 h-index: 0机构: ARC Lab, Singapore, Singapore Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaYuan, Chunfeng论文数: 0 引用数: 0 h-index: 0机构: Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaShan, Ying论文数: 0 引用数: 0 h-index: 0机构: ARC Lab, Singapore, Singapore Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaLi, Bing论文数: 0 引用数: 0 h-index: 0机构: Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaHu, Weiming论文数: 0 引用数: 0 h-index: 0机构: Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaQie, Xiaohu论文数: 0 引用数: 0 h-index: 0机构: Tencent PCG, Shenzhen, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R ChinaWu, JianPing论文数: 0 引用数: 0 h-index: 0机构: Tsinghua Univ, Beijing, Peoples R China Chinese Acad Sci, State Key Lab Multimodal Artificial Intelligence, Inst Automat, Beijing, Peoples R China
- [9] Open-set domain adaptation with visual-language foundation modelsCOMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 250Yu, Qing论文数: 0 引用数: 0 h-index: 0机构: Univ Tokyo, Dept Informat & Commun Engn, Tokyo 1138656, Japan Univ Tokyo, Dept Informat & Commun Engn, Tokyo 1138656, Japan论文数: 引用数: h-index:机构:Aizawa, Kiyoharu论文数: 0 引用数: 0 h-index: 0机构: Univ Tokyo, Dept Informat & Commun Engn, Tokyo 1138656, Japan Univ Tokyo, Dept Informat & Commun Engn, Tokyo 1138656, Japan
- [10] Active Perception for Visual-Language NavigationInternational Journal of Computer Vision, 2023, 131 : 607 - 625Hanqing Wang论文数: 0 引用数: 0 h-index: 0机构: Beijing Institute of Technology,ReLER Lab, Australian Artificial Intelligence InstituteWenguan Wang论文数: 0 引用数: 0 h-index: 0机构: Beijing Institute of Technology,ReLER Lab, Australian Artificial Intelligence InstituteWei Liang论文数: 0 引用数: 0 h-index: 0机构: Beijing Institute of Technology,ReLER Lab, Australian Artificial Intelligence InstituteSteven C. H. Hoi论文数: 0 引用数: 0 h-index: 0机构: Beijing Institute of Technology,ReLER Lab, Australian Artificial Intelligence InstituteJianbing Shen论文数: 0 引用数: 0 h-index: 0机构: Beijing Institute of Technology,ReLER Lab, Australian Artificial Intelligence InstituteLuc Van Gool论文数: 0 引用数: 0 h-index: 0机构: Beijing Institute of Technology,ReLER Lab, Australian Artificial Intelligence Institute