Longteng Guo

Cited by

	All	Since 2019
Citations	957	953
h-index	12	12
i10-index	13	13

340

170

255

20182019202020212022202320244 9 47 162 263 338 132

Public access

View all

10 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jing Liu 刘静Professor in Institute of Automation of the Chinese Academy Sciences (CASIA)Verified email at nlpr.ia.ac.cn
Xinxin Zhu 朱欣鑫Institute of Automation of the Chinese Academy Sciences (CASIA)Verified email at nlpr.ia.ac.cn
Xingjian HeInstitute of Automation of the Chinese Academy Sciences (CASIA)Verified email at nlpr.ia.ac.cn
Zijia ZhaoInstitute of Automation, Chinese Academy Sciences (CASIA)Verified email at ia.ac.cn
Jinhui Tang (唐金辉)Nanjing University of Science and TechnologyVerified email at acm.org
Shuai ShaoByteDanceVerified email at bytedance.com
Sihan ChenInstitute of Automation, Chinese Academy of SciencesVerified email at nlpr.ia.ac.cn
Zhiwei FangBusiness Growth BU, JD.COMVerified email at jd.com
Jun FuBeijing, China
Hanqing LuNational Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

Longteng Guo

Other names郭龙腾

Associate Professor, Institute of Automation of the Chinese Academy Sciences (CASIA)

Verified email at nlpr.ia.ac.cn - Homepage

Multimodality


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Normalized and geometry-aware self-attention network for image captioning L Guo, J Liu, X Zhu, P Yao, S Lu, H Lu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	222	2020
Cptr: Full transformer network for image captioning W Liu, S Chen, L Guo, X Zhu, J Liu arXiv preprint arXiv:2101.10804, 2021	155	2021
Mscap: Multi-style image captioning with unpaired stylized text L Guo, J Liu, P Yao, J Li, H Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	113	2019
Aligning linguistic words and visual semantic units for image captioning L Guo, J Liu, J Tang, J Li, W Luo, H Lu Proceedings of the 27th ACM international conference on multimedia, 765-773, 2019	111	2019
Valor: Vision-audio-language omni-perception pretraining model and dataset S Chen, X He, L Guo, X Zhu, W Wang, J Tang, J Liu arXiv preprint arXiv:2304.08345, 2023	54	2023
Non-autoregressive image captioning with counterfactuals-critical multi-agent learning L Guo, J Liu, X Zhu, X He, J Jiang, H Lu arXiv preprint arXiv:2005.04690, 2020	50	2020
Show, tell, and polish: Ruminant decoding for image captioning L Guo, J Liu, S Lu, H Lu IEEE Transactions on Multimedia 22 (8), 2149-2162, 2019	46	2019
OPT: Omni-perception pre-trainer for cross-modal understanding and generation J Liu, X Zhu, F Liu, L Guo, Z Zhao, M Sun, W Wang, H Lu, S Zhou, J Zhang, ... arXiv preprint arXiv:2107.00249, 2021	36	2021
Sketch-based image retrieval using generative adversarial networks L Guo, J Liu, Y Wang, Z Luo, W Wen, H Lu Proceedings of the 25th ACM international conference on Multimedia, 1267-1268, 2017	34	2017
Boosted transformer for image captioning J Li, P Yao, L Guo, W Zhang Applied Sciences 9 (16), 3260, 2019	33	2019
Chatbridge: Bridging modalities with large language model as a language catalyst Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu arXiv preprint arXiv:2305.16103, 2023	29	2023
AutoCaption: Image captioning with neural architecture search X Zhu, W Wang, L Guo, J Liu arXiv preprint arXiv:2012.09742, 2020	16	2020
Fast sequence generation with multi-agent reinforcement learning L Guo, J Liu, X Zhu, H Lu arXiv preprint arXiv:2101.09698, 2021	10	2021
Mscap: Multi-style image captioning with unpaired stylized text. 2019 IEEE L Guo, J Liu, P Yao, J Li, H Lu CVF Conference on Computer Vision and Pattern Recognition (CVPR), 4199-4208, 2019	9	2019
Image captioning with word gate and adaptive self-critical learning X Zhu, L Li, J Liu, L Guo, Z Fang, H Peng, X Niu Applied Sciences 8 (6), 909, 2018	6	2018
Mm21 pre-training for video understanding challenge: Video captioning with pretraining techniques S Chen, X Zhu, D Hao, W Liu, J Liu, Z Zhao, L Guo, J Liu Proceedings of the 29th ACM International Conference on Multimedia, 4853-4857, 2021	5	2021
Multi-view features and hybrid reward strategies for vatex video captioning challenge 2019 X Zhu, L Guo, P Yao, J Liu, H Lu, Z Yu, W Liu, H Lu arXiv preprint arXiv:1910.11102, 2019	5	2019
MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling Z Zhao, L Guo, X He, S Shao, Z Yuan, J Liu Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023	4	2023
Mamo: masked multimodal modeling for fine-grained vision-language representation learning Z Zhao, L Guo, X He, S Shao, Z Yuan, J Liu arXiv preprint arXiv:2210.04183, 2022	4	2022
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation W Wang, X He, Y Zhang, L Guo, J Shen, J Li, J Liu IEEE Transactions on Multimedia, 2024	3	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors