Longteng Guo
Longteng Guo
Other names郭龙腾
ByteDance; Ph.D, Institute of Automation of the Chinese Academy Sciences (CASIA)
Verified email at - Homepage
Cited by
Cited by
Normalized and geometry-aware self-attention network for image captioning
L Guo, J Liu, X Zhu, P Yao, S Lu, H Lu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
Mscap: Multi-style image captioning with unpaired stylized text
L Guo, J Liu, P Yao, J Li, H Lu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cptr: Full transformer network for image captioning
W Liu, S Chen, L Guo, X Zhu, J Liu
arXiv preprint arXiv:2101.10804, 2021
Aligning linguistic words and visual semantic units for image captioning
L Guo, J Liu, J Tang, J Li, W Luo, H Lu
Proceedings of the 27th ACM international conference on multimedia, 765-773, 2019
Show, tell, and polish: Ruminant decoding for image captioning
L Guo, J Liu, S Lu, H Lu
IEEE Transactions on Multimedia 22 (8), 2149-2162, 2019
Non-autoregressive image captioning with counterfactuals-critical multi-agent learning
L Guo, J Liu, X Zhu, X He, J Jiang, H Lu
arXiv preprint arXiv:2005.04690, 2020
Sketch-based image retrieval using generative adversarial networks
L Guo, J Liu, Y Wang, Z Luo, W Wen, H Lu
Proceedings of the 25th ACM international conference on Multimedia, 1267-1268, 2017
OPT: Omni-perception pre-trainer for cross-modal understanding and generation
J Liu, X Zhu, F Liu, L Guo, Z Zhao, M Sun, W Wang, H Lu, S Zhou, J Zhang, ...
arXiv preprint arXiv:2107.00249, 2021
Boosted transformer for image captioning
J Li, P Yao, L Guo, W Zhang
Applied Sciences 9 (16), 3260, 2019
AutoCaption: Image captioning with neural architecture search
X Zhu, W Wang, L Guo, J Liu
arXiv preprint arXiv:2012.09742, 2020
Valor: Vision-audio-language omni-perception pretraining model and dataset
S Chen, X He, L Guo, X Zhu, W Wang, J Tang, J Liu
arXiv preprint arXiv:2304.08345, 2023
Fast sequence generation with multi-agent reinforcement learning
L Guo, J Liu, X Zhu, H Lu
arXiv preprint arXiv:2101.09698, 2021
Vatex video captioning challenge 2020: Multi-view features and hybrid reward strategies for video captioning
X Zhu, L Guo, P Yao, S Lu, W Liu, J Liu
arXiv preprint arXiv:1910.11102, 2019
Image captioning with word gate and adaptive self-critical learning
X Zhu, L Li, J Liu, L Guo, Z Fang, H Peng, X Niu
Applied Sciences 8 (6), 909, 2018
Multi-view features and hybrid reward strategies for vatex video captioning challenge 2019
X Zhu, L Guo, P Yao, J Liu, H Lu, Z Yu, W Liu, H Lu
arXiv preprint arXiv:1910.11102, 2019
MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques
S Chen, X Zhu, D Hao, W Liu, J Liu, Z Zhao, L Guo, J Liu
Proceedings of the 29th ACM International Conference on Multimedia, 4853-4857, 2021
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning
Z Zhao, L Guo, X He, S Shao, Z Yuan, J Liu
arXiv preprint arXiv:2210.04183, 2022
Structure Preserving Convolutional Attention for Image Captioning
S Lu, R Hu, J Liu, L Guo, F Zheng
Applied Sciences 9 (14), 2888, 2019
Modeling Local and Global Contexts for Image Captioning
P Yao, J Li, L Guo, J Liu
2020 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2020
ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu
arXiv preprint arXiv:2305.16103, 2023
The system can't perform the operation now. Try again later.
Articles 1–20