Follow
AJ Piergiovanni
AJ Piergiovanni
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Pali: A jointly-scaled multilingual language-image model
X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ...
arXiv preprint arXiv:2209.06794, 2022
4132022
Representation flow for action recognition
AJ Piergiovanni, MS Ryoo
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
1882019
Evolving losses for unsupervised video representation learning
AJ Piergiovanni, A Angelova, MS Ryoo
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
1562020
Tokenlearner: Adaptive space-time tokenization for videos
M Ryoo, AJ Piergiovanni, A Arnab, M Dehghani, A Angelova
Advances in neural information processing systems 34, 12786-12797, 2021
1292021
F-vlm: Open-vocabulary object detection upon frozen vision and language models
W Kuo, Y Cui, X Gu, AJ Piergiovanni, A Angelova
arXiv preprint arXiv:2209.15639, 2022
1112022
Assemblenet: Searching for multi-stream neural connectivity in video architectures
MS Ryoo, AJ Piergiovanni, M Tan, A Angelova
arXiv preprint arXiv:1905.13209, 2019
1112019
Learning latent super-events to detect multiple activities in videos
AJ Piergiovanni, MS Ryoo
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
1032018
Tokenlearner: What can 8 learned tokens do for images and videos?
MS Ryoo, AJ Piergiovanni, A Arnab, M Dehghani, A Angelova
arXiv preprint arXiv:2106.11297, 2021
1022021
Temporal gaussian mixture layer for videos
AJ Piergiovanni, M Ryoo
International Conference on Machine learning, 5152-5161, 2019
992019
Fine-grained activity recognition in baseball videos
AJ Piergiovanni, MS Ryoo
Proceedings of the ieee conference on computer vision and pattern …, 2018
882018
Pali-x: On scaling up a multilingual vision and language model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
arXiv preprint arXiv:2305.18565, 2023
862023
Evolving space-time neural architectures for videos
AJ Piergiovanni, A Angelova, A Toshev, MS Ryoo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
772019
Learning latent subevents in activity videos using temporal attention filters
A Piergiovanni, C Fan, M Ryoo
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
632017
4d-net for learned multi-modal alignment
AJ Piergiovanni, V Casser, MS Ryoo, A Angelova
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
592021
Attentionnas: Spatiotemporal attention cell search for video classification
X Wang, X Xiong, M Neumann, AJ Piergiovanni, MS Ryoo, A Angelova, ...
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
522020
Tiny video networks
AJ Piergiovanni, A Angelova, MS Ryoo
Applied AI Letters 3 (1), e38, 2022
512022
Assemblenet++: Assembling modality representations via attention connections
MS Ryoo, AJ Piergiovanni, J Kangaspunta, A Angelova
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
472020
Learning real-world robot policies by dreaming
AJ Piergiovanni, A Wu, MS Ryoo
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019
402019
Rethinking video vits: Sparse video tubes for joint image and video learning
AJ Piergiovanni, W Kuo, A Angelova
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
392023
Avid dataset: Anonymized videos from diverse countries
AJ Piergiovanni, M Ryoo
Advances in Neural Information Processing Systems 33, 16711-16721, 2020
372020
The system can't perform the operation now. Try again later.
Articles 1–20