CV & Publications
Education
- B.S. in School of Information and Electronics, Beijing Institute of Technology, 2010-2012
- B.S. in College of Engineering and Computer Science, Australian National University, 2012-2015
- Ph.D in College of Engineering and Computer Science, Australian National University, 2016-2020
Work Experience
- April 2024 - Now: Applied Scientist
- Amazon AGI
- August 2022 - March 2024: Machine Learning Scientist
- Amazon Prime Video
- June 2020 - June 2022: Postdoctoral Researcher
- Facebook AI Research Lab
- September - November 2019: Research Intern
- Mitsubishi Electric Research Lab
- February - November 2018: Research Intern
- Mitsubishi Electric Research Lab
Publications
Now You See Me: Context-Aware Automatic Audio Description
Seon-Ho Lee, Jue Wang, David Fan, Zhikang Zhang, Linda Liu, Xiang Hao, Vimal Bhat, Xinyu Li
Winter Conference on Applications of Computer Vision (WACV) 2025GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning
Yicheng Wang, Zhikang Zhang, Jue Wang, David Fan, Zhenlin Xu, Linda Liu, Xiang Hao, Vimal Bhat, Xinyu Li
Winter Conference on Applications of Computer Vision (WACV) 2025Video token merging for long-form video understanding
Seon Ho Lee, Jue Wang, Zhikang Zhang, David Fan, Xinyu Arthur Li
Conference on Neural Information Processing Systems (NeurIPS) 2024Text-guided video masked autoencoder
David Fan, Jue Wang, Shuai Liao, Zhikang Zhang, Vimal Bhat, Xinyu Li
European Conference on Computer Vision (ECCV) 2024Motion-guided masking for spatiotemporal representation learning
David Fan, Jue Wang, Leo Liao, Yi Zhu, Vimal Bhat, Hector Santos, Rohith Mysore Vijaya Kumar, Xinyu Arthur Li
IEEE International Conference on Computer Vision (ICCV) 2023Selective Structured State-Spaces for Long-Form Video Understanding
Jue Wang, Wentao Zhu, Pichao Wang, Xiang Yu, Linda Liu, Mohamed Omar, Raffay Hamid
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023Deformable video transformer
Jue Wang, Lorenzo Torresani
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022Long-Short Temporal Contrastive Learning of Video Transformers
Jue Wang, Gedas Bertasius, Du Tran, Lorenzo Torresani
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022Generalized One-Class Learning Using Pairs of Complementary Classifiers
Anoop Cherian*, Jue Wang*
*Equal Contribution
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)Learning Log-Determinant Divergences for Positive Definite Matrices
Anoop Cherian, Panagiotis Stanitsas, Jue Wang, Mehrtash T Harandi, Vassilios Morellas, Nikos Papanikolopoulos
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021Spatio-Temporal Ranked-Attention Networks for Video Captioning
Anoop Cherian, Jue Wang, Chiori Hori, Tim Marks
Winter Conference on Applications of Computer Vision (WACV) 2020Discriminative Video Representation Learning Using Support Vector Classifiers
Jue Wang, Anoop Cherian
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)GODS: Generalized One-class Discriminative Subspaces for Anomaly Detection
Jue Wang, Anoop Cherian
IEEE International Conference on Computer Vision (ICCV) 2019End-to-end audio visual scene-aware dialog using multimodal attention-based video features
Chiori Hori, Huda Alamri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019Audio Visual Scene-Aware Dialog
Huda Alamri, Vincent Cartillier, Abhishek Das, Jue Wang, Anoop Cherian, Irfan Essa, Dhruv Batra, Tim K Marks, Chiori Hori, Peter Anderson, Stefan Lee, Devi Parikh
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019Learning discriminative video representations using adversarial perturbations
Jue Wang, Anoop Cherian
15th European Conference on Computer Vision (ECCV) 2018, (Best Paper Finalists)Video Representation Learning Using Discriminative Pooling
Jue Wang, Anoop Cherian, Fatih Porikli, Stephen Gould
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description
Chiori Hori, Takaaki Hori, Gordon Wichern, Jue Wang, Teng-yok Lee, Anoop Cherian, Tim K Marks
IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018Ordered Pooling of Optical Flow Sequences for Action Recognition
Jue Wang, Anoop Cherian, Fatih Porikli
Winter Conference on Applications of Computer Vision (WACV) 2017
Academic Service
- Peer Reviewer for ICCV, CVPR, ECCV, AAAI, ICLR, ICML, NIPS