Yutong Wang

Ph.D. student, University of Sydney

I am Yutong Wang, a first-year Ph.D. student at the University of Sydney, supervised by Prof. Chang Xu. Before that, I received my M.S. and B.S. degrees from Beijing Institute of Technology, where I was supervised by Prof. Dixin Luo and Prof. Hongteng Xu. My research focuses on machine learning, especially multi-modal learning, optimal transport, and video understanding and generation.


Education
  • University of Sydney

    Ph.D. in Computer Science Sep. 2025 - Present

  • Beijing Institute of Technology

    M.S. in Computer Science Sep. 2022 - Jun. 2025

  • Beijing Institute of Technology

    B.S. in Computer Science Sep. 2018 - Jun. 2022

Experience
  • Shanghai AI Lab

    Research Intern Apr. 2025 - Present

  • Ant Group

    Research Intern Apr. 2024 - Oct. 2024

  • VRC Inc.

    Research Intern Jul. 2023 - Sep. 2023

Selected Publications
Weakly-Supervised Movie Trailer Generation Driven by Multi-Modal Semantic Consistency

Sidan Zhu, Yutong Wang, Hongteng Xu, Dixin Luo† († corresponding author)

Proceedings of the 34th International Joint Conference on Artificial Intelligence, IJCAI 2025 Conference

-

Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency

Yutong Wang, Jiajie Teng, Jiajiong Cao, Yuming Li, Chenguang Ma, Hongteng Xu, Dixin Luo† († corresponding author)

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025 Conference

-

An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation

Yutong Wang*, Sidan Zhu*, Hongteng Xu, Dixin Luo† († corresponding author)

Proceedings of the 32nd ACM International Conference on Multimedia, ACMMM 2024 Conference

-

Self-supervised Video Summarization Guided by Semantic Inverse Optimal Transport

Yutong Wang, Hongteng Xu, Dixin Luo† († corresponding author)

Proceedings of the 31st ACM International Conference on Multimedia, ACMMM 2023 Conference

-

Weakly-Supervised Temporal Action Alignment Driven by Unbalanced Spectral Fused Gromov-Wasserstein Distance

Dixin Luo, Yutong Wang, Angxiao Yue, Hongteng Xu† († corresponding author)

Proceedings of the 30th ACM International Conference on Multimedia, ACMMM 2022 Conference

-
