Biography

Hi, I am a student at Tsinghua University. My research currently focuses on multi-modal learning and generative models, with the aim of understanding and generatively modeling the physical world. I am particularly interested in how these models can be applied to complex tasks such as visual reasoning and content generation. As my technical expertise has deepened, I have come to focus less on the number of papers and more on rethinking problems and offering simple, effective solutions. If you are interested in collaborating or have use cases you would like to share, please feel free to contact me!

Email: zachxu.thu(at)foxmail.com

[Publication]

News
  • Two papers were accepted to ICCV 2025.
  • I received an Outstanding Reviewer award at CVPR 2025.
  • Three papers were accepted to CVPR 2025.
  • One paper was accepted to AAAI 2025.
  • One paper was accepted to EMNLP 2024.
  • We won 3rd prize in the CVPR 2024 OVD challenge.
  • Two papers were accepted to ICML 2024.
  • One paper was accepted to NeurIPS 2024.
  • One paper was accepted to IJCAI 2024.
  • One paper was accepted to ACMMM 2024.
  • One paper was accepted to ICASSP 2024.
  • One paper was accepted to AAAI 2024.
  • One paper was accepted to ICCV 2023.

Research Interest
  Computer Vision
    • Complex Visual Reasoning, including referring image segmentation, visual grounding, semantic segmentation, and general object detection.
    • Generative Models, including image/video generation and 3D motion synthesis.
  Machine Learning
    • Efficient Transfer Learning
    • Self-supervised Learning

Academic Services
  Program Committee Member/Reviewer for
    • Neural Information Processing Systems (NeurIPS)
    • International Conference on Machine Learning (ICML)
    • International Conference on Learning Representations (ICLR)
    • IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
    • IEEE/CVF International Conference on Computer Vision (ICCV)
    • IEEE Transactions on Image Processing (TIP)
    • IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
    • Association for Computational Linguistics (ACL)
    • Conference on Empirical Methods in Natural Language Processing (EMNLP)
    • ACM International Conference on Multimedia (ACMMM)

Selected Honors & Awards
  • 2025 Outstanding Graduate of Beijing
  • 2025 Golden Award in International Exhibition of Inventions Geneva
  • 2024 Graduate National Scholarship
  • 2024 Tencent Rhino-Bird Research Elite
  • 2023 Outstanding Graduate of SYSU
  • 2022 Golden Award in International Genetically Engineered Machine Competition
  • 2021 Golden Award in ACM/ICPC Competition
  • 2020 Undergraduate National Scholarship