Selected Publications (Full List)

NAME1 indicates co-first author. indicates corresponding author.
  1. 5. StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
    Sjie Zhao, Wenbo Hu, Xiaodong Cun, Yong Zhang, Xiaoyu Li, Zhe Kong, Xiangjun Gao, Muyao Niu, Ying Shan

    arxiv 2024.08

  2. 4. DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
    Wenbo Hu, Xiangjun Gao, Xiaoyu Li, Sijie Zhao, Xiaodong Cun, Yong Zhang, Long Quan, Ying Shan

    arxiv 2024.08

  3. 3. CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
    Tao Wu, Yong Zhang, Xintao Wang, Xianpan Zhou, Guangcong Zheng, Zhongang Qi, Ying Shan, Xi Li

    arxiv 2024.08

  4. 2. ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation
    Shaoshu Yang, Yong Zhang, Xiaodong Cun, Ying Shan, Ran He

    arxiv 2024.05

  5. 1. Videocrafter1: Open diffusion models for high-quality video generation
    Haoxin Chen1, Menghan Xia1, Yinging He1, Yong Zhang1, Xiaodong Cun1, Shaoshu Yang, Jinbo Xing, Yaofang Liu, Qifeng Chen, Xintao Wang, Chao Weng, Ying Shan

    Arxiv 2023

    PDF GitHub stars

Tech Report

  1. 66. CV-VAE: A Compatible Video VAE for Latent Generative Video Models
    Sijie Zhao, Yong Zhang, Xiaodong Cun, Shaoshu Yang, Muyao Niu, Xiaoyu Li, Wenbo Hu, Ying Shan,

    NeurIPS 2024

  2. 65. LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
    Liang Chen, Yong Zhang, Yibing Song, Zhiqiang Shen, Lingqiao Liu

    NeurIPS 2024

  3. 64. ToonCrafter: Generative Cartoon Interpolation
    Jinbo Xing, Hanyuan Liu, Menghan Xia, Yong Zhang, Xintao Wang, Ying Shan, Tien-Tsin Wong

    Siggraph Asia 2024

  4. 63. StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
    Gongye Liu, Menghan Xia, Yong Zhang, Haoxin Chen, Jinbo Xing, Xintao Wang, Yujiu Yang, Ying Shan

    Siggraph Asia 2024

  5. 62. Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
    Yingqing He, Menghan Xia, Haoxin Chen, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng Ying Shan, Qifeng Chen

    ECCV 2024 Workshop

    PDF GitHub stars
  6. 61. Dynamicrafter: Animating open-domain images with video diffusion priors
    Jinbo Xing, Menghan Xia, Yong Zhang, Haoxin Chen, Xintao Wang, Tien-Tsin Wong, Ying Shan

    ECCV 2024 (Oral)

  7. 60. Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
    Qinyu Yang, Haoxin Chen, Yong Zhang, Menghan Xia, Xiaodong Cun, Zhixun Su, Ying Shan

    ECCV 2024

  8. 59. Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
    Lanqing Guo, Yingqing he, Haoxin Chen, Menghan Xia, Xiaodong Cun, Yufei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen

    ECCV 2024

  9. 58. OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
    Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luo

    ECCV 2024

  10. 57. MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
    Muyao Niu, Xiaodong Cun, Xintao Wang, Yong Zhang, Ying Shan, Yinqiang Zheng

    ECCV 2024

  11. 56. Identity-Preserving Face Swapping via Dual Surrogate Generative Models
    Ziyao Huang, Fan Tang, Yong Zhang, Juan Cao, Chengyu Li, Sheng Tang, Jintao Li, Tong-Yee Lee

    Transactions on Graphics 2024

  12. 55. VideoCrafter2 : Overcoming Data Limitations for High-Quality Video Diffusion Models
    Haoxin Chen, Yong Zhang, Xiaodong Cun, Menghan Xia, Xintao Wang, Chao Weng, Ying Shan

    CVPR 2024

  13. 54. EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
    Yaofang Liu, Xiaodong Cun, Xuebo Liu, Xintao Wang, Yong Zhang, Haoxin Chen, Yang Liu, Tieyong Zeng, Raymond Chan, Ying Shan

    CVPR 2024

  14. 53. Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
    Ziyao Huang, Fan Tang, Yong Zhang, Xiaodong Cun, Juan Cao, Jintao Li, Tong-yee Lee

    CVPR 2024

  15. 52. ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
    Yingqing He, Shaoshu Yang, Haoxin Chen, Xiaodong Cun, Menghan Xia, Yong Zhang, Xintao Wang, Ran He, Qifeng Chen, Ying Shan,

    ICLR 2024 (Spotlight)

  16. 51. Freenoise: Tuning-free longer video diffusion via noise rescheduling
    Haonan Qiu, Menghan Xia, Yong Zhang, Yingqing He, Xintao Wang, Ying Shan, Ziwei Liu

    ICLR 2024

  17. 50. A Causal Inspired Early-BranchingStructure for Domain Generalization
    Liang Chen, Yong Zhang, Yibing Song, Zhen Zhang, Linqiao Liu

    IJCV 2024

  18. 49. Improving Fast Adversarial Training with Prior-Guided Knowledge
    Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao

    TPAMI 2024

    PDF

2024

  1. 48. Inserting Anybody in Diffusion Models via Celeb Basis
    Ge Yuan, Xiaodong Cun, Yong Zhang, Maomao Li, Xintao Wang, Ying Shan, Huicheng Zheng

    NeurIPS 2023

  2. 47. DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection
    Zhiyuan Yan, Yong Zhang, Xinhang Yuan, Siwei Lyu, Baoyuan Wu

    NeurIPS 2023

  3. 46. TaleCrafter: Interactive Story Visualization with Multiple Characters
    Yuan Gong, Youxin Pang, Xiaodong Cun , Menghan Xia, Haoxin Chen, Longyue Wang, Yong Zhang , Xintao Wang, Ying Shan, Yujiu Yang

    SIGGRAPH Asia Conference 2023

  4. 45. FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
    Chenyang Qi, Xiaodong Cun, Yong Zhang, Chenyang Lei , Xintao Wang, Ying Shan, Qifeng Chen

    IEEE International Conference on Computer Vision (ICCV) 2023 (Oral)

  5. 44. ToonTalker: Cross-Domain Face Reenactment
    Yuan Gong, Yong Zhang, Xiaodong Cun, Fei Yin, Yanbo Fan, Xuan Wang, Baoyuan Wu, Yujiu Yang

    IEEE International Conference on Computer Vision (ICCV) 2023

  6. 43. UCF: Uncovering Common Features for Generalizable Deepfake Detection
    Zhiyuan Yan1, Yong Zhang1, Yanbo Fan, Baoyuan Wu

    IEEE International Conference on Computer Vision (ICCV) 2023

  7. 42. Domain Generalization via Rationale Invariance
    Liang Chen, Yong Zhang, Yibing Song, Anton van den Hengel, Lingqiao Liu

    IEEE International Conference on Computer Vision (ICCV) 2023

  8. 41. Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
    Zunnan Xu, Zhihong Chen, Yong Zhang, Yibing Song, Xiang Wan, Guanbin Li

    IEEE International Conference on Computer Vision (ICCV) 2023

  9. 40. NOFA: NeRF-based One-shot Facial Avatar Reconstruction
    Wangbo Yu, Yanbo Fan, Yong Zhang, Xuan Wang, Fei Yin, Yunpeng Bai, Yanpei Cao, Ying Shan, Yang Wu, Zhongqian Sun, Baoyuan Wu

    SIGGRPAH (Conference Track) 2023

    PDF
  10. 39. Disentanglement of Pose and Expression for General Video Portrait Editing
    Youxin Pang, Yong Zhang, Weize Quan, Yanbo Fan, Xiaodong Cun, Ying Shan, Dong-ming Yan

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  11. 38. Fine-Grained Face Swapping via Regional GAN Inversion
    Zhian Liu1, Maomao Li1, Yong Zhang1, Cairong Wang, Qi Zhang, Jue Wang, Yongwei Nie

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  12. 37. Improved Test-Time Adaptation for Domain Generalization
    Liang Chen, Yong Zhang, Yibing Song, Ying Shan, Lingqiao Liu

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  13. 36. 3D GAN Inversion with Facial Symmetry Prior
    Fei Yin, Yong Zhang, Xuan Wang, Tengfei Wang, Xiaoyu Li, Yuan Gong, Yanbo Fan, Xiaodong Cun, Ying Shan, Cengiz Oztireli, Yujiu Yang

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  14. 35. SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
    Wenxuan Zhang, Xiaodong Cun, Xuan Wang, Yong Zhang, Xi Shen, Yu Guo, Ying Shan, Fei Wang

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  15. 34. Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars
    Jingxiang Sun, Xuan Wang, Lizhen Wang, Xiaoyu Li, Yong Zhang, Hongwen Zhang, Yebin Liu

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023 (Highlight)

  16. 33. Generating Human Motion from Textual Descriptions with High Quality Discrete Representation
    Jianrong Zhang, Yangsong Zhang, Xiaodong Cun, Shaoli Huang, Yong Zhang, Hongwei Zhao, Hongtao Lu, Xi Shen

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  17. 32. High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors
    Yunpeng Bai, Yanbo Fan, Xuan Wang, Yong Zhang, Jingxiang Sun, Chun Yuan, Ying Shan

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  18. 31. High-fidelity Clothed Avatar Reconstruction from a Single Image
    Tingting Liao, Xiaomei Zhang, Yuliang Xiu, Hongwei Yi, Xudong Liu, Guo-Jun Qi, Yong Zhang, Xuan Wang, Xiangyu Zhu, Zhen Lei

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  19. 30. CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying
    Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang

    AAAI Conference on Artificial Intelligence (AAAI) 2023 (Oral)

2023

  1. 29. OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training
    Liang Chen, Yong Zhang, Yibing Song, Jue Wang, Lingqiao Liu

    Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS) 2022.

  2. 28. Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation
    Zeyu Qin, Yanbo Fan, Yi Liu, Li Shen, Yong Zhang, Jue Wang, Baoyuan Wu

    Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS) 2022.

  3. 27. VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
    Kun Cheng, Xiaodong Cun, Yong Zhang, Menghan Xia, Fei Yin, Mingrui Zhu, Xuan Wang, Jue Wang, Nannan Wang

    SIGGRAPH Asia (Conference Track) 2022

  4. 26. Generalizable Black-Box Adversarial Attack with Meta Learning
    Fei Yin1, Yong Zhang1, Baoyuan Wu1, Yan Feng, Jingyi Zhang, Yanbo Fan, Yujiu Yang

    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022

  5. 25. StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
    Fei Yin, Yong Zhang, Xiaodong Cun, Mingdeng Cao, Yanbo Fan, Xuan Wang, Qingyan Bai, Baoyuan Wu, Jue Wang, Yujiu Yang

    European Conference on Computer Vision (ECCV) 2022

  6. 24. Prior-Guided Adversarial Initialization for Fast Adversarial Training
    Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang , Xiaochun Cao

    European Conference on Computer Vision (ECCV) 2022

  7. 23. Boosting Fast Adversarial Training with Learnable Adversarial Initialization
    Xiaojun Jia, Yong Zhang, Baoyuan Wu, Jue Wang, Xiaochun Cao

    IEEE Transactions on image processing (TIP) 2022

  8. 22. FENeRF: Face Editing in Neural Radiance Fields
    Jingxiang Sun, Xuan Wang, Yong Zhang, Xiaoyu Li, Qi Zhang, Yebing Liu, Jue Wang

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022

  9. 21. LAS-AT: Adversarial Training with Learnable Attack Strategy
    Xiaojun Jia1, Yong Zhang1, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Oral)

  10. 20. High-Fidelity GAN Inversion for Image Attribute Editing
    Tengfei Wang, Yong Zhang, Yanbo Fan, Jue Wang, Qifeng Chen

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022

  11. 19. Self-supervised Learning of Adversarial Examples: Towards Good Generalizations for DeepFake Detections
    Liang Chen, Yong Zhang, Yibing Song, Lingqiao Liu, Jue Wang

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Oral)

  12. 18. Image Inpainting with Local and Global Refinement
    Weize Quan, Ruisong Zhang, Yong Zhang, Zhifeng Li, Jue Wang, Dong-Ming Yan

    IEEE Transactions on image processing (TIP) 2022

2022

  1. 17. Aesthetic-guided Outward Image Cropping
    Lei Zhong, Feng-Heng Li, Hao-Zhi Huang, Yong Zhang, Shao-Ping Lu, Jue Wang

    ACM Transactions on Graphics (TOG) 2021. (SIGGRPAH ASIA 2021)

    PDF
  2. 16. DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis
    Shulan Ruan, Yong Zhang, Kun Zhang, Yanbo Fan, Fan Tang, Qi Liu, Enhong Chen

    IEEE International Conference on Computer Vision (ICCV) 2021

  3. 15. Meta-Attack: Class-agnostic and Model-agnostic Physical Adversarial Attack
    Weiwei Feng, Baoyuan Wu, Tianzhu Zhang, Yong Zhang, Yongdong Zhang

    IEEE International Conference on Computer Vision (ICCV) 2021

  4. 14. Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
    Gengcong Yang, Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yujiu Yang

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021

  5. 13.Generalizing Face Forgery Detection with High-frequency Features
    Yuchen Luo1, Yong Zhang1, Junchi Yan, Wei Liu

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021

  6. 12. Targeted Attack Against Deep Neural Networks via Flipping Limited Weight Bits
    Jiawang Bai, Baoyuan Wu, Yong Zhang, Yiming Li, Zhifeng Li, Shu-Tao Xia

    International Conference on Learning Representations (ICLR) 2021

2021

  1. 11. Sparse Adversarial Attack via Perturbation Factorization
    Yanbo Fan, Baoyuan Wu, Tuanhui Li, Yong Zhang, Mingyang Li, Zhifeng Li, Yujiu Yang

    European Conference on Computer Vision (ECCV) 2020

  2. 10. Label Error Correction and Generation Through Label Relationships
    Zijun Cui, Yong Zhang, Qiang Ji

    The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI) 2020

    PDF

2020

  1. 9. Context-Aware Feature and Label Fusion for Facial Action Unit Intensity Estimation With Partially Labeled Data
    Yong Zhang, Haiyong Jiang, Baoyuan Wu, Yanbo Fan, Qiang Ji

    IEEE International Conference on Computer Vision (ICCV) 2019

    PDF
  2. 8. Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation
    Yong Zhang, Baoyuan Wu, Weiming Dong, Zhifeng Li, Wei Liu, Bao-Gang Hu, Qiang Ji

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019

  3. 7. Compressing Convolutional Neural Networks via Factorized Convolutional Filters
    Tuanhui Li, Baoyuan Wu, Yujiu Yang, Yanbo Fan, Yong Zhang, Wei Liu

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019

  4. 6. Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables
    Yan Xu, Baoyuan Wu, Fumin Shen, Yanbo Fan, Yong Zhang, Heng Tao Shen Wei Liu

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019

2019

  1. 5. Weakly-Supervised CNN Learning for Facial Action Unit Intensity Estimation
    Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018

  2. 4. Bilateral Ordinal Relevance Multi-instance Regression for Facial Action Unit Intensity Estimation
    Yong Zhang, Rui Zhao, Weiming Dong, Bao-Gang Hu, Qiang Ji

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018

  3. 3. Classifier Learning With Prior Probabilities for Facial Action Unit Recognition
    Yong Zhang, Weiming Dong, Bao-Gang Hu, Qiang Ji

    IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018

    PDF
  4. 2. Data-driven Synthesis of Cartoon Faces Using Different Styles
    Yong Zhang, Weiming Dong, Chongyang Ma, Xing Mei, Ke Li, Feiyue Huang, Bao-Gang Hu, Oliver Deussen

    IEEE Transactions on image processing (TIP), 2017

    PDF
  5. 1. Data-driven Face Cartoon Stylization
    Yong Zhang, Weiming Dong, Oliver Deussen, Feiyue Huang, Ke Li, Bao-Gang Hu

    ACM Siggraph Asia Techincal Briefs 2014

    PDF

2018 and before