Selected Publications (Full List)
NAME1 indicates co-first author. indicates corresponding author.
-
5. StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
arXiv 2024.08
-
4. DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
arXiv 2024.08
-
3. CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
arXiv 2024.08
-
2. ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation
arXiv 2024.05
Tech Report
-
65. LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
NeurIPS 2024
-
63. StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
SIGGRAPH Asia 2024
-
61. DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
ECCV 2024 (Oral)
-
60. Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
ECCV 2024
-
59. Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
-
58. OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
ECCV 2024
-
57. MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
ECCV 2024
-
56. Identity-Preserving Face Swapping via Dual Surrogate Generative Models
Transactions on Graphics 2024
-
55. VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
CVPR 2024
-
54. EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
CVPR 2024
-
52. ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024 (Spotlight)
-
2024
-
46. TaleCrafter: Interactive Story Visualization with Multiple Characters
SIGGRAPH Asia Conference 2023
-
45. FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
IEEE International Conference on Computer Vision (ICCV) 2023 (Oral)
-
44. ToonTalker: Cross-Domain Face Reenactment
IEEE International Conference on Computer Vision (ICCV) 2023
-
43. UCF: Uncovering Common Features for Generalizable Deepfake Detection
IEEE International Conference on Computer Vision (ICCV) 2023
-
42. Domain Generalization via Rationale Invariance
IEEE International Conference on Computer Vision (ICCV) 2023
-
41. Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
IEEE International Conference on Computer Vision (ICCV) 2023
-
39. Disentanglement of Pose and Expression for General Video Portrait Editing
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
38. Fine-Grained Face Swapping via Regional GAN Inversion
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
37. Improved Test-Time Adaptation for Domain Generalization
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
36. 3D GAN Inversion with Facial Symmetry Prior
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
35. SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
34. Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023 (Highlight)
-
33. Generating Human Motion from Textual Descriptions with High Quality Discrete Representation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
32. High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
31. High-fidelity Clothed Avatar Reconstruction from a Single Image
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023
-
30. CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying
AAAI Conference on Artificial Intelligence (AAAI) 2023 (Oral)
2023
-
29. OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training
Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS) 2022
-
28. Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation
Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS) 2022
-
27. VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
SIGGRAPH Asia (Conference Track) 2022
-
26. Generalizable Black-Box Adversarial Attack with Meta Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022
-
25. StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN
European Conference on Computer Vision (ECCV) 2022
-
24. Prior-Guided Adversarial Initialization for Fast Adversarial Training
European Conference on Computer Vision (ECCV) 2022
-
23. Boosting Fast Adversarial Training with Learnable Adversarial Initialization
IEEE Transactions on Image Processing (TIP) 2022
-
22. FENeRF: Face Editing in Neural Radiance Fields
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
-
21. LAS-AT: Adversarial Training with Learnable Attack Strategy
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Oral)
-
20. High-Fidelity GAN Inversion for Image Attribute Editing
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022
-
19. Self-supervised Learning of Adversarial Examples: Towards Good Generalizations for DeepFake Detections
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (Oral)
-
18. Image Inpainting with Local and Global Refinement
IEEE Transactions on Image Processing (TIP) 2022
2022
-
16. DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis
IEEE International Conference on Computer Vision (ICCV) 2021
-
15. Meta-Attack: Class-agnostic and Model-agnostic Physical Adversarial Attack
IEEE International Conference on Computer Vision (ICCV) 2021
-
14. Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021
-
13. Generalizing Face Forgery Detection with High-frequency Features
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021
-
12. Targeted Attack Against Deep Neural Networks via Flipping Limited Weight Bits
International Conference on Learning Representations (ICLR) 2021
2021
-
11. Sparse Adversarial Attack via Perturbation Factorization
European Conference on Computer Vision (ECCV) 2020
2020
-
8. Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
-
7. Compressing Convolutional Neural Networks via Factorized Convolutional Filters
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
-
6. Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
2019
-
5. Weakly-Supervised CNN Learning for Facial Action Unit Intensity Estimation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018
-
4. Bilateral Ordinal Relevance Multi-instance Regression for Facial Action Unit Intensity Estimation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018
-