Publications
2025
Taking A Closer Look at Interacting Objects: Interaction-Aware Open Vocabulary Scene Graph Generation
ArXiv Preprint
IterIS: Iterative Inference-Solving Alignment for LoRA Merging
Computer Vision and Pattern Recognition (CVPR)
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
Computer Vision and Pattern Recognition (CVPR)
Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification
Computer Vision and Pattern Recognition (CVPR)
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Computer Vision and Pattern Recognition (CVPR)
DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
International Conference on Learning Representations (ICLR)
CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing
International Conference on Learning Representations (ICLR)
2024
Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing
ArXiv Preprint
DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism
European Conference on Computer Vision (ECCV)
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
European Conference on Computer Vision (ECCV)
Learning Combinatorial Prompts for Universal Controllable Image Captioning
International Journal of Computer Vision (IJCV)
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation
International Journal of Computer Vision (IJCV)
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
2023
Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)