Publications
#LONG Group Members
2024
Improving Diffusion-based Data Augmentation with Inversion Spherical Interpolation
ArXiv Preprint
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
ArXiv Preprint
DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism
European Conference on Computer Vision (ECCV)
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
European Conference on Computer Vision (ECCV)
Learning Combinatorial Prompts for Universal Controllable Image Captioning
International Journal of Computer Vision (IJCV)
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation
International Journal of Computer Vision (IJCV)
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
2023
Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)