📝 Publications [Full Paper]

Vision-Language Models

  • [CVPR 2026] Aligning What Vision-Language Models See and Perceive with Adaptive Information Flow

    Chengxin Liu, Wonseok Choi, Chenshuang Zhang, Tae-Hyun Oh

    IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR 2026)

    [project page]

  • [CVPR 2026] SVHalluc: Benchmarking Speech–Vision Hallucination in Audio-Visual Large Language Models

    Chenshuang Zhang, Kyeong Seon Kim, Chengxin Liu, Tae-Hyun Oh

    IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR 2026)

    [project page]


Object Detection


Object Counting


Segmentation and Tracking


Anime Talking Head