Special Topic: Large Multimodal Models (2025)
SCIS Selected Articles on Large Language Models (LLM)
RESEARCH PAPER
VideoChat: chat-centric video understanding
Li K C, He Y N, Wang Y, et al
Sci China Inf Sci, 2025, 68(10): 200102
Keywords: video understanding; large language model; multi-modality learning; large multimodal models; spatiotemporal perception
Cite as: Li K C, He Y N, Wang Y, et al. VideoChat: chat-centric video understanding. Sci China Inf Sci, 2025, 68(10): 200102, doi: 10.1007/s11432-024-4321-9
LETTER
Progressive language-aware encoding and decoding for referring expression comprehension
Zhao, Yichen; Chen, Yaxiong; Rong, Yi; Xiong, Shengwu
Sci China Inf Sci, 2025, 68(10): 200111
Keywords: referring expression comprehension; vision-and-language; visual grounding; multimodal fusion and reasoning; multimodal transformer
Cite as: Zhao Y C, Chen Y X, Rong Y, et al. Progressive language-aware encoding and decoding for referring expression comprehension. Sci China Inf Sci, 2025, 68(10): 200111, doi: 10.1007/s11432-024-4312-9