News

Owing to the effectiveness in selectively focusing on regions of interest of images, the attention mechanism has been widely used in image caption task, which can provide more accurate image ...
Visual grounding tasks aim to localize image regions based on natural language references. In this work, we ex-plore whether generative VLMs predominantly trained on image-text data could be leveraged ...
Qiao, N. and Ye, S. (2025) Research on the Path of Digital Hoarding Behavior among Digital Natives Based on the I-PACE Model in China. Open Journal of Applied Sciences, 15, 1793-1816. doi: ...