News
Vision-language models (VLMs) are advanced computational techniques designed to process both images and written text and to make predictions accordingly. Among other things, these models could be used to ...
“GULCH Mag” is their new, glossy print offering that shines with in-depth coverage of Atlanta’s visual art and culture scene. Vol. 1 is available now, and Flamming and Hentschel joined "City Lights" ...
With a long creative history, Gaynelle Sloman has kept up with the times in art and more. Her painting, "Some Place in Time," is on view in Columbus.
Creating Connection: The power of audio-visual storytelling lies in its ability to connect with audiences on an emotional level, a crucial element in fostering the empathy and understanding that often ...
Abstract: Human navigation heavily relies on visual information. Although many previous studies have investigated how navigational information is inferred from visual features of scenes, little is ...
Materials for our paper: K. Apostolidis, J. Abesser, L. Cuccovillo, V. Mezaris, "Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol ...
The federally funded Described and Captioned Media Program (DCMP) has developed an 'AI Scene Description Tool' add-on to its video player to increase accessibility to video content for blind students ...
Scene Graph Generation (SGG) is the challenging task of detecting objects and predicting the relationships between them. After DETR was developed, one-stage SGG models based on a one-stage object ...