News
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering - IEEE Xplore
Due to the rich spatio-temporal visual content and complex multimodal relations, Video Question Answering (VideoQA) has become a challenging task and attracted increasing attention. Current methods ...
Video scene detection, an initial step of video analysis, temporally divides heterogeneous video into semantic segments and is widely used in video summarization, search, browsing, and retrieval.
Unsupervised word embeddings provide rich linguistic and conceptual information about words. However, they may provide weak information about domain-specific semantic relations for certain tasks such ...