News
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering - IEEE Xplore
Due to the rich spatio-temporal visual content and complex multimodal relations, Video Question Answering (VideoQA) has become a challenging task and attracted increasing attention. Current methods ...
Video scene detection, an initial step of video analysis, temporally divides heterogeneous video into semantic segments, which is widely used in video summarization, search, browsing and retrieval.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results