News

The graph below shows the total number of publications each year in Audio-Visual Event Localization and Scene Understanding. References [1] Audio-Visual Segmentation with Semantics .
Visual question answering—answering questions about the contents of a scene or the actions of an object. Image editing and retrieval—removal of an object from an image and discovery of similar ...