News
Abstract: Dataflow visualization systems enable flexible visual data exploration by allowing the user to construct a dataflow diagram that composes query and visualization modules to specify system ...
Abstract: Zero-shot image captioning can harness the knowledge of pre-trained visual language models (VLMs) and language models (LMs) to generate captions for target domain images without paired ...
2023-07-26: We have released our training recipe for real-time AV-ASR, see here. 2023-06-16: We have released our training recipe for AutoAVSR, see here. 2023-03-27: We have released our AutoAVSR ...
This repository contains training and testing codes used in the NeurIPS 2022 paper 'AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments' by Sudipta Paul, Amit K. Roy-Chowdhury, and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results