News
Abstract: Despite great success across various multimodal tasks, Large Vision-Language Models (LVLMs) often encounter object hallucinations with generated textual responses being inconsistent with the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results