Abstract: Unmanned Aerial Vehicle (UAV) networks have important applications in emergency communications, intelligent transport systems, and post-disaster rescue operations. However, constrained by ...
Abstract: Video question answering (VideoQA), a critical task in vision-language understanding and reasoning, encounters significant challenges in integrating visual concepts for compositional ...