Abstract: Unmanned aerial vehicles (UAVs) visual localization in planetary aims to estimate the absolute pose of the UAV in the world coordinate system through satellite maps and images captured by on ...
It synthesizes a large-scale SELD dataset designed to include numerous sound event instances and various acoustic environments. It introduces PSELDNets trained on the large-scale synthetic SELD ...
Abstract: Weakly-supervised Temporal Action Localization (WTAL) aims to localize action instances with only video-level labels during training, where two primary issues are localization incompleteness ...