News
To address these challenges, the first visual prompting-based multimodal large language model (MLLM) named EarthMarker is proposed in the RS domain. EarthMarker is capable of interpreting RS imagery ...
Image-goal navigation is a critical task in autonomous visual navigation, requiring the robot to navigate to a target localization specified by an image. Previous works using data-driven methods ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results