Scene graph generation (SGG) aims to understand the visual objects and their semantic relationships from one given image. Until now, lots of SGG datasets with the eyelevel view are released but the SGG dataset with the overhead view is scarcely studied. By contrast to the multiple objects occlusion problem in the eyelevel view, the overhead view helps to clearly perceive the spatial relationships of objects in the ground scene. To fill in the gap of the overhead view dataset, this paper constructs and releases an urban aerial image scene graph generation dataset (UASG). Images from UASG are captured with the low-attitude overhead view. In UASG, 25,594 objects, 16,970 relationships and 27,175 attributes are manually annotated. To address aerial image scene graph generation, this paper proposes one new locality-preserving graph convolutional network (LPG). Different from the traditional graph convolutional network, the convolutional layer in LPG embeds the non-destructive initial features of the objects with the scene information for relationship prediction. To address the problem that there exists an extra-large number of potential object relationship pairs but only a small part of them is meaningful, we propose the adaptive bounding box scaling factor to intelligently prune the meaningless relationship pairs. Extensive experiments on UASG show that our LPG can significantly outperform the state-of-the-art methods and the effectiveness of the proposed locality-preserving strategy.
A large-scale aerial image scene graph generation datasets (ASG), including object annotation, attribute annotation, and relationship annotation.
type | object | relationship | attribute |
---|---|---|---|
number | 25594 | 16970 | 27175 |
原始图片 https://www.alipan.com/s/DSwgv4mnhuy
目标标注 https://drive.google.com/file/d/1ibXXBSxPekuOwmJJEfocVdGkJNhipOff/view?usp=sharing
属性和关系标注 https://drive.google.com/file/d/1wHVMlJaMcv97PZaP8eDFZt7lt4cfNifL/view?usp=sharing
v2.0标注版本软件 https://www.alipan.com/s/cFvhU7jc1Kf
The locality-preserving graph convolutional network (LPG) is at https://github.com/DrugD/AUG.git.
此处可能存在不合适展示的内容,页面不予展示。您可通过相关编辑功能自查并修改。
如您确认内容无涉及 不当用语 / 纯广告导流 / 暴力 / 低俗色情 / 侵权 / 盗版 / 虚假 / 无价值内容或违法国家有关法律法规的内容,可点击提交进行申诉,我们将尽快为您处理。