A Neuro-Symbolic AI System for Visual Question Answering in Pedestrian Video Sequences
- Authors
- Park, Jaeil; Bu, Seok-Jun; Cho, Sung-Bac
- Issue Date
- 2022
- Publisher
- Springer Verlag
- Keywords
- Visual question-answering; Neuro-symbolic reasoning; Scene graph; Pedestrian video
- Citation
- Lecture Notes in Computer Science, v.13469, pp 443 - 454
- Pages
- 12
- Indexed
- SCOPUS
- Journal Title
- Lecture Notes in Computer Science
- Volume
- 13469
- Start Page
- 443
- End Page
- 454
- URI
- https://scholarworks.gnu.ac.kr/handle/sw.gnu/73660
- DOI
- 10.1007/978-3-031-15471-3_38
- ISSN
- 0302-9743
1611-3349
- Abstract
- With the rapid increase in the amount of video data, efficient object recognition is mandatory for a system capable of automatically performing question and answering. In particular, real-world video environments with numerous types of objects and complex relationships require extensive knowledge representation and inference algorithms with the properties and relations of objects. In this paper, we propose a hybrid neuro-symbolic AI system that handles scene-graph of real-world video data. The method combines neural networks that generate scene graphs in consideration of the relationship between objects on real roads and symbol-based inference algorithms for responding to questions. We define object properties, relationships, and question coverage to cover the real-world objects in pedestrian video and traverse a scene-graph to perform complex visual question-answering. We have demonstrated the superiority of the proposed method by confirming that it answered with 99.71% accuracy to 5-types of questions in a pedestrian video environment.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - ETC > Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.