Hierarchical ConViT with Attention-Based Relational Reasoner for Visual Analogical Reasoning

نویسندگان

چکیده

Raven’s Progressive Matrices (RPMs) have been widely used to evaluate the visual reasoning ability of humans. To tackle challenges perception and logic on RPMs, we propose a Hierarchical ConViT with Attention-based Relational Reasoner (HCV-ARR). Traditional solution methods often apply relatively shallow convolution networks visually perceive shape patterns in RPM images, which may not fully model long-range dependencies complex pattern combinations RPMs. The proposed consists convolutional block capture low-level attributes patterns, transformer high-level image semantics such as formations. Furthermore, hierarchical captures features from multiple receptive fields, where layers focus fine details while deeper semantics. better underlying rules embedded an (ARR) is establish relations among images. ARR well exploits hidden question images through developed element-wise attentive reasoner. Experimental results three datasets demonstrate that HCV-ARR achieves significant performance gain compared state-of-the-art models. source code available at: https://github.com/wentaoheunnc/HCV-ARR.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analogical Reasoning with Relational Bayesian Sets

Analogical reasoning depends fundamentally on the ability to learn and generalize about relations between objects. There are many ways in which objects can be related, making automated analogical reasoning very challenging. Here we develop an approach which, given a set of pairs of related objects S = {A:B, A:B, . . . , A :B}, measures how well other pairs A:B fit in with the set S. This addres...

متن کامل

Hierarchical Selectivity for Object-Based Visual Attention

This paper presents a novel “hierarchical selectivity” mechanism for object-based visual attention. This mechanism integrates visual salience from bottom-up groupings and the top-down attentional setting. Under its guidance, covert visual attention can shift not only from one grouping to another but also from a grouping to its sub-groupings at a single resolution or multiple varying resolutions...

متن کامل

Incorporating Explanation-Based Generalization with Analogical Reasoning

The EBG system builds an explanation and learns a concept de nition as its generalization provided a domain theory is complete. It does not work when a domain theory is incomplete. Then we introduce a notion of generalizations by an analogy which makes it possible to construct rules necessary for domain theories. Furthermore, we develop EBG by analogical reasoning which copes with the incomplet...

متن کامل

Modeling visual problem solving as analogical reasoning.

We present a computational model of visual problem solving, designed to solve problems from the Raven's Progressive Matrices intelligence test. The model builds on the claim that analogical reasoning lies at the heart of visual problem solving, and intelligence more broadly. Images are compared via structure mapping, aligning the common relational structure in 2 images to identify commonalities...

متن کامل

Reputation with Analogical Reasoning∗

We consider a repeated interaction between a long-run player and a sequence of short-run players, in which the long-run player may either be rational or may be a mechanical type who plays the same (possibly mixed) action in every stage game. We depart from the classic model, exemplified by Fudenberg and Levine [4, 5], in assuming that the short-run players make inferences by analogical reasonin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i1.25072