Pose for Everything: Towards Category-Agnostic Pose Estimation

نویسندگان

چکیده

Existing works on 2D pose estimation mainly focus a certain category, e.g. human, animal, and vehicle. However, there are lots of application scenarios that require detecting the poses/keypoints unseen class objects. In this paper, we introduce task Category-Agnostic Pose Estimation (CAPE), which aims to create model capable any object given only few samples with keypoint definition. To achieve goal, formulate problem as matching design novel CAPE framework, termed POse Matching Network (POMNet). A transformer-based Keypoint Interaction Module (KIM) is proposed capture both interactions among different keypoints relationship between support query images. We also Multi-category (MP-100) dataset, dataset 100 categories containing over 20K instances well-designed for developing algorithms. Experiments show our method outperforms other baseline approaches by large margin. Codes data available at https://github.com/luminxu/Pose-for-Everything .

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards global consistent pose estimation from images

We propose a method for making globally consistent pose estimates using vision. Lu and Milios described an approach where the links between poses were estimated using range scanner data. When using point correspondences the length of the links cannot be estimated and the approach of Lu and Milios has to be modified. First we use the nonlinear orientation part of the pose differences to obtain a...

متن کامل

Towards Viewpoint Invariant 3D Human Pose Estimation

We propose a viewpoint invariant model for 3D human pose estimation from a single depth image. To achieve this, our discriminative model embeds local regions into a learned viewpoint invariant feature space. Formulated as a multi-task learning problem, our model is able to selectively predict partial poses in the presence of noise and occlusion. Our approach leverages a convolutional and recurr...

متن کامل

Joint Object Category and 3D Pose Estimation from 2D Images

2D object detection is the task of finding (i) what objects are present in an image and (ii) where they are located, while 3D pose estimation is the task of finding the pose of these objects in 3D space. State-of-the-art methods for solving these tasks follow a two-stage approach where a 3D pose estimation system is applied to bounding boxes (with associated category labels) returned by a 2D de...

متن کامل

Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation

Human pose estimation is the task of estimating the ‘pose’ or configuration of a person’s body parts e.g. labeling the position and orientation of the head, torso, arms and legs in an image. In this paper we propose an extension of the pictorial structure model (PSM) approach [2]. Our method incorporates richer models of appearance and prior over pose without introducing unacceptable computatio...

متن کامل

Towards real-time body pose estimation for presenters in meeti

This paper describes a computer vision-based approach to body pose estimation. The algorithm can be executed in real-time and processes low resolution, monocular image sequences. A silhouette is extracted and matched against a projection of a 16 DOF human body model. In addition, skin color is used to locate hands and head. No detailed human body model is needed. We evaluate the approach both q...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-20068-7_23