Pose for Everything: Towards Category-Agnostic Pose Estimation
نویسندگان
چکیده
Existing works on 2D pose estimation mainly focus a certain category, e.g. human, animal, and vehicle. However, there are lots of application scenarios that require detecting the poses/keypoints unseen class objects. In this paper, we introduce task Category-Agnostic Pose Estimation (CAPE), which aims to create model capable any object given only few samples with keypoint definition. To achieve goal, formulate problem as matching design novel CAPE framework, termed POse Matching Network (POMNet). A transformer-based Keypoint Interaction Module (KIM) is proposed capture both interactions among different keypoints relationship between support query images. We also Multi-category (MP-100) dataset, dataset 100 categories containing over 20K instances well-designed for developing algorithms. Experiments show our method outperforms other baseline approaches by large margin. Codes data available at https://github.com/luminxu/Pose-for-Everything .
منابع مشابه
Towards global consistent pose estimation from images
We propose a method for making globally consistent pose estimates using vision. Lu and Milios described an approach where the links between poses were estimated using range scanner data. When using point correspondences the length of the links cannot be estimated and the approach of Lu and Milios has to be modified. First we use the nonlinear orientation part of the pose differences to obtain a...
متن کاملTowards Viewpoint Invariant 3D Human Pose Estimation
We propose a viewpoint invariant model for 3D human pose estimation from a single depth image. To achieve this, our discriminative model embeds local regions into a learned viewpoint invariant feature space. Formulated as a multi-task learning problem, our model is able to selectively predict partial poses in the presence of noise and occlusion. Our approach leverages a convolutional and recurr...
متن کاملJoint Object Category and 3D Pose Estimation from 2D Images
2D object detection is the task of finding (i) what objects are present in an image and (ii) where they are located, while 3D pose estimation is the task of finding the pose of these objects in 3D space. State-of-the-art methods for solving these tasks follow a two-stage approach where a 3D pose estimation system is applied to bounding boxes (with associated category labels) returned by a 2D de...
متن کاملClustered Pose and Nonlinear Appearance Models for Human Pose Estimation
Human pose estimation is the task of estimating the ‘pose’ or configuration of a person’s body parts e.g. labeling the position and orientation of the head, torso, arms and legs in an image. In this paper we propose an extension of the pictorial structure model (PSM) approach [2]. Our method incorporates richer models of appearance and prior over pose without introducing unacceptable computatio...
متن کاملTowards real-time body pose estimation for presenters in meeti
This paper describes a computer vision-based approach to body pose estimation. The algorithm can be executed in real-time and processes low resolution, monocular image sequences. A silhouette is extracted and matched against a projection of a 16 DOF human body model. In addition, skin color is used to locate hands and head. No detailed human body model is needed. We evaluate the approach both q...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-20068-7_23