Hierarchical I3D for Sign Spotting
نویسندگان
چکیده
Most of the vision-based sign language research to date has focused on Isolated Sign Language Recognition (ISLR), where objective is predict a single class given short video clip. Although there been significant progress in ISLR, its real-life applications are limited. In this paper, we focus challenging task Spotting instead, goal simultaneously identify and localise signs continuous co-articulated videos. To address limitations current ISLR-based models, propose hierarchical spotting approach which learns coarse-to-fine spatio-temporal features take advantage representations at various temporal levels provide more precise localisation. Specifically, develop Hierarchical I3D model (HS-I3D) consists network head that attached existing exploit different layers network. We evaluate HS-I3D ChaLearn 2022 Challenge - MSSL track achieve state-of-the-art 0.607 F1 score, was top-1 winning solution competition.
منابع مشابه
Hierarchical Plausibility-Graphs for Symbol Spotting in Graphical Documents
Graph representation of graphical documents often suffers from noise viz. spurious nodes and spurios edges of graph and their discontinuity etc. In general these errors occur during the low-level image processing viz. binarization, skeletonization, vectorization etc. Hierarchical graph representation is a nice and efficient way to solve this kind of problem by hierarchically merging node-node a...
متن کاملI3D 2014 Guest Editor's Introduction
Ç T HIS special section of the IEEE Transactions on Visualiza-tion and Computer Graphics (TVCG) brings you two extended papers based on work first presented at the ACM Symposium on Interactive 3D Graphics and Games (I3D) in 2014. I3D focuses on real-time rendering, animation, and interaction techniques. Games are not the only application for these kinds of interactive methods, but are certainly...
متن کاملGarbage Model Formulation with Conditional Random Fields for Sign Language Spotting
Sign language spotting is the task of detecting and recognizing the signs in a signed utterance, from a set vocabulary. The difficulty of sign language spotting is that the instances of signs vary in both motion and appearance. Moreover, the signs appear within a continuous gesture stream, interspersed with transitional movements between signs in a vocabulary and non-sign patterns (out-of-vocab...
متن کاملGabor wavelet similarity maps for optimising hierarchical road sign classifiers
In recent years it has been shown that hierarchical classifiers have a significant advantage over single stage classifiers both in classification accuracy and in complexity of the classification features. This paper introduces a new method for creating the structure of hierarchical classifiers using a novel method for determining clusters. The proposed method uses features obtained using Gabor ...
متن کاملI3D: An Interactive System for Exploring Annotated 3D Environments
In this paper, we present I3D, a system that combines the 3D input and high-performance rendering capabilities of high-end virtual reality systems with the data fetching abilities of network browsers. Using a Spaceball, the user can intuitively navigate inside the three-dimensional data, while selecting 3D objects with the mouse triggers requests for access to remote media documents that can be...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2023
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-25085-9_14