Grounding-Tracking-Integration
نویسندگان
چکیده
In this paper, we study tracking by language that localizes the target box sequence in a video based on query. We propose framework called GTI decomposes problem into three sub-tasks: Grounding, Tracking, and Integration. The sub-task modules operate simultaneously predict frame-by-frame. “Grounding” predicts referred region directly from “Tracking” history of grounded regions previous frames. “Integration” generates final predictions synergistically combining grounding tracking. With “integration” task as key, explore how to indicate quality each frame achieve desired mutually beneficial combination. To end, an “RT-integration” method defines two scores guide integration: 1) R-score represents Region correctness whether prediction accurately covers target, 2) T-score Template provides informative visual cues improve future present our real-time implementation with proposed RT-integration, benchmark LaSOT Lingual OTB99 highly promising results. Moreover, produce disambiguated version queries facilitate studies.
منابع مشابه
Grounding Symbols through Sensorimotor Integration
A prominent robotics professor surprised me at last year's rsj conference: \There isn't really a symbol grounding problem for robotics, is there? I often ask people, `Is symbol grounding a problem for your re-search?' and no one says, `Yes.' " Sensing irony in his voice, I replied, \That's because no one is building systems with a human | or even vertebrate | level of competence. When they try ...
متن کاملOn the Integration of Grounding Language and Learning Objects
This paper presents a multimodal learning system that can ground spoken names of objects in their physical referents and learn to recognize those objects simultaneously from naturally co-occurring multisensory input. There are two technical problems involved: (1) the correspondence problem in symbol grounding – how to associate words (symbols) with their perceptually grounded meanings from mult...
متن کاملViewing Vision-Language Integration as a Double-Grounding Case
While vision-language integration is important for a wide range of Artificial Intelligence (AI) prototypes and applications, the notion of integration has not been established within a theoretical framework that would allow for more thorough research on the issue. In this paper, we attempt to explore the reasons that dictate this content integration by bringing together Searle’s theory of inten...
متن کاملSensor data integration for indoor human tracking
A human tracking system based on the integration of the measurements from an inertial motion capture system and a UWB (Ultra-Wide Band) location system has been developed. On the one hand, the rotational measurements from the inertial system are used to track precisely all limbs of the body of the human. On the other hand, the translational measurements from both systems are combined by three d...
متن کاملIntegration of Bayes detection with target tracking
Existing detection systems generally are operated using a fixed threshold and optimized to the Neyman–Pearson criterion. An alternative is Bayes detection, in which the threshold varies according to the ratio of prior probabilities. In a recursive target tracker such as the probabilistic data association filter (PDAF), such priors are available in the form of a predicted location and associated...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2021
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2020.3038720