Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion

Abstract

One significant factor we expect video representation learning to capture, especially in contrast with image representation learning, is object motion. However, we found that in the current mainstream video datasets, some action categories are highly related to the scene where the action happens, making the model tend to degrade to a solution where only the scene information is encoded. For example, a trained model may predict a video as playing football simply because it sees the field, neglecting that the subject is dancing as a cheerleader on the field. This is against our original intention towards video representation learning and may bring a scene bias on different datasets that cannot be ignored. In order to tackle this problem, we propose to decouple the scene and the motion (DSM) with two simple operations, so that the model's attention towards the motion information is better paid. Specifically, we construct a positive clip and a negative clip for each video. Compared to the original video, the positive/negative clip is motion-untouched/broken but scene-broken/untouched by Spatial Local Disturbance and Temporal Local Disturbance. Our objective is to pull the positive closer to the original clip while pushing the negative farther from it in the latent space. In this way, the impact of the scene is weakened while the temporal sensitivity of the network is further enhanced. We conduct experiments on two tasks with various backbones and different pre-training datasets, and find that our method surpasses the SOTA methods with a remarkable 8.1% and 8.8% improvement on the action recognition task on the UCF101 and HMDB51 datasets respectively, using the same backbone.
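The pull/push objective described in the abstract can be illustrated with a small sketch. The abstract does not state the exact loss function, so a hinge-style triplet margin loss is assumed here; the function names, the margin value, and the toy embeddings are all hypothetical and are not taken from the paper.

```python
import math


def l2_distance(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Assumed triplet margin loss (not confirmed by the abstract).

    The loss is zero once the negative is at least `margin` farther
    from the anchor than the positive is.
    """
    gap = l2_distance(anchor, positive) - l2_distance(anchor, negative)
    return max(0.0, gap + margin)


# Toy 2-D embeddings with made-up values, for illustration only.
anchor = [1.0, 0.0]    # original clip
positive = [0.9, 0.1]  # motion kept, scene disturbed
negative = [0.0, 1.0]  # scene kept, motion disturbed

# Positive already close and negative already far: no penalty.
loss_good = triplet_loss(anchor, positive, negative)

# Roles swapped (the scene-matching clip is the closer one): penalized.
loss_bad = triplet_loss(anchor, negative, positive)
```

In this toy setup the loss vanishes when the motion-preserving clip sits closer to the original than the scene-preserving clip, and grows when the ordering is reversed, which is the behavior the abstract's objective asks for.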


Similar resources

An Improved Motion Vector Estimation Approach for Video Error Concealment Based on the Video Scene Analysis

In order to enhance the accuracy of the motion vector (MV) estimation and also reduce the error propagation issue during the estimation, in this paper, a new adaptive error concealment (EC) approach is proposed based on the information extracted from the video scene. In this regard, the motion information of the video scene around the degraded MB is first analyzed to estimate the motion type of...


Heterogeneity within the Orientalist Discourse: Representation of the Orient in Women's Travelogues and Men's Paintings

From the 1950s onward, new theories and critical approaches burgeoned across the humanities. These theories were context-oriented; as a result, the analysis of discursive practices gained significance. Thus, social, political, historical and cultural discourses that had hitherto been marginalized and considered inferior to literary texts were introduced as important texts to be analyzed by critics. o...

The Relationship between Using Language Learning Strategies, Learners' Optimism, Educational Status, Duration of Learning and Demotivation

With the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners' feelings, emotions and individual differences. One of the issues in teaching and learning English as a foreign language is demotivation. The purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...


Investigating the Effect of Motivation and Attitude towards Learning English, Learning Style Preferences and Gender on Iranian EFL Learners' Proficiency

The present study was conducted to investigate the effect of motivation and attitude towards learning English, learning style preferences, and gender on the proficiency of Iranian EFL learners. To this end, 154 Iranian EFL learners participated in the study. Three data collection instruments, including the Oxford English proficiency placement test, Brach's learning style preferences questionnaire, and a questionnaire on motivation and attitude towards learning English...

Unsupervised Feature Extraction for the Representation and Recognition of Lip Motion Video

Lip-reading recognition is reported with lip-motion features extracted from multiple video frames by three unsupervised learning algorithms, i.e., Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Since the human perception of facial motion goes through two different pathways, i.e., the lateral fusiform gyrus for the invari...


Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2021

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v35i11.17215