CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

نویسندگان

چکیده

Large-scale datasets have played indispensable roles in the recent success of face generation/editing and significantly facilitated advances emerging research fields. However, academic community still lacks a video dataset with diverse facial attribute annotations, which is crucial for on face-related videos. In this work, we propose large-scale, high-quality, rich named High-Quality Celebrity Video Dataset (CelebV-HQ). CelebV-HQ contains 35, 666 clips resolution $$512\times 512$$ at least, involving 15, 653 identities. All are labeled manually 83 attributes, covering appearance, action, emotion. We conduct comprehensive analysis terms age, ethnicity, brightness stability, motion smoothness, head pose diversity, data quality to demonstrate diversity temporal coherence CelebV-HQ. Besides, its versatility potential validated two representative tasks, i.e., unconditional generation editing. finally envision future CelebV-HQ, as well new opportunities challenges it would bring related directions. Data, code, models publicly available (Project page: https://celebv-hq.github.io/ Code models: https://github.com/CelebV-HQ/CelebV-HQ ).

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diverse Scene Stitching from a Large-Scale Aerial Video Dataset

Diverse scene stitching is a challenging task in aerial video surveillance. This paper presents a hybrid stitching method based on the observation that aerial videos captured in real surveillance settings are neither totally ordered nor completely unordered. Often, human operators apply continuous monitoring of the drone to revisit the same area of interest. This monitoring mechanism yields to ...

متن کامل

VoxCeleb: A Large-Scale Speaker Identification Dataset

Most existing datasets for speaker identification contain samples obtained under quite constrained conditions, and are usually hand-annotated, hence limited in size. The goal of this paper is to generate a large scale text-independent speaker identification dataset collected ‘in the wild’. We make two contributions. First, we propose a fully automated pipeline based on computer vision technique...

متن کامل

VideoSet: A Large-Scale Compressed Video Quality Dataset Based on JND Measurement

A new methodology to measure coded image/video quality using the just-noticeable-difference (JND) idea was proposed in [1]. Several small JND-based image/video quality datasets were released by the Media Communications Lab at the University of Southern California in [2,3]. In this work, we present an effort to build a large-scale JND-based coded video quality dataset. The dataset consists of 22...

متن کامل

FaceForensics: A Large-scale Video Dataset for Forgery Detection in Human Faces

With recent advances in computer vision and graphics, it is now possible to generate videos with extremely realistic synthetic faces, even in real time. Countless applications are possible, some of which raise a legitimate alarm, calling for reliable detectors of fake videos. In fact, distinguishing between original and manipulated video can be a challenge for humans and computers alike, especi...

متن کامل

AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video

Sangmin Oh, Anthony Hoogs, Amitha Perera, Naresh Cuntoor, Chia-Chih Chen, Jong Taek Lee, Saurajit Mukherjee, J. K. Aggarwal, Hyungtae Lee, Larry Davis, Eran Swears, Xiaoyang Wang, Qiang Ji, Kishore Reddy, Mubarak Shah, Carl Vondrick, Hamed Pirsiavash, Deva Ramanan, Jenny Yuen, Antonio Torralba, Bi Song, Anesco Fong, Amit Roy-Chowdhury, Mita Desai {sangmin.oh, anthony.hoogs, amitha.perera,naresh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-20071-7_38