Rate-Distortion-Complexity Optimization of Video Encoders with Applications to Sign Language Video Compression

نویسندگان

  • Rahul Vanam
  • Eve A. Riskin
  • Richard E. Ladner
  • Maya R. Gupta
چکیده

Rate-Distortion-Complexity Optimization of Video Encoders with Applications to Sign Language Video Compression Rahul Vanam Co-Chairs of the Supervisory Committee: Professor Eve A. Riskin Electrical Engineering Professor Richard E. Ladner Computer Science and Engineering Applications for video compression have been growing over the years due to the availability of higher network bandwidth, lower cost of memory, and faster processor speed. Some new applications include real-time videoconferencing and video streaming. Most current cell phones come equipped with a video camera, and have the ability to capture a video and playback a recorded video. An emerging area of video compression is mobile videoconferencing, which is enabling the Deaf to communicate in American Sign Language (ASL) using video cell phones. Video encoders developed for PCs cannot be readily used for mobile phones, due to mobile phones’ low processor speeds. In this thesis, we present algorithms for improving the speed of the H.264 video encoder on both PC and cell phone platforms by selecting encoder parameters that jointly trade off encoding speed and video quality at different bitrates. We also apply our algorithms to a region-of-interest based video encoder specific to ASL. This encoder jointly optimizes for ASL intelligibility and bitrate. The parameters chosen by our algorithms are demonstrated to significantly improve the encoding speed at a given ASL intelligibility for different bitrates, on both PC and cell phone platforms. ASL videoconferencing on a cell phone drastically reduces its battery life. To extend the cell phone battery life, we detect the signer’s activity on the cell phone, and use it to control the backlight of the cell phone and the encoding frame rate. This approach improves the battery life by up to 54 minutes. Finally, we present a rate control method that allows the encoder bitrate to adapt to time-varying network bandwidth. We show that when this encoder operates with suitably chosen parameters, it yields both higher encoding speed and ASL intelligibility on a cell phone over a constant bitrate encoder.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Complexity constrained rate-distortion optimization of sign language video using an objective intelligibility metric

Sign language users are eager for the freedom and convenience of video communication over cellular devices. Compression of sign language video in this setting offers unique challenges. The low bitrates available make encoding decisions extremely important, while the power constraints of the device limit the encoder complexity. The ultimate goal is to maximize the intelligibility of the conversa...

متن کامل

Fast Intra Mode Decision for Depth Map coding in 3D-HEVC Standard

three dimensional- high efficiency video coding (3D-HEVC) is the expanded version of the latest video compression standard, namely high efficiency video coding (HEVC), which is used to compress 3D videos. 3D videos include texture video and depth map. Since the statistical characteristics of depth maps are different from those of texture videos, new tools have been added to the HEVC standard fo...

متن کامل

A Fast Block Size Decision For Intra Coding in HEVC Standard

Intra coding in High efficiency video coding (HEVC) can significantly improve the compression efficiency using 35 intra-prediction modes for 2N×2N (N is an integer number ranging from six to two) luma blocks. To find the luma block with the minimum rate-distortion, it must perform 11932 different rate-distortion cost calculations. Although this approach improves coding efficiency compared to th...

متن کامل

A Fast Block Size Decision For Intra Coding in HEVC Standard

Intra coding in High efficiency video coding (HEVC) can significantly improve the compression efficiency using 35 intra-prediction modes for 2N×2N (N is an integer number ranging from six to two) luma blocks. To find the luma block with the minimum rate-distortion, it must perform 11932 different rate-distortion cost calculations. Although this approach improves coding efficiency compared to th...

متن کامل

الگوریتم کنترل نرخ بیت متغیر ویدئو در سطح گروه تصاویر برای استاندارد فشرده‎سازی H.265

A rate control algorithm at the group of picture (GOP) level is proposed in this paper for variable bit rate applications of the H.265/HEVC video coding standard with buffer constraint. Due to structural changes in the HEVC compared to the previous standards, new rate control algorithms are needed to be designed. In the proposed algorithm, quantization parameter (QP) of each GOP is obtained by ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010