Pitch Marking Based on an Adaptable Filter and a Peak-Valley Estimation Method
نویسندگان
چکیده
In a text-to-speech (TTS) conversion system based on the time-domain pitch-synchronous overlap-add (TD-PSOLA) method, accurate estimation of pitch periods and pitch marks is necessary for pitch modification to assure an optimal quality of the synthetic speech. In general, there are two major issues on pitch marking: pitch detection and location determination. In this paper, an adaptable filter, which serves as a bandpass filter, is proposed for pitch detection to transform the voiced speech into a sine-like wave. Based on the sine-like wave, a peak-valley decision method is investigated to determine the appropriate part (positive part and negative part) of the voiced speech for pitch mark estimation. At each pitch period, two possible peaks/valleys are searched and the dynamic programming is performed to obtain the pitch marks. Experimental results indicate that our proposed method performed very well if correct pitch information is estimated.
منابع مشابه
A two-phase pitch marking method for TD-PSOLA synthesis
This paper describes a robust two-phase pitch marking method based on peak-valley decision and dynamic programming. In the first phase, we select either peaks or valleys for pitch mark candidates according to its similarity to an estimated pitch curve. In the second phase, we define state and transition probabilities, and then employ dynamic programming to find the most likely pitch marks. We h...
متن کاملEstimation of Roughness Parameters of A Surface Using Different Image Enhancement Techniques (TECHNICAL NOTE)
Surface roughness measurement is widely used to estimate the quality of the product during manufacturing processes. It has a great importance in manufacturing fields such as ceramic tiles, glass, and iron. Many are using surface profile-meter with a contact stylus to measure the surface roughness of work piece. In the stylus method, a stylus is moved along the surface and the vertical movement ...
متن کاملHybrid electroglottograph and speech signal based algorithm for pitch marking
Pitch marking is very significant in speech signal processing. In a text-to-speech (TTS) system based on the Time-Domain Pitch-Synchronous Overlap-Add (TD-PSOLA) method, robust estimation of pitch marks (PM) is especially important to the modification of the time and pitch scale of a speech signal in order to match it to that of the target speaker. The aim of this paper is to improve the accura...
متن کاملPitch Marks at Peaks or Valleys?
This paper deals with the problem of speech waveform polarity. As the polarity of speech waveform can influence the performance of pitch marking algorithms (see Sec. 4), a simple method for the speech signal polarity determination is presented in the paper. We call this problem peak/valley decision making, i.e. making of decision whether pitch marks should be placed at peaks (local maxima) or a...
متن کاملAn Effective Attack-Resilient Kalman Filter-Based Approach for Dynamic State Estimation of Synchronous Machine
Kalman filtering has been widely considered for dynamic state estimation in smart grids. Despite its unique merits, the Kalman Filter (KF)-based dynamic state estimation can be undesirably influenced by cyber adversarial attacks that can potentially be launched against the communication links in the Cyber-Physical System (CPS). To enhance the security of KF-based state estimation, in this paper...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJCLCLP
دوره 6 شماره
صفحات -
تاریخ انتشار 2001