CA2091560A1 - Methods for speech analysis and synthesis - Google Patents

Methods for speech analysis and synthesis

Info

Publication number
CA2091560A1
CA2091560A1 CA002091560A CA2091560A CA2091560A1 CA 2091560 A1 CA2091560 A1 CA 2091560A1 CA 002091560 A CA002091560 A CA 002091560A CA 2091560 A CA2091560 A CA 2091560A CA 2091560 A1 CA2091560 A1 CA 2091560A1
Authority
CA
Canada
Prior art keywords
pitch
values
making
integer
synthesis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002091560A
Other languages
French (fr)
Other versions
CA2091560C (en
Inventor
John C. Hardwick
Jae S. Lim
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Digital Voice Systems Inc
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CA2091560A1 publication Critical patent/CA2091560A1/en
Application granted granted Critical
Publication of CA2091560C publication Critical patent/CA2091560C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/087Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using mixed excitation models, e.g. MELP, MBE, split band LPC or HVXC
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Abstract

Sub-integer resolution pitch values are estimated in making the initial pitch estimate; the sub-integer pitch values are pref-erably estimated by interpolating intermediate variables between integer values. Pitch regions are used to reduce the amount of computation required in making the initial pitch estimate. Pitch-dependent resolution is used in making the initial pitch estimate, with higher resolution being used for smaller values of pitch.
CA002091560A 1990-09-20 1991-09-20 Methods for speech analysis and synthesis Expired - Lifetime CA2091560C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US585,830 1990-09-20
US07/585,830 US5226108A (en) 1990-09-20 1990-09-20 Processing a speech signal with estimated pitch
PCT/US1991/006853 WO1992005539A1 (en) 1990-09-20 1991-09-20 Methods for speech analysis and synthesis

Publications (2)

Publication Number Publication Date
CA2091560A1 true CA2091560A1 (en) 1992-03-21
CA2091560C CA2091560C (en) 2003-01-07

Family

ID=24343133

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002091560A Expired - Lifetime CA2091560C (en) 1990-09-20 1991-09-20 Methods for speech analysis and synthesis

Country Status (8)

Country Link
US (3) US5226108A (en)
EP (1) EP0549699B1 (en)
JP (1) JP3467269B2 (en)
KR (1) KR100225687B1 (en)
AU (1) AU658835B2 (en)
CA (1) CA2091560C (en)
DE (1) DE69131776T2 (en)
WO (1) WO1992005539A1 (en)

Families Citing this family (82)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5226108A (en) * 1990-09-20 1993-07-06 Digital Voice Systems, Inc. Processing a speech signal with estimated pitch
US5765127A (en) * 1992-03-18 1998-06-09 Sony Corp High efficiency encoding method
US5517511A (en) * 1992-11-30 1996-05-14 Digital Voice Systems, Inc. Digital transmission of acoustic signals over a noisy communication channel
US5574823A (en) * 1993-06-23 1996-11-12 Her Majesty The Queen In Right Of Canada As Represented By The Minister Of Communications Frequency selective harmonic coding
JP2658816B2 (en) * 1993-08-26 1997-09-30 日本電気株式会社 Speech pitch coding device
US6463406B1 (en) * 1994-03-25 2002-10-08 Texas Instruments Incorporated Fractional pitch method
US5715365A (en) * 1994-04-04 1998-02-03 Digital Voice Systems, Inc. Estimation of excitation parameters
US5787387A (en) * 1994-07-11 1998-07-28 Voxware, Inc. Harmonic adaptive speech coding method and system
AU696092B2 (en) * 1995-01-12 1998-09-03 Digital Voice Systems, Inc. Estimation of excitation parameters
EP0944037B1 (en) * 1995-01-17 2001-10-10 Nec Corporation Speech encoder with features extracted from current and previous frames
US5701390A (en) * 1995-02-22 1997-12-23 Digital Voice Systems, Inc. Synthesis of MBE-based coded speech using regenerated phase information
US5754974A (en) * 1995-02-22 1998-05-19 Digital Voice Systems, Inc Spectral magnitude representation for multi-band excitation speech coders
JP3747492B2 (en) * 1995-06-20 2006-02-22 ソニー株式会社 Audio signal reproduction method and apparatus
US5774837A (en) * 1995-09-13 1998-06-30 Voxware, Inc. Speech coding system and method using voicing probability determination
US6591240B1 (en) * 1995-09-26 2003-07-08 Nippon Telegraph And Telephone Corporation Speech signal modification and concatenation method by gradually changing speech parameters
JP3680374B2 (en) * 1995-09-28 2005-08-10 ソニー株式会社 Speech synthesis method
JP4132109B2 (en) * 1995-10-26 2008-08-13 ソニー株式会社 Speech signal reproduction method and device, speech decoding method and device, and speech synthesis method and device
US5684926A (en) * 1996-01-26 1997-11-04 Motorola, Inc. MBE synthesizer for very low bit rate voice messaging systems
WO1997027578A1 (en) * 1996-01-26 1997-07-31 Motorola Inc. Very low bit rate time domain speech analyzer for voice messaging
US5806038A (en) * 1996-02-13 1998-09-08 Motorola, Inc. MBE synthesizer utilizing a nonlinear voicing processor for very low bit rate voice messaging
US6035007A (en) * 1996-03-12 2000-03-07 Ericsson Inc. Effective bypass of error control decoder in a digital radio system
US5696873A (en) * 1996-03-18 1997-12-09 Advanced Micro Devices, Inc. Vocoder system and method for performing pitch estimation using an adaptive correlation sample window
US5774836A (en) * 1996-04-01 1998-06-30 Advanced Micro Devices, Inc. System and method for performing pitch estimation and error checking on low estimated pitch values in a correlation based pitch estimator
SE506341C2 (en) * 1996-04-10 1997-12-08 Ericsson Telefon Ab L M Method and apparatus for reconstructing a received speech signal
US5960386A (en) * 1996-05-17 1999-09-28 Janiszewski; Thomas John Method for adaptively controlling the pitch gain of a vocoder's adaptive codebook
JPH10105195A (en) * 1996-09-27 1998-04-24 Sony Corp Pitch detecting method and method and device for encoding speech signal
JPH10105194A (en) * 1996-09-27 1998-04-24 Sony Corp Pitch detecting method, and method and device for encoding speech signal
US6161089A (en) * 1997-03-14 2000-12-12 Digital Voice Systems, Inc. Multi-subframe quantization of spectral parameters
US6131084A (en) * 1997-03-14 2000-10-10 Digital Voice Systems, Inc. Dual subframe quantization of spectral magnitudes
US6456965B1 (en) * 1997-05-20 2002-09-24 Texas Instruments Incorporated Multi-stage pitch and mixed voicing estimation for harmonic speech coders
US5946650A (en) * 1997-06-19 1999-08-31 Tritech Microelectronics, Ltd. Efficient pitch estimation method
WO1999003095A1 (en) * 1997-07-11 1999-01-21 Koninklijke Philips Electronics N.V. Transmitter with an improved harmonic speech encoder
WO1999010719A1 (en) 1997-08-29 1999-03-04 The Regents Of The University Of California Method and apparatus for hybrid coding of speech at 4kbps
US5999897A (en) * 1997-11-14 1999-12-07 Comsat Corporation Method and apparatus for pitch estimation using perception based analysis by synthesis
US6199037B1 (en) 1997-12-04 2001-03-06 Digital Voice Systems, Inc. Joint quantization of speech subframe voicing metrics and fundamental frequencies
US6070137A (en) * 1998-01-07 2000-05-30 Ericsson Inc. Integrated frequency-domain voice coding using an adaptive spectral enhancement filter
KR19990065424A (en) * 1998-01-13 1999-08-05 윤종용 Pitch Determination for Low Delay Multiband Excitation Vocoder
US6064955A (en) 1998-04-13 2000-05-16 Motorola Low complexity MBE synthesizer for very low bit rate voice messaging
US6438517B1 (en) * 1998-05-19 2002-08-20 Texas Instruments Incorporated Multi-stage pitch and mixed voicing estimation for harmonic speech coders
GB9811019D0 (en) * 1998-05-21 1998-07-22 Univ Surrey Speech coders
US6463407B2 (en) * 1998-11-13 2002-10-08 Qualcomm Inc. Low bit-rate coding of unvoiced segments of speech
US6691084B2 (en) * 1998-12-21 2004-02-10 Qualcomm Incorporated Multiple mode variable rate speech coding
US6298322B1 (en) 1999-05-06 2001-10-02 Eric Lindemann Encoding and synthesis of tonal audio signals using dominant sinusoids and a vector-quantized residual tonal signal
US6470311B1 (en) 1999-10-15 2002-10-22 Fonix Corporation Method and apparatus for determining pitch synchronous frames
US6868377B1 (en) * 1999-11-23 2005-03-15 Creative Technology Ltd. Multiband phase-vocoder for the modification of audio or speech signals
US6377916B1 (en) 1999-11-29 2002-04-23 Digital Voice Systems, Inc. Multiband harmonic transform coder
US6975984B2 (en) * 2000-02-08 2005-12-13 Speech Technology And Applied Research Corporation Electrolaryngeal speech enhancement for telephony
US6564182B1 (en) * 2000-05-12 2003-05-13 Conexant Systems, Inc. Look-ahead pitch determination
CN1193347C (en) * 2000-06-20 2005-03-16 皇家菲利浦电子有限公司 Sinusoidal coding
US6587816B1 (en) 2000-07-14 2003-07-01 International Business Machines Corporation Fast frequency-domain pitch estimation
KR100367700B1 (en) * 2000-11-22 2003-01-10 엘지전자 주식회사 estimation method of voiced/unvoiced information for vocoder
EP1382143B1 (en) * 2001-04-24 2007-02-07 Nokia Corporation Methods for changing the size of a jitter buffer and for time alignment, communications system, receiving end, and transcoder
KR100393899B1 (en) * 2001-07-27 2003-08-09 어뮤즈텍(주) 2-phase pitch detection method and apparatus
KR100347188B1 (en) * 2001-08-08 2002-08-03 Amusetec Method and apparatus for judging pitch according to frequency analysis
US7124075B2 (en) * 2001-10-26 2006-10-17 Dmitry Edward Terez Methods and apparatus for pitch determination
US6912495B2 (en) * 2001-11-20 2005-06-28 Digital Voice Systems, Inc. Speech model and analysis, synthesis, and quantization methods
JP2004054526A (en) * 2002-07-18 2004-02-19 Canon Finetech Inc Image processing system, printer, control method, method of executing control command, program and recording medium
US7970606B2 (en) 2002-11-13 2011-06-28 Digital Voice Systems, Inc. Interoperable vocoder
US7251597B2 (en) * 2002-12-27 2007-07-31 International Business Machines Corporation Method for tracking a pitch signal
US7634399B2 (en) * 2003-01-30 2009-12-15 Digital Voice Systems, Inc. Voice transcoder
US6988064B2 (en) * 2003-03-31 2006-01-17 Motorola, Inc. System and method for combined frequency-domain and time-domain pitch extraction for speech signals
US8359197B2 (en) 2003-04-01 2013-01-22 Digital Voice Systems, Inc. Half-rate vocoder
US7373294B2 (en) * 2003-05-15 2008-05-13 Lucent Technologies Inc. Intonation transformation for speech therapy and the like
US8310441B2 (en) * 2004-09-27 2012-11-13 Qualcomm Mems Technologies, Inc. Method and system for writing data to MEMS display elements
US7319426B2 (en) * 2005-06-16 2008-01-15 Universal Electronics Controlling device with illuminated user interface
US8036886B2 (en) 2006-12-22 2011-10-11 Digital Voice Systems, Inc. Estimation of pulsed speech model parameters
JP5229234B2 (en) * 2007-12-18 2013-07-03 富士通株式会社 Non-speech segment detection method and non-speech segment detection apparatus
WO2011025532A1 (en) * 2009-08-24 2011-03-03 NovaSpeech, LLC System and method for speech synthesis using frequency splicing
US8767978B2 (en) 2011-03-25 2014-07-01 The Intellisis Corporation System and method for processing sound signals implementing a spectral motion transform
US9183850B2 (en) 2011-08-08 2015-11-10 The Intellisis Corporation System and method for tracking sound pitch across an audio signal
US8548803B2 (en) 2011-08-08 2013-10-01 The Intellisis Corporation System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
US8620646B2 (en) * 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
CN103325384A (en) 2012-03-23 2013-09-25 杜比实验室特许公司 Harmonicity estimation, audio classification, pitch definition and noise estimation
WO2013142726A1 (en) * 2012-03-23 2013-09-26 Dolby Laboratories Licensing Corporation Determining a harmonicity measure for voice processing
KR101475894B1 (en) * 2013-06-21 2014-12-23 서울대학교산학협력단 Method and apparatus for improving disordered voice
US9583116B1 (en) * 2014-07-21 2017-02-28 Superpowered Inc. High-efficiency digital signal processing of streaming media
US9842611B2 (en) 2015-02-06 2017-12-12 Knuedge Incorporated Estimating pitch using peak-to-peak distances
US9870785B2 (en) 2015-02-06 2018-01-16 Knuedge Incorporated Determining features of harmonic signals
US9922668B2 (en) 2015-02-06 2018-03-20 Knuedge Incorporated Estimating fractional chirp rate with multiple frequency representations
US10431236B2 (en) * 2016-11-15 2019-10-01 Sphero, Inc. Dynamic pitch adjustment of inbound audio to improve speech recognition
EP3447767A1 (en) * 2017-08-22 2019-02-27 Österreichische Akademie der Wissenschaften Method for phase correction in a phase vocoder and device
US11270714B2 (en) 2020-01-08 2022-03-08 Digital Voice Systems, Inc. Speech coding using time-varying interpolation

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3706929A (en) * 1971-01-04 1972-12-19 Philco Ford Corp Combined modem and vocoder pipeline processor
US3982070A (en) * 1974-06-05 1976-09-21 Bell Telephone Laboratories, Incorporated Phase vocoder speech synthesis system
US3995116A (en) * 1974-11-18 1976-11-30 Bell Telephone Laboratories, Incorporated Emphasis controlled speech synthesizer
US4004096A (en) * 1975-02-18 1977-01-18 The United States Of America As Represented By The Secretary Of The Army Process for extracting pitch information
US4015088A (en) * 1975-10-31 1977-03-29 Bell Telephone Laboratories, Incorporated Real-time speech analyzer
US4076958A (en) * 1976-09-13 1978-02-28 E-Systems, Inc. Signal synthesizer spectrum contour scaler
JPS597120B2 (en) * 1978-11-24 1984-02-16 日本電気株式会社 speech analysis device
FR2494017B1 (en) * 1980-11-07 1985-10-25 Thomson Csf METHOD FOR DETECTING THE MELODY FREQUENCY IN A SPEECH SIGNAL AND DEVICE FOR CARRYING OUT SAID METHOD
US4441200A (en) * 1981-10-08 1984-04-03 Motorola Inc. Digital voice processing system
US4696038A (en) * 1983-04-13 1987-09-22 Texas Instruments Incorporated Voice messaging system with unified pitch and voice tracking
DE3370423D1 (en) * 1983-06-07 1987-04-23 Ibm Process for activity detection in a voice transmission system
AU2944684A (en) * 1983-06-17 1984-12-20 University Of Melbourne, The Speech recognition
NL8400552A (en) * 1984-02-22 1985-09-16 Philips Nv SYSTEM FOR ANALYZING HUMAN SPEECH.
US4856068A (en) * 1985-03-18 1989-08-08 Massachusetts Institute Of Technology Audio pre-processing methods and apparatus
US4879748A (en) * 1985-08-28 1989-11-07 American Telephone And Telegraph Company Parallel processing pitch detector
US4797926A (en) * 1986-09-11 1989-01-10 American Telephone And Telegraph Company, At&T Bell Laboratories Digital speech vocoder
DE3640355A1 (en) * 1986-11-26 1988-06-09 Philips Patentverwaltung METHOD FOR DETERMINING THE PERIOD OF A LANGUAGE PARAMETER AND ARRANGEMENT FOR IMPLEMENTING THE METHOD
NL8701798A (en) * 1987-07-30 1989-02-16 Philips Nv METHOD AND APPARATUS FOR DETERMINING THE PROGRESS OF A VOICE PARAMETER, FOR EXAMPLE THE TONE HEIGHT, IN A SPEECH SIGNAL
US4809334A (en) * 1987-07-09 1989-02-28 Communications Satellite Corporation Method for detection and correction of errors in speech pitch period estimates
US5226108A (en) * 1990-09-20 1993-07-06 Digital Voice Systems, Inc. Processing a speech signal with estimated pitch

Also Published As

Publication number Publication date
WO1992005539A1 (en) 1992-04-02
KR930702743A (en) 1993-09-09
EP0549699A4 (en) 1995-04-26
EP0549699B1 (en) 1999-11-10
AU658835B2 (en) 1995-05-04
US5195166A (en) 1993-03-16
JP3467269B2 (en) 2003-11-17
JPH06503896A (en) 1994-04-28
US5226108A (en) 1993-07-06
DE69131776D1 (en) 1999-12-16
EP0549699A1 (en) 1993-07-07
KR100225687B1 (en) 1999-10-15
DE69131776T2 (en) 2004-07-01
AU8629891A (en) 1992-04-15
CA2091560C (en) 2003-01-07
US5581656A (en) 1996-12-03

Similar Documents

Publication Publication Date Title
CA2091560A1 (en) Methods for speech analysis and synthesis
MY109174A (en) Time variable spectral analysis based on interpolation for speech coding
EP1850326A3 (en) Frame erasure compensation method in a variable rate speech coder
AU551937B2 (en) Speech analysis and synthesis
GB2036515B (en) Predictive speech signal coding with reduced noise effects
ES2030057T3 (en) A PRECIPITATED AMORPHOUS SILICE AND ITS PREPARATION METHOD.
DE69611720D1 (en) PRODUCTION OF RADIOISOTOPES BY ISOTOPIC CONVERSION
AU5436590A (en) Methods and apparatus for reconstructing non-quantized adaptively transformed voice signals
CA2021508A1 (en) Digital speech coder having improved long term lag parameter determination
CA2156679A1 (en) Data Rate Conversion
BR9103645A (en) SYNTHESIS OF 1,2-SYNTHETIC POLYBUTADIENE IN A WATERFUL MEDIA
LV11062A (en) Water wall reactor with verdoso slan
CA2213779A1 (en) Speech synthesis
FR2740640B1 (en) METHOD FOR DETERMINING THE TYPE AND CODING MODE FOR CODING THE SHAPE OF AN OBJECT
FR2702075B1 (en) METHOD FOR GENERATING A SPECTRAL WEIGHTING FILTER IN A SPEECH ENCODER.
DE69613140D1 (en) PRODUCTION OF A PROFILE BY POURING
CA2241549A1 (en) Waveform synthesis
CA2175906A1 (en) Conversion of bisnoralcohol to bisnoraldehyde
JPS57125999A (en) Voice synthesizer
AT374436B (en) METHOD FOR PRODUCING A COMPACT, COARSE GRAINED SODIUM PERCARBONATE
DE69100859T2 (en) The use of an amorphous alloy catalyst to convert carbon dioxide.
JPS52142691A (en) Production of catalytic raw material or catalytic component for polyolefine
JPS531014A (en) Electronic musical instrument
HK1014431A1 (en) Synthesising speech by converting phonemes to digital waveforms
KR900004755A (en) Methods of synthesizing intermediates useful in the preparation of nozirimycin and related compounds

Legal Events

Date Code Title Description
EEER Examination request
MKEX Expiry