US3621150A - Speech processor for changing voice pitch - Google Patents

Speech processor for changing voice pitch Download PDF

Info

Publication number
US3621150A
US3621150A US858623A US3621150DA US3621150A US 3621150 A US3621150 A US 3621150A US 858623 A US858623 A US 858623A US 3621150D A US3621150D A US 3621150DA US 3621150 A US3621150 A US 3621150A
Authority
US
United States
Prior art keywords
frequency
signals
signal
register
clock
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US858623A
Inventor
George W Pappas
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lockheed Corp
Original Assignee
Sanders Associates Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sanders Associates Inc filed Critical Sanders Associates Inc
Application granted granted Critical
Publication of US3621150A publication Critical patent/US3621150A/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/00007Time or data compression or expansion
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility

Definitions

  • the invention It d0 Search 179/1002 makes use of the fact that normal Speech may be chopped or 1002 1002 1002 [555 Tc; 178/1310 segmented at certain rates and still retain its intelligibility.
  • the 3 basic principle of the invention is accomplished in either an [56] Rekrences cued electronic or electromechanical embodiment wherein the following steps are performed.
  • the speech to be processed is seg- UNITED STATES PATENTS mented into very small and, in some applications. equal 2,352,023 6/1944 Schuller 179/1002 R pieces.
  • This invention relates generally to speech-processing techniques and is more particularly concerned with apparatus for reproducing the speech of a diver in a helium-oxygen atmosphere in intelligible form, restoring to normal sounding a normal recording played at a higher speed and providing compression of a voice signal for transmission over a low-grade transmission line.
  • this voice communication is solved by feeding the helium speech to amplitude and pitch circuits.
  • this prior art device uses 34 filters to separate the voice signal, each segment is processed separately and then recombined.
  • the pitch circuits the frequencies of the 34 lowest harmonics are determined.
  • the amplitude circuits the power levels within each of the 34 ISO-Hz. intervals of the speech spectrum are determined. The amplitudes are then shifted and applied to harmonics of the lower frequency.
  • the modulators which are coupled to the amplitude and pitch circuits, these power levels control the loudness of the 34 harmonic frequencies thus producing a pattern or envelope closely corresponding to the envelope of-normal speech.
  • the speech is divided into two bands of speech frequencies located in the area of maximum speech articulation, from 850 to 5,000 Hz.
  • Each band then goes first through a balanced modulator, then through a high pass filter that accepts the upper sideband only.
  • the filtered signals then go through a balanced demodulator driven by a continuously adjustable oscillator. This oscillator is set to provide the desired amount of shifting for the particular operational conditions.
  • the two bands are then mixed and the reconstructed speech is reproduced.
  • harmonic compressor One prior art method of accomplishing this speed hearing is known as a harmonic compressor.
  • speech is fed to a bank of 36 band-pass filters which separates the speech into its different frequency components.
  • the output of the bank of filters is sent, in turn, to 36 frequency dividers which halve the frequencies of the narrow-band signals from the filters. From the dividers, the halved frequency signals go to networks which remove distortion and combine the 36 halved signals into one signal. The frequency components of this signal are then half of the original input values.
  • This harmonically compressed signal is then recorded on magnetic tape. The halved frequencies are then restored to their original values by doubling the playback speed.
  • This prior art device does not operate in real time, i.e., the speech must first be recorded and played back later and is so voluminous that it takes up the space of a 6-foot high cabinet.
  • the voice signal is compressed for transmission over low bandwidth communication lines.
  • the prior art devices mentioned above might be adapted to provide for this application.
  • the aforesaid prior art apparatus are disadvantaged in that they either require an excessive amount of electronics, a multiplicity of special purpose and accurate components, especially filters, a very large and expensive configuration with an excessive power requirement, or they do not operate in real time. Because of these deficiencies in the prior art apparatus, the applications set forth are not fully realized, for example, a person would be limited to access of a speed hearing device.
  • SUMMARY AND OBJECTS OF THE INVENTION of the invention is to provide Yet another object of the invention is to provide an electronic embodiment comprising two shift registers into which voice signals, having been digitized, are loaded, and means for switching back and forth between these registers after each loading cycle so that there is no pause in the output signal as one register is loaded.
  • a further object of the invention is to provide an electromechanical embodiment wherein a magnetic tape loop is moved past a recording head into which the voice signal is applied, which tape loop is played by several playback heads moving in the same direction as the tape, but slower.
  • the basis for the apparatus of the invention is based on the fact that normal speech can be chopped or interrupted at certain rates and still retain its intelligibility. This factor is supported in an article of The Journal of the Acoustical Society of America, Volume 22, No. 2, pages I67 to 173, dated Mar. I950, and entitled, The intelligibility of Interrupted Speech. Both the electronic and electromechanical embodiments of the invention take full advantage of this factor.
  • an electronic embodiment of the apparatus of the invention digitizes the voice signal.
  • This voice signal might be that of a diver in a helium-oxygen atmosphere, or might be a recording of nonnal voice played back at higher speed or might be simply a voice signal being transmitted over a communications line.
  • the digitized voice signal is shifted into a first shift register at a first frequency.
  • this first register is fully loaded this condition is detected thereupon shifting the digitized voice signal out of the first register at a second frequency. This second frequency is lower that the first frequency.
  • subsequent and now digitized voice signals are shifted into the second register at the first frequency.
  • a magnetic tape loop is moved past a recording head into which the input voice signal is applied.
  • the tape is played back by several playback heads moving in the same direction as the tape, but slower. These playback heads are equally spaced on the circumference of a rotating wheel with each head sequentially making contact with the tape.
  • the tape between the heads during head changeover does not get played, thus segmenting the input signal with ever other segment, i.e., the tape not played, being discarded.
  • the loss of information from the discarded segments in the embodiments mentioned hereinabove is negligible when the chopping rate, i.e., the segment length is optimized for the characteristics of human speech and hearing.
  • the segments ideally should be long compared with voice pitch waveform periods of to milliseconds and short compared with voice syllabic periods of 100 to 200 milliseconds. For example, a segment period of milliseconds has been used.
  • the apparatus of the invention segments the incoming voice signal, discards alternate segments and reads out the segmented signal at a lower rate, thereby keeping the same time base as the input signal. It should be noted that with the first frequency being double that of the second frequency the pitch of the incoming voice signal will be reduced by one-half and that although 50 percent of the input signal has been lost at this point the articulation will still be satisfactory.
  • FIG. 1 illustrates a block diagram of an electronic embodiment of the speech processor of the invention
  • FIG. 2 illustrates the timing waveforms at signal points indicated in the block diagram illustrated in FIG. 1;
  • FIG. 3 illustrates a block diagram of an electromechanical embodiment of the speech processor of the invention.
  • FIG. 4 illustrates a modification of the processor shown in FIG. 3.
  • FIG. I there is illustrated a block diagram of an all electronic embodiment of the apparatus for processing the aforementioned signals.
  • An electromechanical embodiment will be discussed later.
  • the signal which is analog in nature is coupled to the input of an analog to digital (A/D) converter 10 which converter digitizes the input signal.
  • the converter 10 might be a simple clipper or limiter thereby producing a chopped signal at the output of converter 10, or might be a delta modulator thereby producing a binary pulse pattern which is representative of the input signal.
  • Converter 10 might also binary code the amplitude of the input signal which embodiment will be discussed hereinafter.
  • the output of converter 10 is connected to the inputs of shift registers l6 and 18, which registers receive their respective clock pulses by way of switch 20 from either A clock 14 or B clock 12.
  • Clock M operates at a higher frequency than clock 12 and although both clocks are shown separately it should be understood that the pulse train out of clock 12 could have been derived from divider means connected to the output of clock 14.
  • Switch 20 as well as switch 24, might include a well-known combination of logic gates and/or fiip-fiops and switch 20 is initially set so that terminal a is connected to terminal c while terminal I; is connected to terminal a. After switch 20 is initially toggled, a is connected to d and b is connected to 0. Subsequent toggling alternately changes the connections cited above.
  • the toggle signal may be generated by a counter 22 which generates a pulse each time clock M produces that number of pulses as registers 16 and id have signal storage positions.
  • the toggle signal might also be generated by a monostable multivibrator, not shown, whose time constant approximates the time for loading a register at the higher frequency of clock 14.
  • Counter 22 also toggles switch 24 which is initially set so that terminal a is connected to terminal I). The output of shift register 16 is connected to terminal b of switch 24 while the output of shift register 18 is connected to terminal 0.
  • switches 20 and 24 initially allows register 18 to clock in the digitized signal at the clock M frequency while register 16 has its signal stored therein clocked out of terminal a of switch 24 at the clock 12 frequency. Note that usually there is no signal stored in register 16 initially.
  • counter 22 When counter 22 generates a toggle signal the above condition will change so that now the signal just stored in register 18 will be clocked out of switch 24 at the clock 12 frequency while subsequent digitized signals will be clocked into register 16 at the clock M frequency. This process will continue in the alternate manner noted above.
  • the signal at terminal a of switch 24 is processed by a digital to analog (D/A) converter 26 and then filtered by means of filter 28 thereby providing the output signal of the apparatus of the invention.
  • D/A digital to analog
  • the signal is always clocked into a register at the higher frequency of clock 14 and clocked out of the registers at the lower frequency of clock l2. It can also be seen that the processed signal is lower in frequency than the digitized signal out of converter 10 and that such processed signal has been chopped thus resulting in some loss of information of the input signal. It can also be seen that two registers are used so that there is no pause in the output as one register is loaded.
  • Waveform A is a representation of the digitized signal out of A/D converter 10.
  • Waveforms B and C are representative of the clock pulse trains produced by clocks l4 and I2 respectively. As illustrated, waveform B is twice the frequency of waveform C.
  • Waveform D is the toggle signal from counter 22 which signal has been shown for illustration purposes as occurring every sixth pulse of clock 14.
  • Waveform E is the processed signal occurring at the output of switch 24.
  • Shift registers 16 and 18 each have, in this example, six storage positions. Since counter 22 produces a toggle signal every six pulses of clock 14, it can be seen that each time a register loads up at the higher clock rate it is then switched to output the stored signals at the lower clock rate.
  • the digitized input signal is clipped and converted to pulses as indicated in waveform A. Note that this conversion to pulses is not necessary since the clipped waveform could have been directly sampled.
  • waveform A is clocked into shift register 18 at the rate shown by waveform B, and at the same time register 16 is receiving the clock pulses indicated by waveform C. Initially, however, no signals are clocked out of register l6 since no signals had been stored therein.
  • register 18 will now have stored in it pulses indicated in waveform A as numbers 1 through 6.
  • counter 22 will emit a pulse as indicated by waveform D toggling the connections of switch 20 and of switch 24 such that now signals as indicated by wavefonn A beginning with the binary zero at time 7 will be clocked into register 16 at the waveform B rate.
  • the signals now stored in register 18 will be clocked out of register 18 at the waveform C rate and through the connections of switch 24 into the D/A converter 26 and out via filter 28.
  • the signal emitted from the a terminal of switch 24 is indicated by while clock 12 had wavefonn E.
  • a binary one pulse is shown which is identical to the signal as shown in wavefonn A at time 1.
  • a binary zero is emitted which binary zero is identical to the binary zero indicated at time 2 in waveform A. in a similar fashion the binary one indicated at time 11 in waveform E is identical to the binary one shown in waveform A at time 3.
  • the processing technique of the invention may be utilized for improving the intelligibility of speech in a helium-oxygen atmosphere, i.e., in that circumstance where a diver is at considerable water depth and is breathing a helium-oxygen mixture.
  • the voice of the diver for example at 200 feet is of high pitch and unintelligible.
  • the pitch of the voice becomes lower, i.e., the pitch is restored to normal, and the speech is now intelligible.
  • a normal recording might for example be played at twice the normal speed. At this higher speed the voice on the recording would be unintelligible.
  • the pitch would be lowered but is now intelligible, and the frequency of the voice recording would be doubled thus resulting in information being absorbed at a faster rate.
  • this processor In a further application of this invention it would be possible to use this processor to reduce the bandwidth of voice before transmitting it over a communications channel. With a ratio in clock frequencies of 2 to 1, a 50 percent reduction in the radiofrequency bandwidth may be possible. Another similar device on the receiving end would be utilized to reconvert the voice to its original pitch. In this application it would not be necessary to use the converter 26 and filter 28, transmission could be in digital form.
  • the receiving device would then incorporate a similar device as shown in the apparatus of the invention, however, the positions of clock 12 and clock 14 would be interchanged. That is to say the incoming processed digital signals from switch 24 of the transmitting device would be clocked into the receiving device at the lower frequency and clocked out at the higher frequency. This higher frequency output signal will then be converted into analog fonn and filtered thereby producing the received output signal. Further the receiving device would not require an A/D converter similar to converter 10 in F IG. 1.
  • converter 10 illustrated in FIG. 1, might have produced a binary code of the amplitude of the input signal. For example, if the amplitude of the input signal is divided into 8 levels then the converter 10 would have at its output three binary lines corresponding to 2, 2', and 2 and thereby giving eight possible binary indications of the level of the input signal.
  • shift register 16 would now be replaced by three identical and parallel shift registers. These sets of registers would be coupled as shown in FIG. 1 and the operation would remain substantially identical to that previously discussed.
  • the clock line from terminal 0 of switch 20 would clock the parallel registers replacing shift register 16 and'the clock line from terminal d of switch 20 would clock the parallel registers replacing shift register 18.
  • the D/A converter 26 would recombine the signals as switched by switch 24.
  • FIG. 3 there is illustrated an electromechanical embodiment of the invention.
  • the speech to be processed is applied to input terminal 74.
  • Input terminal 74 is connected to record head 42.
  • record head 42 and pressure pad 44 passes a magnetic tape loop 40 which is driven in a clockwise direction by capstan 56 and pinchroller 54.
  • Capstan 56 is driven in a counterclockwise direction, as shown by the arrow, by a suitable motor, not shown.
  • Tape loop 40 passes along the circumference of playback assembly 41 which assembly 41 is driven in a clockwise direction by another suitable motor, not shown.
  • Playback assembly 41 includes in this illustration two playback heads 62 and 64 spaced 180 apart along the circumference.
  • the two playback heads are coupled to the output terminal 72 by means of their slip rings 68 and brush 66.
  • the output 76 of brush 66 is coupled to a playback amplifier 70, whose output is coupled to output terminal 72.
  • the tape loop 40 is guided along the circumference of playback assembly 41 by means of roller bearings 60, which are utilized to reduce friction between the tape loop 40 and assembly 41.
  • rollers 50 and 52 have coupled to it a spring mechanism 58 for increased tension of the tape loop 40.
  • Playback assembly 41 might have included more than two playback heads spaced equally along the circumference. In any such arrangement as well as the two playback head arrangements shown, only one playback head is coupled to the output terminal 72 at a time. As illustrated, the tape behind playback head 64 to just in front of playback head 62 will not get played back thereby resulting in segmenting of the speech and discarding every other segment of the speech which is fed into record head 42. If only one playback head were used, the basic concept of the invention could still be achieved, however, the playback would not be continuous.
  • FIG. 3 is a simple schematic representation of the invention and does not include all the mechanical and electrical details but is sufficient to describe the invention. These details may be accomplished in any convenient manner.
  • the operation of the embodiment of FIG. 3 is as follows.
  • the closed magnetic tape loop 40 moves past the record head 42 at a fixed speed determined by the capstan 56.
  • the signal to be processed is applied to the record head 42 through the input terminal 74.
  • the recorded tape then passes over a playback housing M in the shape of a wheel which housing includes two or more playback heads 62 and 64 equally spaced along the circumference of the wheel.
  • the operation will be described with two heads but more than two may be desirable. For instance, the less the circumferential distance between heads the slower need be the speed of the tape and wheel for a given quality reproduction. Only one head at a time can be played back.
  • the speed of the wheel is variable from rpm. to an rpm. equal to the tape speed. With the speed 0 there is no pitch reduction; as the speed increases to one-half the speed of the tape, the pitch decreases by one-half. At this point the size of the tape segments played back become equal to the segments not played back. As the speed of the wheel is increased further the pitch is lowered further and smaller segments are played back relative to those not played. With a speed control for the wheel the pitch frequency will be adjustable.
  • the tape loop -30 passes over the wheel, it passes by the capstan 56 which drives the tape at a constant speed. It then passes the erase head 46 and guide rollers 50 and 52 and returns to the record head.
  • a spring is shown on one of the guide rollers to hold the tape loop tight against the playback heads. It also simplifies the replacement of the tape loop. If additional pressure is needed to hold the tape in contact with the playback heads a semicircular shoe could press against the tape. Another suggestion for holding the tape tight against the heads is shown in F IG. 4.
  • playback housing 41 is illustrated with a flexible tape pressure means generally indicated at 92.
  • Playback housing til now includes a cylinder 80 which has coated on its outer surface a friction-reducing substance such as that sold under the trademark TEFLON.” This essentially frictionless cylinder 80 replaces the roller bearings 60 of FIG. 3. Tape loop 40 passes around and is pressed against cylinder 80 by means of flexible tape pressure means 92.
  • Pressure means 92 includes flexible pressure tape 90 which is held tightly against recording tape 40 as it passes over the wheel or cylinder 80. Pressure tape 90 is held in position by means of guide rollers 82 and 84 and is kept tight by means of roller 86 and spring tension means 88. The pressure tape 90 is free to move around the rollers 82, 84 and 86 thus reducing the drag on the recording tape 40 as it passes over the cylinder 80. If it is required to reduce the drag further, a capstan, not
  • This embodiment of the invention might be utilized in a communication system in which case the receiving end would include apparatus as shown in FIG. 3. However, since the incoming speech is now lower in pitch than normal, the speed of revolution of the playback housing 41 would be greater than the speed of the tape loop 40.
  • Apparatus for processing speech signals comprising:
  • F. means for alternately reading out said first and second storage means at a second rate, said signals being read out at said second rate being coupled to said output terminal;
  • G a digital-to-analog converter coupled to the output of said storage means.
  • first and second storage means comprises first and second shift registers.
  • first switch means for coupling said pulses of said first frequency to the clock input of said first shift register whereby signals are loaded into said first shift register at said first frequency and for simultaneously coupling said pulses of said second frequency to the clock input of said second shift register whereby signals are unloaded from said second shift register at said second frequency.
  • Apparatus as defined in claim 6, further including:
  • a filter network coupled to receive a signal from said digitalto-analog converter.
  • first and second shift registers are each comprised of n parallel connected shift registers, each of which n parallel shift registers is coupled to a corresponding line of said n output lines.
  • Apparatus for processing speech signals comprising:
  • D. means for alternately clocking said digital signals into one of said shift registers at a first frequency
  • F. means for converting said digital signals clocked out of said shift registers at said second frequency to analog form.
  • Signal-processing apparatus having a processor output terminal, said apparatus comprising:
  • A. analog-to-digital conversion means B. means for generating first clock pulses at a first frequency; C. means for generating second clock pulses at a second frequency, said second frequency being lower in frequency than said first frequency; D. first and second shift register means 1. each having a signal input and output terminal, 2. each coupled to said conversion means at said input terminal,
  • F. means for alternately coupling said first clock pulses to said clock input of said first register means and then to said clock input of said second register means each time signal said sensing means senses the occurrence of m of said first clock pulses;
  • G means for alternately coupling said second clock pulses to said clock input of one of said register means which is not connected to said first clock pulses;
  • said first and second shift register means are each comprised of n parallel connected shift registers, each of which n parallel shift registers is coupled to a corresponding line of said n output lines.
  • Apparatus as defined in claim 18 wherein said means for converting a digital signal to an analog signal includes a filter network for further restoring said signal to analog form.
  • a method for processing voice signals comprising the steps of:
  • a method for processing signals comprising the steps of:

Abstract

By utilizing the speech processor of the invention, the speech of a diver in a helium-oxygen atmosphere is made intelligible, or a normal recording played at higher speeds is made intelligible thereby providing a speed hearing capability, or a voice signal is compressed for transmission over low bandwidth communication lines. The invention makes use of the fact that normal speech may be chopped or segmented at certain rates and still retain its intelligibility. The basic principle of the invention is accomplished in either an electronic or electromechanical embodiment wherein the following steps are performed. The speech to be processed is segmented into very small and, in some applications, equal pieces. Every other piece is discarded and the remaining pieces are recombined. The recombined pieces are played back at a slower speed dependent on the length of the discarded pieces.

Description

United States Patent Inventor George 1 1 2,997,697 8/1961 MacAdam .a 179/1002 T Lyndeboroush 3.458.642 7/1969 Leslie 179/1002 T 1 PP- 858,623 3,480,737 1 H1969 Greenberg et a1. 179/1002 R 1 Filed g n 69 3.485.960 12/1969 Gray 179/100.2T
22:21:: 8311;123:122 I Primary Examiner-Bernard Konick Nashua N H Assistant Examiner-Robert S. Tupper AHorney--L0uis Etlinger [54] SPEECH PROCESSOR FOR CHANGING VOICE PITCH 21 Chims4Drawing 51 ABSTRACT: By utilizing the speech processor of the invention, the speech of a diver in a helium-oxygen atmosphere is [52] U.S.C| ..I79/I00.2 K, made imemgible or a normal recording played at higher 179/1002 179N002 179/1555 TC speeds is made intelligible thereby providing a speed hearing [51] Int. Cl .1 Gl1b5/02, capability, or a voice Signal is compressed for "unsmission 50 F I f 61 1b 27/10 over low bandwidth communication lines. The invention It d0 Search 179/1002 makes use of the fact that normal Speech may be chopped or 1002 1002 1002 [555 Tc; 178/1310 segmented at certain rates and still retain its intelligibility. The 3 basic principle of the invention is accomplished in either an [56] Rekrences cued electronic or electromechanical embodiment wherein the following steps are performed. The speech to be processed is seg- UNITED STATES PATENTS mented into very small and, in some applications. equal 2,352,023 6/1944 Schuller 179/1002 R pieces. Every other piece is discarded and the remaining 1959 F l f flnk l 1- 1 l -2T pieces are recombined. The recombined pieces are played 1 /1 Ellison 179/100.2T back at a slower speed dependent on the length of the 2,909,616 10/1959 Marty 179/ 100.2 T discarded pieces.
INPUT A/ D ED '6 p 51-11 FT CONVERTER REGISTER slew/I (s) 1 224 /26 CLOCK SWITCH g D/ A 8 CONVERTER l c 5 HI F T REGISTER(S) TOGGLE F CLOCK {CLOCK OUTPUT v F'LTER i SIGNAL SET '0 c SWITCH IIA ll [TOGGLE CLOCK B COUNTER PATENTEDNHV'IBIQ?! 3521.150
' SHEVEIIUFZ INPUT A/ 0 FT :3 CONVERTER REGISTEFHS) SIGNAL I I SET (/24 Z26 CLOCK SWITCH a A CONVERTER l8 c SHIFT J J .1 REGISTER(S) TOGGLE 28 CLOCK y OUTPUT IcLocK M 'L :-\I SIGNAL SET J G C SWITCH 14 d w-- u u b A {TOGGLE CLOCK E f COUNTER 22 F161.
l 3 5 7 9 ll l3 l5 l7 l9 2| 23 25 27 29 FIGZ.
INVENTOR. GEORGE W. PAPPAS QZ'LJMA A TTOR/VEY PATENTEDuuv 16 197i 3,621 150 sum 2 OF 2 I OUTPUT INVENTOR GEORGE W. PAPPAS A TTOR/VE Y BACKGROUND OF THE INVENTION 1 Field of the Invention This invention relates generally to speech-processing techniques and is more particularly concerned with apparatus for reproducing the speech of a diver in a helium-oxygen atmosphere in intelligible form, restoring to normal sounding a normal recording played at a higher speed and providing compression of a voice signal for transmission over a low-grade transmission line.
2. Description of the Prior Art In the prior art there are a number of techniques commonly employed for speech processing in the areas of improving the intelligibility of speech in helium-oxygen atmospheres, in speed hearing especially for the blind and in data compression for transmission over low bandwidth communication lines.
In the first application, divers, in order to prevent "bends" and nitrogen narcosis, breathe a pressurized mixture of oxygen, nitrogen and helium. The helium gives the divers voices an unnatural, squeaky, Donald-Duck-like quality. As a result, voice communications between divers and people on the surface are seriously impaired.
In one prior art device, this voice communication is solved by feeding the helium speech to amplitude and pitch circuits. In general, this prior art device uses 34 filters to separate the voice signal, each segment is processed separately and then recombined. In the pitch circuits, the frequencies of the 34 lowest harmonics are determined. In the amplitude circuits, the power levels within each of the 34 ISO-Hz. intervals of the speech spectrum are determined. The amplitudes are then shifted and applied to harmonics of the lower frequency. In the modulators, which are coupled to the amplitude and pitch circuits, these power levels control the loudness of the 34 harmonic frequencies thus producing a pattern or envelope closely corresponding to the envelope of-normal speech.
In another prior art device, the speech is divided into two bands of speech frequencies located in the area of maximum speech articulation, from 850 to 5,000 Hz. Each band then goes first through a balanced modulator, then through a high pass filter that accepts the upper sideband only. The filtered signals then go through a balanced demodulator driven by a continuously adjustable oscillator. This oscillator is set to provide the desired amount of shifting for the particular operational conditions. The two bands are then mixed and the reconstructed speech is reproduced.
In the second application, people, especially the blind, are given an ability of speed hearing of recorded speech at word rates comparable to speed reading. A recording is played at usually twice the normal speed which results in a high pitched Donald-Duck-like babble. Apparatus of the prior art and of the present invention remove this babble thereby resulting in an ordinary record being speeded up and reproducing intelligible sound.
One prior art method of accomplishing this speed hearing is known as a harmonic compressor. In the harmonic compressor, speech is fed to a bank of 36 band-pass filters which separates the speech into its different frequency components. The output of the bank of filters is sent, in turn, to 36 frequency dividers which halve the frequencies of the narrow-band signals from the filters. From the dividers, the halved frequency signals go to networks which remove distortion and combine the 36 halved signals into one signal. The frequency components of this signal are then half of the original input values. This harmonically compressed signal is then recorded on magnetic tape. The halved frequencies are then restored to their original values by doubling the playback speed. This prior art device does not operate in real time, i.e., the speech must first be recorded and played back later and is so voluminous that it takes up the space of a 6-foot high cabinet.
In a third application, the voice signal is compressed for transmission over low bandwidth communication lines. The prior art devices mentioned above might be adapted to provide for this application.
The aforesaid prior art apparatus are disadvantaged in that they either require an excessive amount of electronics, a multiplicity of special purpose and accurate components, especially filters, a very large and expensive configuration with an excessive power requirement, or they do not operate in real time. Because of these deficiencies in the prior art apparatus, the applications set forth are not fully realized, for example, a person would be limited to access of a speed hearing device.
SUMMARY AND OBJECTS OF THE INVENTION of the invention is to provide Yet another object of the invention is to provide an electronic embodiment comprising two shift registers into which voice signals, having been digitized, are loaded, and means for switching back and forth between these registers after each loading cycle so that there is no pause in the output signal as one register is loaded.
A further object of the invention is to provide an electromechanical embodiment wherein a magnetic tape loop is moved past a recording head into which the voice signal is applied, which tape loop is played by several playback heads moving in the same direction as the tape, but slower.
The invention accordingly comprises the features of construction, combination of elements and arrangement of parts which will be exemplified in the construction hereinafter set forth and the scope of the invention will be indicated in the claims.
The basis for the apparatus of the invention is based on the fact that normal speech can be chopped or interrupted at certain rates and still retain its intelligibility. This factor is supported in an article of The Journal of the Acoustical Society of America, Volume 22, No. 2, pages I67 to 173, dated Mar. I950, and entitled, The intelligibility of Interrupted Speech. Both the electronic and electromechanical embodiments of the invention take full advantage of this factor.
Briefly, an electronic embodiment of the apparatus of the invention digitizes the voice signal. This voice signal might be that of a diver in a helium-oxygen atmosphere, or might be a recording of nonnal voice played back at higher speed or might be simply a voice signal being transmitted over a communications line. By all electronic means, the digitized voice signal is shifted into a first shift register at a first frequency. When this first register is fully loaded this condition is detected thereupon shifting the digitized voice signal out of the first register at a second frequency. This second frequency is lower that the first frequency. Simultaneously with shifting out of the digitized voice signal from the first register, subsequent and now digitized voice signals are shifted into the second register at the first frequency. When the second register is fully loaded, this condition is detected and thereupon a digitized voice signal is shifted out of the second register at the second frequency. This cycling as described above continues in an altemate manner until the input voice signal is fully processed. This fully processed digital signal is continually restored to an analog signal, which may then be replayed through a speaker.
In an electromechanical embodiment, a magnetic tape loop is moved past a recording head into which the input voice signal is applied. The tape is played back by several playback heads moving in the same direction as the tape, but slower. These playback heads are equally spaced on the circumference of a rotating wheel with each head sequentially making contact with the tape. The tape between the heads during head changeover does not get played, thus segmenting the input signal with ever other segment, i.e., the tape not played, being discarded.
The loss of information from the discarded segments in the embodiments mentioned hereinabove is negligible when the chopping rate, i.e., the segment length is optimized for the characteristics of human speech and hearing. The segments ideally should be long compared with voice pitch waveform periods of to milliseconds and short compared with voice syllabic periods of 100 to 200 milliseconds. For example, a segment period of milliseconds has been used.
Thus, the apparatus of the invention segments the incoming voice signal, discards alternate segments and reads out the segmented signal at a lower rate, thereby keeping the same time base as the input signal. It should be noted that with the first frequency being double that of the second frequency the pitch of the incoming voice signal will be reduced by one-half and that although 50 percent of the input signal has been lost at this point the articulation will still be satisfactory.
BRIEF DESCRIPTION OF THE DRAWINGS The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular description of the preferred embodiments of the invention as illustrated in the accompanying drawings in which:
FIG. 1 illustrates a block diagram of an electronic embodiment of the speech processor of the invention;
FIG. 2 illustrates the timing waveforms at signal points indicated in the block diagram illustrated in FIG. 1;
FIG. 3 illustrates a block diagram of an electromechanical embodiment of the speech processor of the invention; and
FIG. 4 illustrates a modification of the processor shown in FIG. 3.
DESCRIPTION OF THE PREFERRED EMBODIMENTS Now referring to FIG. I there is illustrated a block diagram of an all electronic embodiment of the apparatus for processing the aforementioned signals. An electromechanical embodiment will be discussed later. The signal which is analog in nature is coupled to the input of an analog to digital (A/D) converter 10 which converter digitizes the input signal. The converter 10 might be a simple clipper or limiter thereby producing a chopped signal at the output of converter 10, or might be a delta modulator thereby producing a binary pulse pattern which is representative of the input signal. Converter 10 might also binary code the amplitude of the input signal which embodiment will be discussed hereinafter. The output of converter 10 is connected to the inputs of shift registers l6 and 18, which registers receive their respective clock pulses by way of switch 20 from either A clock 14 or B clock 12. Clock M operates at a higher frequency than clock 12 and although both clocks are shown separately it should be understood that the pulse train out of clock 12 could have been derived from divider means connected to the output of clock 14.
Switch 20, as well as switch 24, might include a well-known combination of logic gates and/or fiip-fiops and switch 20 is initially set so that terminal a is connected to terminal c while terminal I; is connected to terminal a. After switch 20 is initially toggled, a is connected to d and b is connected to 0. Subsequent toggling alternately changes the connections cited above.
The toggle signal may be generated by a counter 22 which generates a pulse each time clock M produces that number of pulses as registers 16 and id have signal storage positions. The toggle signal might also be generated by a monostable multivibrator, not shown, whose time constant approximates the time for loading a register at the higher frequency of clock 14. Counter 22 also toggles switch 24 which is initially set so that terminal a is connected to terminal I). The output of shift register 16 is connected to terminal b of switch 24 while the output of shift register 18 is connected to terminal 0.
The combination of switches 20 and 24 initially allows register 18 to clock in the digitized signal at the clock M frequency while register 16 has its signal stored therein clocked out of terminal a of switch 24 at the clock 12 frequency. Note that usually there is no signal stored in register 16 initially. When counter 22 generates a toggle signal the above condition will change so that now the signal just stored in register 18 will be clocked out of switch 24 at the clock 12 frequency while subsequent digitized signals will be clocked into register 16 at the clock M frequency. This process will continue in the alternate manner noted above.
Further, the signal at terminal a of switch 24 is processed by a digital to analog (D/A) converter 26 and then filtered by means of filter 28 thereby providing the output signal of the apparatus of the invention.
It can be seen then that the signal is always clocked into a register at the higher frequency of clock 14 and clocked out of the registers at the lower frequency of clock l2. It can also be seen that the processed signal is lower in frequency than the digitized signal out of converter 10 and that such processed signal has been chopped thus resulting in some loss of information of the input signal. It can also be seen that two registers are used so that there is no pause in the output as one register is loaded.
In operation and for ease of explanation reference should now be made to FIG. l in combination with the waveforms of FIG. 2. The waveforms are indicated by the letters A to E. Waveform A is a representation of the digitized signal out of A/D converter 10. Waveforms B and C are representative of the clock pulse trains produced by clocks l4 and I2 respectively. As illustrated, waveform B is twice the frequency of waveform C. Waveform D is the toggle signal from counter 22 which signal has been shown for illustration purposes as occurring every sixth pulse of clock 14. Waveform E is the processed signal occurring at the output of switch 24.
Shift registers 16 and 18 each have, in this example, six storage positions. Since counter 22 produces a toggle signal every six pulses of clock 14, it can be seen that each time a register loads up at the higher clock rate it is then switched to output the stored signals at the lower clock rate.
Let us now assume for initial condition that shift registers 16 and 18 are clear of any stored signals; that switch 20 is set so that terminals 0 and c are connected together and terminals b and d are so connected; and that switch 24 is set so that terminal a is connected to terminal 12.
Let us also assume that the digitized input signal, whether it be a recorded voice replayed at higher speed, or a diver breathing in a helium-oxygen atmosphere, etc., is clipped and converted to pulses as indicated in waveform A. Note that this conversion to pulses is not necessary since the clipped waveform could have been directly sampled. At time I, as illustrated FIG. 2, a pulse or binary one is indicated, while at time 2, no pulse or a binary zero is indicated, etc. Waveform A is clocked into shift register 18 at the rate shown by waveform B, and at the same time register 16 is receiving the clock pulses indicated by waveform C. Initially, however, no signals are clocked out of register l6 since no signals had been stored therein.
It should be understood that although no signal is being clocked out of shift register 16, signals from converter 10 as indicated by waveform A will be clocked into register 16 at the lower clock rate indicated by waveform C. It will be observed, however, that any such data clocked into either of the registers at the lower clock rate, i.e., the clock rate of clock 12, will not be used since switch 24 will be connected in such a manner as to discard such clocked in signals.
At time 6 therefore, register 18 will now have stored in it pulses indicated in waveform A as numbers 1 through 6. At this point in time counter 22 will emit a pulse as indicated by waveform D toggling the connections of switch 20 and of switch 24 such that now signals as indicated by wavefonn A beginning with the binary zero at time 7 will be clocked into register 16 at the waveform B rate. At the same time the signals now stored in register 18 will be clocked out of register 18 at the waveform C rate and through the connections of switch 24 into the D/A converter 26 and out via filter 28. The signal emitted from the a terminal of switch 24 is indicated by while clock 12 had wavefonn E. At time 7 a binary one pulse is shown which is identical to the signal as shown in wavefonn A at time 1. At time 9 a binary zero is emitted which binary zero is identical to the binary zero indicated at time 2 in waveform A. in a similar fashion the binary one indicated at time 11 in waveform E is identical to the binary one shown in waveform A at time 3.
Because the output clock rate as shown by this example is one-half the frequency of the input clock rate, counter 22 is ready to emit its next toggle signal. Accordingly, the binary ones or zeros shown in wavefonn A at times 4, 5, and 6 will be discarded such that one-half of the input signal is no longer used. This condition is agreeable with the factor that speech may be chopped at certain rates and still retain its intelligibility.
At time 12 when counter 22 emits its next toggle signal as indicated in waveform D, now stored in register 16 are the pulses of waveform A as indicated at times 7 through 12. Upon toggling, switch 24 will emit these signals as indicated at times 13, and 17 in waveform E. Pulses l3, l5 and 17 of wavefonn E are identical to pulses 7, 8 and 9 respectively of waveform A. The signals of waveform A as indicated at times 10, 11 and 12 are not used. This operation continues alternately until the last signal as indicated by waveform A at time 24 is processed. It can be seen that waveform E now contains the first three signals of every group of six signals originally received and indicated by waveform A. Accordingly, the signals indicated at times 25, 27 and 29 of waveform E are identical to the signals of waveform A at times 19, and 21, respectively.
It should be understood that the length of registers 16 and 18 as well as the toggle frequency have been picked for purposes of explanation only and that such length and frequency may have been any number which would be determined by the application involved. The process as described above compresses the input signal by a factor of two and provides a sampling factor of 50 percent. These ratios may be varied from unity to any factor desired by changing the length of the discarded signals and the frequency of clock 12.
it should be additionally noted that as the frequency of clock 12 is lowered the pitch frequency is lowered. When the frequency of clock 12 is one-half the frequency of clock 14, the pitch has been reduced to one-half. Although 50 percent of the input information has been lost at this point the articulation will still be satisfactory as shown by the reference cited hereinbefore entitled, The intelligibility of Interrupted Speech." The length of the shift registers and the clocking rates are determined by the voice pitch period and audio band pass. Higher frequency clocks can be used for improved frequency response of the audio output but the shift registers will also have to be lengthened. in actual practice shift registers 16 and 18 were built to each include 400 storage positions and counter 22 emitted a toggle signal every 400 clock pulses of clock 14. Clock 14 had a frequency of 20 kilohertz a frequency of 10 kilohertz.
Thus, the processing technique of the invention may be utilized for improving the intelligibility of speech in a helium-oxygen atmosphere, i.e., in that circumstance where a diver is at considerable water depth and is breathing a helium-oxygen mixture. The voice of the diver for example at 200 feet is of high pitch and unintelligible. By chopping this voice and playing it back slower than it was spoken, the pitch of the voice becomes lower, i.e., the pitch is restored to normal, and the speech is now intelligible. In an application of speed hearing, especially for the blind person, a normal recording might for example be played at twice the normal speed. At this higher speed the voice on the recording would be unintelligible. By chopping this voice, the pitch would be lowered but is now intelligible, and the frequency of the voice recording would be doubled thus resulting in information being absorbed at a faster rate.
In a further application of this invention it would be possible to use this processor to reduce the bandwidth of voice before transmitting it over a communications channel. With a ratio in clock frequencies of 2 to 1, a 50 percent reduction in the radiofrequency bandwidth may be possible. Another similar device on the receiving end would be utilized to reconvert the voice to its original pitch. In this application it would not be necessary to use the converter 26 and filter 28, transmission could be in digital form. The receiving device would then incorporate a similar device as shown in the apparatus of the invention, however, the positions of clock 12 and clock 14 would be interchanged. That is to say the incoming processed digital signals from switch 24 of the transmitting device would be clocked into the receiving device at the lower frequency and clocked out at the higher frequency. This higher frequency output signal will then be converted into analog fonn and filtered thereby producing the received output signal. Further the receiving device would not require an A/D converter similar to converter 10 in F IG. 1.
As hereinbefore stated converter 10, illustrated in FIG. 1, might have produced a binary code of the amplitude of the input signal. For example, if the amplitude of the input signal is divided into 8 levels then the converter 10 would have at its output three binary lines corresponding to 2, 2', and 2 and thereby giving eight possible binary indications of the level of the input signal. To provide for these three binary lines at the output of converter 10, shift register 16 would now be replaced by three identical and parallel shift registers. These sets of registers would be coupled as shown in FIG. 1 and the operation would remain substantially identical to that previously discussed. The clock line from terminal 0 of switch 20 would clock the parallel registers replacing shift register 16 and'the clock line from terminal d of switch 20 would clock the parallel registers replacing shift register 18. The D/A converter 26 would recombine the signals as switched by switch 24.
Now referring to FIG. 3, there is illustrated an electromechanical embodiment of the invention. The speech to be processed is applied to input terminal 74. Input terminal 74 is connected to record head 42. Between record head 42 and pressure pad 44 passes a magnetic tape loop 40 which is driven in a clockwise direction by capstan 56 and pinchroller 54. Capstan 56 is driven in a counterclockwise direction, as shown by the arrow, by a suitable motor, not shown. Tape loop 40 passes along the circumference of playback assembly 41 which assembly 41 is driven in a clockwise direction by another suitable motor, not shown.
Playback assembly 41 includes in this illustration two playback heads 62 and 64 spaced 180 apart along the circumference. The two playback heads are coupled to the output terminal 72 by means of their slip rings 68 and brush 66. The output 76 of brush 66 is coupled to a playback amplifier 70, whose output is coupled to output terminal 72. The tape loop 40 is guided along the circumference of playback assembly 41 by means of roller bearings 60, which are utilized to reduce friction between the tape loop 40 and assembly 41.
After the tape loop 40 passes assembly 41 and capstan 56, it
passes between erase head 46 and pressure pad 48. The loop 40 is held tight by means of rollers 50 and 52. Roller 52 has coupled to it a spring mechanism 58 for increased tension of the tape loop 40.
Playback assembly 41 might have included more than two playback heads spaced equally along the circumference. In any such arrangement as well as the two playback head arrangements shown, only one playback head is coupled to the output terminal 72 at a time. As illustrated, the tape behind playback head 64 to just in front of playback head 62 will not get played back thereby resulting in segmenting of the speech and discarding every other segment of the speech which is fed into record head 42. If only one playback head were used, the basic concept of the invention could still be achieved, however, the playback would not be continuous.
it should be understood that the block diagram of FIG. 3, is a simple schematic representation of the invention and does not include all the mechanical and electrical details but is sufficient to describe the invention. These details may be accomplished in any convenient manner.
The operation of the embodiment of FIG. 3 is as follows. The closed magnetic tape loop 40 moves past the record head 42 at a fixed speed determined by the capstan 56. A short distance before the tape loop 40 passes the record head 42 it passes an erase head 46 where all previous signals are erased. The signal to be processed is applied to the record head 42 through the input terminal 74. The recorded tape then passes over a playback housing M in the shape of a wheel which housing includes two or more playback heads 62 and 64 equally spaced along the circumference of the wheel. The operation will be described with two heads but more than two may be desirable. For instance, the less the circumferential distance between heads the slower need be the speed of the tape and wheel for a given quality reproduction. Only one head at a time can be played back. This can be accomplished by limiting the length of are that the tape makes contact with the wheel or by the brush and slip ring arrangement. Both methods are shown for the two head arrangement. It should be noted that for a given quality reproduction the erase head 46 should be employed, but that nevertheless the system would still operate satisfactorily without the erase head so long as the record head is energized.
With the housing 41 or (wheel) stationary, one head, will be in contact with the tape 40 (or if more than one, only one will be coupled with the electronics through its slip ring). The playback channel will play back exactly what was recorded but delayed by a fraction of a second. If the wheel is moved slowly in the same direction as the tape the relative speed of the tape across the playback head will become less than it was when the wheel was stationary. This lowers the pitch of the speech. After the head, which was making contact with the tape, rotates 180 it loses contact with the tape and the other head starts making contact. The tape between the two heads during head changeover does not get played, since it is in front of the head just going on the tape and in back of the head which has left the tape. The head never makes contact with any tape in front of it, since the tape is moving faster than the wheel.
The speed of the wheel is variable from rpm. to an rpm. equal to the tape speed. With the speed 0 there is no pitch reduction; as the speed increases to one-half the speed of the tape, the pitch decreases by one-half. At this point the size of the tape segments played back become equal to the segments not played back. As the speed of the wheel is increased further the pitch is lowered further and smaller segments are played back relative to those not played. With a speed control for the wheel the pitch frequency will be adjustable.
After the tape loop -30 passes over the wheel, it passes by the capstan 56 which drives the tape at a constant speed. It then passes the erase head 46 and guide rollers 50 and 52 and returns to the record head.
A spring is shown on one of the guide rollers to hold the tape loop tight against the playback heads. It also simplifies the replacement of the tape loop. If additional pressure is needed to hold the tape in contact with the playback heads a semicircular shoe could press against the tape. Another suggestion for holding the tape tight against the heads is shown in F IG. 4.
In FIG. 45, playback housing 41 is illustrated with a flexible tape pressure means generally indicated at 92. Playback housing til now includes a cylinder 80 which has coated on its outer surface a friction-reducing substance such as that sold under the trademark TEFLON." This essentially frictionless cylinder 80 replaces the roller bearings 60 of FIG. 3. Tape loop 40 passes around and is pressed against cylinder 80 by means of flexible tape pressure means 92.
Pressure means 92 includes flexible pressure tape 90 which is held tightly against recording tape 40 as it passes over the wheel or cylinder 80. Pressure tape 90 is held in position by means of guide rollers 82 and 84 and is kept tight by means of roller 86 and spring tension means 88. The pressure tape 90 is free to move around the rollers 82, 84 and 86 thus reducing the drag on the recording tape 40 as it passes over the cylinder 80. If it is required to reduce the drag further, a capstan, not
shown, could be used to drive the flexible pressure tape at the same speed as the recording tape 40.
This embodiment of the invention might be utilized in a communication system in which case the receiving end would include apparatus as shown in FIG. 3. However, since the incoming speech is now lower in pitch than normal, the speed of revolution of the playback housing 41 would be greater than the speed of the tape loop 40.
It has thus been seen that the on electromechanical embodiment of the invention as illustrated in FIG. 3 segments, discards every other segment, and combines the remaining segments by means of a tape loop which is fed by the speech to be processed and which is driven at a constant speed past a playback housing which for best results includes at least two playback heads spaced substantially apart, the playback housing rotating at some adjustable slower speed so as to reduce the tape loop to playback head speed.
Other methods of reducing the effective playback head to tape speed involve oscillating heads or dual capstan drives.
It will thus be seen that the objects set forth above among those made apparent from the preceding description, are efficiently attained, and since certain changes may be made in the above construction without departing from the scope of the invention, it is intended that all matter contained in the above description or shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.
Having described the invention, what is claimed as new and secured by Letters Patent is:
1. Apparatus for processing speech signals, comprising:
A. an input terminal for receiving said speech signals;
B. an analog-to-digital converter coupled to said input terminal;
C. first and second storage means coupled to said analog-todigital converter;
D. means for alternately reading said signals into said first and second storage means at a first rate;
E. an output terminal;
F. means for alternately reading out said first and second storage means at a second rate, said signals being read out at said second rate being coupled to said output terminal; and
G. a digital-to-analog converter coupled to the output of said storage means.
2. Apparatus as defined in claim 1, wherein said speech signals are that of a diver speaking in a helium-oxygen atmosphere.
3. Apparatus as defined in claim I wherein said speech signals are that of a normal voice recording played back at a higher speed than the original recording speed.
4. Apparatus as defined in claim I wherein said first and second storage means comprises first and second shift registers.
5. Apparatus as defined in claim 4 wherein said readin and readout means includes:
A. clock means for generating pulses at a first frequency and for generating pulses at a second frequency, said second frequency being lower in frequency than said first frequency; and
B. first switch means for coupling said pulses of said first frequency to the clock input of said first shift register whereby signals are loaded into said first shift register at said first frequency and for simultaneously coupling said pulses of said second frequency to the clock input of said second shift register whereby signals are unloaded from said second shift register at said second frequency.
6. Apparatus as defined in claim 5 wherein said readin and readout means further include:
A. means for sensing the loading of a predetermined amount of signal into said first shift register;
B. means connecting said sensing means to said first switch means for toggling said first switch means upon said sensing such that said pulses of said first frequency are coupled to the clock input of said second shift register whereby signals are loaded into said second shift register at said first frequency and such that said pulses of said second frequency are coupled to the clock input of said first shift register whereby signals are unloaded from said first shift register at said second frequency; and
C. second switch means for coupling the register being unloaded to said output terminal.
7. Apparatus as defined in claim 6 wherein said second switch means is toggled by said sensing means.
8. Apparatus as defined in claim 6 wherein said second frequency of said clock means is variable.
9. Apparatus as defined in claim 6 wherein said analog-todigital converter includes a clipping network.
10. Apparatus as defined in claim 6 wherein said analog-todigital converter produces a binary pulse pattern of the speech signals on n output lines.
11. Apparatus as defined in claim 6, further including:
a filter network coupled to receive a signal from said digitalto-analog converter.
12. Apparatus as defined in claim 6 wherein said analog-todigital converter includes a delta modulator.
13. Apparatus as defined in claim 12 wherein said first and second shift registers are each comprised of n parallel connected shift registers, each of which n parallel shift registers is coupled to a corresponding line of said n output lines.
14. Apparatus for processing speech signals, comprising:
A. an input terminal for receiving said speech signals;
B. means for converting said speech signals into digital signals;
C. a pair of shift registers coupled to receive said digital signals;
D. means for alternately clocking said digital signals into one of said shift registers at a first frequency;
E. means for alternately clocking said digital signals out of the other of said shift registers at a second frequency which is lower than said first frequency; and
F. means for converting said digital signals clocked out of said shift registers at said second frequency to analog form.
15. Signal-processing apparatus having a processor output terminal, said apparatus comprising:
A. analog-to-digital conversion means; B. means for generating first clock pulses at a first frequency; C. means for generating second clock pulses at a second frequency, said second frequency being lower in frequency than said first frequency; D. first and second shift register means 1. each having a signal input and output terminal, 2. each coupled to said conversion means at said input terminal,
3. each having m storage positions, and
4. each having a clock input for shifting the signal into said signal input tenninal through each of Said m storage positions, and out of said signal output terminal;
E. means for sensing the occurrence of m of said first clock pulses;
F. means for alternately coupling said first clock pulses to said clock input of said first register means and then to said clock input of said second register means each time signal said sensing means senses the occurrence of m of said first clock pulses;
G. means for alternately coupling said second clock pulses to said clock input of one of said register means which is not connected to said first clock pulses;
H. means for coupling said signal output tenninal of one of said register means which has its clock input coupled to said second clock pulses, to said processor output terminal, and
l. digital-to-analog conversion means coupling said shift register means to said processor output terminal whereby the signal is loaded into said register means at a higher rate t an the signal 18 unloaded rom said register means and whereby the signal at said processor output terminal is proportional to the ratio of frequencies of the second clock pulses over the first clock pulses times that amount of signal in the signal from the conversion means.
16. Apparatus as defined in claim 15 wherein said signal is that of a diver submerged in water whereby said signal is made intelligible.
17. Apparatus as defined in claim 15 wherein said signal is that of a normal voice recording played back at higher speed thereby resulting in an intelligible, speed hearing playback.
18. Apparatus as defined in claim 15 wherein A. said analog-to-digital converter produces a binary pulse pattern of said signal on n output lines; and
B. said first and second shift register means are each comprised of n parallel connected shift registers, each of which n parallel shift registers is coupled to a corresponding line of said n output lines.
19. Apparatus as defined in claim 18 wherein said means for converting a digital signal to an analog signal includes a filter network for further restoring said signal to analog form.
20. A method for processing voice signals comprising the steps of:
A. converting said speech signals into digital signals;
B. segmenting said digital signals;
C. discarding every other segment of said segmented signals;
D. combining the remaining segments of said segmented signals; and
E. converting said digital signals to an analog signal.
21. A method for processing signals comprising the steps of:
A. converting said signals into digital form;
B. shifting said digital signals into a first register at a first frequency;
C. detecting a cycle of said shifting whereby said first register is fully loaded;
D. shifting, upon said detecting of said fully loaded first register, said digital signals out of said first register at a second frequency, said second frequency being lower than said first frequency, and simultaneously shifting said digital signals into a second register at said first frequeny;
E. detecting a cycle of said shifting whereby said second register is fully loaded;
F. shifting, upon said detecting of said fully loaded second register, said digital signals out of said second register at said second frequency, and
G. restoring said digital signals shifted out of said registers to analog signals.
a a a a

Claims (24)

1. Apparatus for processing speech signals, comprising: A. an input terminal for receiving said speech signals; B. an analog-to-digital converter coupled to said input terminal; C. first and second storage means coupled to said analog-todigital converter; D. means for alternately reading said signals into said first and second storage means at a first rate; E. an output terminal; F. means for alternately reading out said first and second storage means at a second rate, said signals being read out at said second rate being coupled to said output terminal; and G. a digital-to-analog converter coupled to the output of said storage means.
2. Apparatus as defined in claim 1 wherein said speech signals are that of a diver speaking in a helium-oxygen atmosphere.
2. each coupled to said conversion means at said signal input terminal,
3. each having m storage positions, and
3. Apparatus as defined in claim 1 wherein said speech signals are that of a normal voice recording played back at a higher speed than the original recording speed.
4. Apparatus as defined in claim 1 wherein said first and second storage means comprises first and second shift registers.
4. each having a clock input for shifting the signal into said signal input terminal through each of said m storage positions, and out of said signal output terminal; E. means for sensing the occurrence of m of said first clock pulses; F. means for alternately coupling said first clock pulses to said clock input of said first register means and then to said clock input of said second register means each time said sensing means senses the occurrence of m of said first clock pulses; G. means for alternately coupling said second clock pulses to said clock input of one of said register means which is not connected to said first clock pulses; H. means for coupling said signal output terminal of one of said register means which has its clock input coupled to said second clock pulses, to said processor output terminal, and I. digital-to-analog conversion means coupling said shift register means to said processor output terminal whereby the signal is loaded into said register means at a higher rate than the signal is unloaded from said register means and whereby the signal at said processor output terminal is proportional to the ratio of frequencies of the second clock pulses over the first clock pulses times that amount of signal in the signal from the conversion means.
5. Apparatus as defined in claim 4 wherein said readin and readout means includes: A. clock means for generating pulses at a first frequency and for generating pulses at a second frequency, said second frequency being lower in frequency than said first frequency; and B. first switch means for coupling said pulses of said first frequency to the clock input of said first shift register whereby signals are loaded into said first shift register at said first frequency and for simultaneously coupling said pulses of said second frequency to the clock input of said second shift register whereby signals are unloaded from said second shift register at said second frequency.
6. Apparatus as defined in claim 5 wherein said readin and readout means further include: A. means for sensing the loading of a predetermined amount of signal into said first shift register; B. means connecting said sensing means to said first switch means for toggling said first switch means upon said sensing such that said pulses of said first frequency are coupled to the clock input of said second shift register whereby signals are loaded into said second shift register at said first frequency and such that said pulses of said second frequency are coupled to the clock input of said first shift register whereby signals are unloaded from said first shift register at said second frequency; and C. second switch means for coupling the register being unloaded to said output terminal.
7. Apparatus as defined in claim 6 wherein said second switch means is toggled by said sensing means.
8. Apparatus as defined in claim 6 wherein said second frequency of said clock means is variable.
9. Apparatus as defined in claim 6 wherein said analog-to-digital converter includes a clipping network.
10. Apparatus as defined in claim 6 wherein said analog-to-digital converter produces a binary pulse pattern of the speech signals on n output lines.
11. Apparatus as defined in claim 6, further including: a filter network coupled to receive a signal from said digital-to-analog converter.
12. Apparatus as defined in claim 6 wherein said analog-to-digital converter includes a delta modulator.
13. Apparatus as defined in claim 12 wherein said first and second shift registers are each comprised of n parallel connected shift registers, each of which n parallel shift registers is coupled to a corresponding line of said n output lines.
14. Apparatus for processing speech signals, comprising: A. an input terminal for receiving said speech signals; B. means for converting said speech signals into digital signals; C. a pair of shift registers coupled to receive said digital signals; D. means for alternately clocking said digital signals into one of said shift registers at a first frequency; E. means for alternately clocking said digital signals out of the other of said shift registers at a second frequency which is lower than said first frequency; and F. means for converting said digital signals clocked out of said shift registers at said second frequency to analog form.
15. Signal-processing apparatus having a processor output terminal, said apparatus comprising: A. analog-to-digital conversion means; B. means for generating first clock pulses at a first frequency; C. means for generating second clock pulses at a second frequency, said second frequency being lower in frequency than said first frequency; D. first and second shift register means
16. Apparatus as defined in claim 15 wherein said signal is that of a diver submerged in water whereby said signal is made intelligible.
17. Apparatus as defined in claim 15 wherein sAid signal is that of a normal voice recording played back at higher speed thereby resulting in an intelligible, speed hearing playback.
18. Apparatus as defined in claim 15 wherein A. said analog-to-digital converter produces a binary pulse pattern of said signal on n output lines; and B. said first and second shift register means are each comprised of n parallel connected shift registers, each of which n parallel shift registers is coupled to a corresponding line of said n output lines.
19. Apparatus as defined in claim 18 wherein said means for converting a digital signal to an analog signal includes a filter network for further restoring said signal to analog form.
20. A method for processing voice signals comprising the steps of: A. converting said speech signals into digital signals; B. segmenting said digital signals; C. discarding every other segment of said segmented signals; D. combining the remaining segments of said segmented signals; and E. converting said digital signals to an analog signal.
21. A method for processing signals comprising the steps of: A. converting said signals into digital form; B. shifting said digital signals into a first register at a first frequency; C. detecting a cycle of said shifting whereby said first register is fully loaded; D. shifting, upon said detecting of said fully loaded first register, said digital signals out of said first register at a second frequency, said second frequency being lower than said first frequency, and simultaneously shifting said digital signals into a second register at said first frequency; E. detecting a cycle of said shifting whereby said second register is fully loaded; F. shifting, upon said detecting of said fully loaded second register, said digital signals out of said second register at said second frequency, and G. restoring said digital signals shifted out of said registers to analog signals.
US858623A 1969-09-17 1969-09-17 Speech processor for changing voice pitch Expired - Lifetime US3621150A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US85862369A 1969-09-17 1969-09-17

Publications (1)

Publication Number Publication Date
US3621150A true US3621150A (en) 1971-11-16

Family

ID=25328750

Family Applications (1)

Application Number Title Priority Date Filing Date
US858623A Expired - Lifetime US3621150A (en) 1969-09-17 1969-09-17 Speech processor for changing voice pitch

Country Status (1)

Country Link
US (1) US3621150A (en)

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3715509A (en) * 1971-02-18 1973-02-06 Us Air Force Method and means for providing resolution level selection in a spectrum analyzer
JPS4835703A (en) * 1971-08-13 1973-05-26
US3750024A (en) * 1971-06-16 1973-07-31 Itt Corp Nutley Narrow band digital speech communication system
DE2306527A1 (en) * 1972-02-10 1973-08-16 Matsushita Electric Ind Co Ltd SAMPLING MODULATION SYSTEM FOR AN ELECTRONIC MUSICAL INSTRUMENT
US3772679A (en) * 1970-06-15 1973-11-13 Honeywell Inc Precessing analog trace display
US3784754A (en) * 1971-02-23 1974-01-08 I Hagiwara Apparatus and method for transmitting and receiving signals based upon half cycles
US3786201A (en) * 1972-02-04 1974-01-15 J Myers Audio-digital recording system
US3798372A (en) * 1972-05-12 1974-03-19 D Griggs Apparatus and method for retardation of recorded speech
US3806644A (en) * 1973-02-23 1974-04-23 Bell Telephone Labor Inc Video format converter
US3809805A (en) * 1972-08-25 1974-05-07 Arvin Ind Inc Video bandwidth reduction
US3828361A (en) * 1971-08-13 1974-08-06 Cambridge Res & Dev Group Speech compressor-expander
US3828134A (en) * 1973-07-09 1974-08-06 Us Navy Device for changing pitch of an audio signal to improve intelligibility
US3830971A (en) * 1971-09-04 1974-08-20 Philips Corp Line standard converter for converting a television signal having a number of n-lines per image into a television signal having a number of m-lines per image
US3846827A (en) * 1973-02-12 1974-11-05 Cambridge Res & Dev Group Speech compressor-expander with signal sample zero reset
US3855424A (en) * 1972-02-15 1974-12-17 Philips Corp Information processor for changing tempo of playback from the recorded tempo
US3860760A (en) * 1973-06-06 1975-01-14 Us Army Electronic time compressor or expander
US3863026A (en) * 1973-08-15 1975-01-28 Us Navy Helium speech decoder
US3869708A (en) * 1971-08-13 1975-03-04 Cambridge Res & Dev Group Speech compressor with gap filling
US3873778A (en) * 1972-06-07 1975-03-25 Sony Corp Signal processing system
US3924069A (en) * 1973-08-15 1975-12-02 Us Navy Helium speech decoder
US3936610A (en) * 1971-08-13 1976-02-03 Cambridge Research And Development Group Dual delay line storage sound signal processor
US3941982A (en) * 1974-07-22 1976-03-02 Particle Measuring Systems, Inc. Method and apparatus for two-dimensional data acquisition
US3949175A (en) * 1973-09-28 1976-04-06 Hitachi, Ltd. Audio signal time-duration converter
US3997913A (en) * 1975-01-09 1976-12-14 The United States Of America As Represented By The Secretary Of The Army Electronic time compressor/expander utilizing magnetic storage
US4228322A (en) * 1979-01-02 1980-10-14 International Business Machines Corporation Decreasing time duration of recorded speech
US4365115A (en) * 1979-04-28 1982-12-21 Kanbayashi Seisakujo Company, Ltd. Time-axis compression-expansion devices for sound signals
USRE31614E (en) * 1979-01-02 1984-06-26 International Business Machines Corporation Decreasing time duration of recorded speech
US4464784A (en) * 1981-04-30 1984-08-07 Eventide Clockworks, Inc. Pitch changer with glitch minimizer
FR2543724A1 (en) * 1983-03-30 1984-10-05 Electricite De France Method and device for recorded audio signal processing
US4800378A (en) * 1985-08-23 1989-01-24 Snap-On Tools Corporation Digital engine analyzer
WO1996012270A1 (en) * 1994-10-12 1996-04-25 Pixel Instruments Time compression/expansion without pitch change
US20020184024A1 (en) * 2001-03-22 2002-12-05 Rorex Phillip G. Speech recognition for recognizing speaker-independent, continuous speech
US7916293B2 (en) 2007-12-04 2011-03-29 Particle Measuring Systems, Inc. Non-orthogonal particle detection systems and methods

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2352023A (en) * 1938-08-26 1944-06-20 Schuller Eduard Sound reproducer
US2886650A (en) * 1952-10-22 1959-05-12 Univ Illinois Recording device
US2903521A (en) * 1955-09-12 1959-09-08 John V Ellison Band expansion recorder
US2909616A (en) * 1953-06-15 1959-10-20 Marty Rene Magnetic talking machine
US2997697A (en) * 1959-07-08 1961-08-22 American Telephone & Telegraph Data transmission system
US3458642A (en) * 1966-02-01 1969-07-29 Donald J Leslie Electronic organ utilizing proportional frequency changes for harmonic structure
US3480737A (en) * 1966-03-08 1969-11-25 Cambridge Res & Dev Group Apparatus for reducing time duration of signal reproduction
US3485960A (en) * 1966-09-29 1969-12-23 Ocean Systems Rotary magnetic head record and play-back system for rendering unintelligible speech intelligible

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2352023A (en) * 1938-08-26 1944-06-20 Schuller Eduard Sound reproducer
US2886650A (en) * 1952-10-22 1959-05-12 Univ Illinois Recording device
US2909616A (en) * 1953-06-15 1959-10-20 Marty Rene Magnetic talking machine
US2903521A (en) * 1955-09-12 1959-09-08 John V Ellison Band expansion recorder
US2997697A (en) * 1959-07-08 1961-08-22 American Telephone & Telegraph Data transmission system
US3458642A (en) * 1966-02-01 1969-07-29 Donald J Leslie Electronic organ utilizing proportional frequency changes for harmonic structure
US3480737A (en) * 1966-03-08 1969-11-25 Cambridge Res & Dev Group Apparatus for reducing time duration of signal reproduction
US3485960A (en) * 1966-09-29 1969-12-23 Ocean Systems Rotary magnetic head record and play-back system for rendering unintelligible speech intelligible

Cited By (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3772679A (en) * 1970-06-15 1973-11-13 Honeywell Inc Precessing analog trace display
US3715509A (en) * 1971-02-18 1973-02-06 Us Air Force Method and means for providing resolution level selection in a spectrum analyzer
US3784754A (en) * 1971-02-23 1974-01-08 I Hagiwara Apparatus and method for transmitting and receiving signals based upon half cycles
US3750024A (en) * 1971-06-16 1973-07-31 Itt Corp Nutley Narrow band digital speech communication system
US3828361A (en) * 1971-08-13 1974-08-06 Cambridge Res & Dev Group Speech compressor-expander
US3869708A (en) * 1971-08-13 1975-03-04 Cambridge Res & Dev Group Speech compressor with gap filling
JPS551596B2 (en) * 1971-08-13 1980-01-16
JPS4835703A (en) * 1971-08-13 1973-05-26
US3936610A (en) * 1971-08-13 1976-02-03 Cambridge Research And Development Group Dual delay line storage sound signal processor
US3830971A (en) * 1971-09-04 1974-08-20 Philips Corp Line standard converter for converting a television signal having a number of n-lines per image into a television signal having a number of m-lines per image
US3786201A (en) * 1972-02-04 1974-01-15 J Myers Audio-digital recording system
DE2306527A1 (en) * 1972-02-10 1973-08-16 Matsushita Electric Ind Co Ltd SAMPLING MODULATION SYSTEM FOR AN ELECTRONIC MUSICAL INSTRUMENT
US3855424A (en) * 1972-02-15 1974-12-17 Philips Corp Information processor for changing tempo of playback from the recorded tempo
US3798372A (en) * 1972-05-12 1974-03-19 D Griggs Apparatus and method for retardation of recorded speech
US3873778A (en) * 1972-06-07 1975-03-25 Sony Corp Signal processing system
US3809805A (en) * 1972-08-25 1974-05-07 Arvin Ind Inc Video bandwidth reduction
US3846827A (en) * 1973-02-12 1974-11-05 Cambridge Res & Dev Group Speech compressor-expander with signal sample zero reset
US3806644A (en) * 1973-02-23 1974-04-23 Bell Telephone Labor Inc Video format converter
US3860760A (en) * 1973-06-06 1975-01-14 Us Army Electronic time compressor or expander
US3828134A (en) * 1973-07-09 1974-08-06 Us Navy Device for changing pitch of an audio signal to improve intelligibility
US3863026A (en) * 1973-08-15 1975-01-28 Us Navy Helium speech decoder
US3924069A (en) * 1973-08-15 1975-12-02 Us Navy Helium speech decoder
US3949175A (en) * 1973-09-28 1976-04-06 Hitachi, Ltd. Audio signal time-duration converter
US3941982A (en) * 1974-07-22 1976-03-02 Particle Measuring Systems, Inc. Method and apparatus for two-dimensional data acquisition
US3997913A (en) * 1975-01-09 1976-12-14 The United States Of America As Represented By The Secretary Of The Army Electronic time compressor/expander utilizing magnetic storage
US4228322A (en) * 1979-01-02 1980-10-14 International Business Machines Corporation Decreasing time duration of recorded speech
USRE31614E (en) * 1979-01-02 1984-06-26 International Business Machines Corporation Decreasing time duration of recorded speech
US4365115A (en) * 1979-04-28 1982-12-21 Kanbayashi Seisakujo Company, Ltd. Time-axis compression-expansion devices for sound signals
US4464784A (en) * 1981-04-30 1984-08-07 Eventide Clockworks, Inc. Pitch changer with glitch minimizer
FR2543724A1 (en) * 1983-03-30 1984-10-05 Electricite De France Method and device for recorded audio signal processing
US4800378A (en) * 1985-08-23 1989-01-24 Snap-On Tools Corporation Digital engine analyzer
US20060015348A1 (en) * 1994-10-12 2006-01-19 Pixel Instruments Corp. Television program transmission, storage and recovery with audio and video synchronization
US8185929B2 (en) 1994-10-12 2012-05-22 Cooper J Carl Program viewing apparatus and method
US20050039219A1 (en) * 1994-10-12 2005-02-17 Pixel Instruments Program viewing apparatus and method
US20050240962A1 (en) * 1994-10-12 2005-10-27 Pixel Instruments Corp. Program viewing apparatus and method
WO1996012270A1 (en) * 1994-10-12 1996-04-25 Pixel Instruments Time compression/expansion without pitch change
US20100247065A1 (en) * 1994-10-12 2010-09-30 Pixel Instruments Corporation Program viewing apparatus and method
US9723357B2 (en) 1994-10-12 2017-08-01 J. Carl Cooper Program viewing apparatus and method
US8769601B2 (en) 1994-10-12 2014-07-01 J. Carl Cooper Program viewing apparatus and method
US8428427B2 (en) 1994-10-12 2013-04-23 J. Carl Cooper Television program transmission, storage and recovery with audio and video synchronization
US7089184B2 (en) * 2001-03-22 2006-08-08 Nurv Center Technologies, Inc. Speech recognition for recognizing speaker-independent, continuous speech
US20020184024A1 (en) * 2001-03-22 2002-12-05 Rorex Phillip G. Speech recognition for recognizing speaker-independent, continuous speech
US7916293B2 (en) 2007-12-04 2011-03-29 Particle Measuring Systems, Inc. Non-orthogonal particle detection systems and methods
US8427642B2 (en) 2007-12-04 2013-04-23 Particle Measuring Systems, Inc. Two-dimensional optical imaging methods and systems for particle detection
US8174697B2 (en) 2007-12-04 2012-05-08 Particle Measuring Systems, Inc. Non-orthogonal particle detection systems and methods
US8154724B2 (en) 2007-12-04 2012-04-10 Particle Measuring Systems, Inc. Two-dimensional optical imaging methods and systems for particle detection
US8027035B2 (en) 2007-12-04 2011-09-27 Particle Measuring Systems, Inc. Non-orthogonal particle detection systems and methods

Similar Documents

Publication Publication Date Title
US3621150A (en) Speech processor for changing voice pitch
US3681756A (en) System for frequency modification of speech and other audio signals
Lee Time compression and expansion of speech by the sampling method
US4121058A (en) Voice processor
US5787334A (en) Method and apparatus for automatically identifying a program including a sound signal
US3662115A (en) Audio response apparatus using partial autocorrelation techniques
US3406344A (en) Transmission of low frequency signals by modulation of voice carrier
EP0118771B1 (en) Compression and expansion of digitized voice signals
Goodall Telephony by pulse code modulation
DE3269500D1 (en) Method of, and apparatus for recording and/or reproducing an information signal
US3786195A (en) Variable delay line signal processor for sound reproduction
KR940016074A (en) Digital Signal Processing Apparatus, Method and Recording Media
US6009386A (en) Speech playback speed change using wavelet coding, preferably sub-band coding
US3936610A (en) Dual delay line storage sound signal processor
US3499996A (en) Bandwidth compressor and expander
JPS6271001A (en) Intermittent magnetic recording and reproducing device
US3659052A (en) Multiplex terminal with redundancy reduction
EP0117276B1 (en) Privacy communication apparatus
US3462555A (en) Reduction of distortion in speech signal time compression systems
US2928902A (en) Signal transmission
US3555203A (en) Method of and apparatus for time adjustment of information
US6490109B1 (en) Data storage device
US3485960A (en) Rotary magnetic head record and play-back system for rendering unintelligible speech intelligible
JPS5774809A (en) Voice signal processing device
US3838218A (en) Bifrequency controlled analog shift register speech processor