WO2006082764A1 - Method and system for controlling a vehicle using voice commands - Google Patents

Method and system for controlling a vehicle using voice commands Download PDF

Info

Publication number
WO2006082764A1
WO2006082764A1 PCT/JP2006/301375 JP2006301375W WO2006082764A1 WO 2006082764 A1 WO2006082764 A1 WO 2006082764A1 JP 2006301375 W JP2006301375 W JP 2006301375W WO 2006082764 A1 WO2006082764 A1 WO 2006082764A1
Authority
WO
WIPO (PCT)
Prior art keywords
command
driving
vehicle
voice commands
controlling
Prior art date
Application number
PCT/JP2006/301375
Other languages
French (fr)
Inventor
Yu-Han Chiu
Chia-Shin Yen
Chien-Ming Wu
Che-Ming Lin
Original Assignee
Matsushita Electric Industrial Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co., Ltd. filed Critical Matsushita Electric Industrial Co., Ltd.
Publication of WO2006082764A1 publication Critical patent/WO2006082764A1/en

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60RVEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
    • B60R16/00Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for
    • B60R16/02Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements
    • B60R16/037Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements for occupant comfort, e.g. for automatic adjustment of appliances according to personal settings, e.g. seats, mirrors, steering wheel
    • B60R16/0373Voice control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02087Noise filtering the noise being separate speech, e.g. cocktail party
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Definitions

  • the invention relates to a method and system for controlling a vehicle, more particularly to a method and system for controlling a vehicle using voice commands.
  • Japanese Patent Publication No. JP041 19400 discloses a speech recognition device for a vehicle, which includes a plurality of microphones 1 1 , a phase shift unit 13, an adder unit 14, a maximum amplitude detection unit 15, and a speech recognition unit 16.
  • the microphones 1 1 are separately disposed in the vehicle, and are used to collect speech sounds coming from a certain seat.
  • the phase shift unit 13 shifts the phase of an electrical signal obtained by each microphone 11 with a corresponding phase shifting amount according to the location of the respective microphone 1 1.
  • the adder unit 14 is used to add a corresponding output signal transmitted from the phase shift unit 13 to each of the source electrical signals obtained by the microphones 11.
  • the maximum amplitude detection unit 15 is used to detect signals having the maximum amplitude from the output signals of the adder unit 14, treats the detected signals as voice commands from the aforesaid seat, and outputs the detected signals to the speech recognition unit 16 for recognition purposes.
  • the aforesaid conventional speech recognition device indeed can identify a voice command coming from a certain seat for controlling the vehicle if only one person (e.g., the driver or a passenger) in the vehicle speaks. However, if several people in different seats produce speech sounds at the same time, the microphones 1 1 will receive mixed voice commands, and the aforesaid conventional speech recognition device is unable to process such mixed voice commands. Therefore, there is a need for a method and system for controlling a vehicle using voice commands that can cope with a scenario in which several speakers in different seats in the vehicle produce speech sounds at the same time.
  • the main object of the present invention is to provide a method for controlling a vehicle using voice commands, which can be employed to separate mixed voice commands. Then, if a voice command thus separated is related to driving, the method of the present invention can be further employed to determine whether such a driving-purpose command was indeed issued by the driver of the vehicle.
  • a method for controlling a vehicle using voice commands comprises the following steps. Initially, a blind source separator is used to separate sounds collected by a plurality of microphones into a plurality of sound sources. Then, according to predetermined voice command data in a command database, a voice command is identified from sound source signals sent from the blind source separator, and a determination is made as to whether the voice command is a driving-purpose command. Next, if the voice command is a driving-purpose command, an issuing direction of the driving-purpose command is calculated according to position-related information of the driving-purpose command. Subsequently, according to the issuing direction of the driving- purpose command thus calculated, a determination is made as to whether the driving-purpose command was issued by a driver. Thereafter, upon determining that the driving-purpose command was issued by the driver, a controlled device in the vehicle to which the driving-purpose command corresponds is controlled accordingly.
  • another object of the present invention is to provide a system for controlling a vehicle using voice commands, which can be employed to separate mixed voice commands. Then, if a voice command thus separated is related to driving, the system of the present invention can be further employed to determine whether such a driving-purpose command was indeed issued by the driver of the vehicle.
  • a system for controlling a vehicle using voice commands of this invention can separate a plurality of sound sources collected by a plurality of microphones into a plurality of voice commands, and subsequently utilize the voice commands to control a plurality of controlled devices in the vehicle.
  • the system includes a blind source separator, a command database, a speech recognizer, a direction calculator, and a controller.
  • the blind source separator is used to separate sounds collected by the microphones into the sound sources.
  • the speech recognizer is used to receive sound source signals sent from the blind source separator, and to identify the voice commands from the sound source signals according to predetermined voice command data in the command database.
  • the direction calculator calculates an issuing direction of a voice command coming from each sound source according to the voice commands identified by the speech recognizer and position-related information of the sound sources sent from the blind source separator.
  • the controller determines whether or not to control the corresponding controlled devices according to the voice commands identified by the speech recognizer and the issuing directions of the voice commands as calculated by the direction calculator.
  • Figure 1 is a block diagram illustrating a conventional speech recognition device
  • Figure 2 is a system block diagram showing a preferred embodiment of a system for controlling a vehicle using voice commands according to the present invention
  • Figure 3 is a flowchart illustrating a method for controlling a vehicle using voice commands according to the present invention
  • Figure 4 is a schematic diagram showing the positions of microphones in the preferred embodiment of the present invention, as well as the relative relationship of the microphones with sounds emitted by the driver and a passenger
  • Figure 5 is a flowchart illustrating a preferred embodiment of a method for controlling a vehicle using voice commands according to the present invention.
  • Figure 6 is a schematic view illustrating use of a hyperbolic equation to derive an issuing direction of a driving-purpose command.
  • the preferred embodiment of a system for controlling a vehicle using voice commands is capable of separating a plurality of sound sources collected by a plurality of microphones (e.g., two microphones, M1 and M2) into a plurality of voice commands, and subsequently utilizing these voice commands to control a plurality of controlled devices 4 in a vehicle (not shown).
  • the system includes an amplifier 31 , an analog/digital converter 32, a blind source separator 33, a speech recognizer 34, a command database 35, a direction calculator 36, and a controller 37.
  • the microphones M1 , M2 are used to receive speech sounds uttered by at least one speaker (who may be the driver or a passenger) in the vehicle, and to convert the sound energy into electrical signals.
  • BSS blind source separation
  • the system can process the mixed speech sounds received by the microphones M1 , M2 when the speakers speak at the same time.
  • the amplifier 31 is used to amplify the electrical signals sent from the microphones M1 , M2.
  • the analog/digital converter 32 is used to convert the analog signals sent from the amplifier 31 into a set of discrete values to represent the magnitude of the sound energy.
  • the blind source separator 33 is used to separate the mixed speech signals sent from the analog/digital converter 32, and to transmit the speech signals thus separated to the speech recognizer 34.
  • the blind source separator 33 further sends information related to issuing directions of the sounds to the direction calculator 36.
  • the speech recognizer 34 is used to receive the separated speech signals sent from the blind source separator 33. According to predetermined voice command data retrieved from the command database 35, the speech recognizer 34 identifies voice commands for controlling a corresponding one of the controlled devices 4 in the vehicle from the separated speech signals received thereby, and outputs a command code of the corresponding one of the controlled devices 4. In addition, the speech recognizer 34 can be further used to determine whether the identified voice command is a driving- purpose command (i.e., to control an external rear view mirror, windscreen wipers, etc.) or a general-purpose command (i.e., to control entertainment equipment, air-conditioning equipment, etc.).
  • a driving- purpose command i.e., to control an external rear view mirror, windscreen wipers, etc.
  • a general-purpose command i.e., to control entertainment equipment, air-conditioning equipment, etc.
  • the direction calculator 36 is activated.
  • the direction calculator 36 is used to receive control signals sent from the speech recognizer 34, and uses the information related to the issuing directions of the sounds as sent from the blind source separator 33 to calculate the direction from which the driving-purpose command was issued for transmission to the controller 37.
  • the controller 37 is used to process the command codes sent from the speech recognizer 34. If the command code received by the controller 37 is a driving-purpose command code, a determination is made according to the calculation result transmitted from the direction calculator 36 as to whether the driving-purpose command was issued by the driver. If the driving-purpose command was issued by the driver, a control signal is sent to control the controlled device 4 which corresponds to that driving-purpose command code. If the command code received by the controller 37 is a general-purpose command code, a control signal is directly sent to control the controlled device 4 which corresponds to that general-purpose command code.
  • the method for controlling a vehicle using voice commands includes the following steps. Initially, in step 51 , analog mixed sounds of speeches made by a passenger and the driver in the vehicle are inputted through the microphones M1 , M2, and the amplifier 31 and the analog/digital converter 32 are used in sequence to amplify the sounds and convert the sounds into digital signals.
  • step 52 the mixed speech signals sent from the analog/digital converter 32 are separated into a plurality of original sound sources using the blind source separator 33 for transmission to the speech recognizer 34, and information related to issuing directions of the sounds is transmitted to the direction calculator 36.
  • step 53 a speech signal is identified from the plurality of separated original sound sources sent from the blind source separator 33 using the speech recognizer 34.
  • step 54 a determination is made as to whether the identified speech signal is a driving-purpose command according to the command database 35. If the speech signal is not a driving- purpose command, and is a general-purpose command, a general- purpose command code is directly transmitted to the controller 37, regardless of whether the general-purpose command was issued by the driver or the passenger. Furthermore, in step 55, the controller 37 is employed to process the general-purpose command code, and directly sends a control signal to control the controlled device 4, such as entertainment equipment or air-conditioning equipment, which corresponds to the general-purpose command code.
  • the controlled device 4 such as entertainment equipment or air-conditioning equipment
  • step 56 is performed to transmit the control signal to the direction calculator 36 for activating the latter to calculate the issuing direction of the driving-purpose command for subsequent transmission to the controller 37.
  • the speech recognizer 34 also outputs a driving-purpose command code to the controller 37.
  • step 57 the controller 37 is used to determine whether the driving- purpose command came from the direction of the driver. If yes, this indicates that the driving-purpose command was issued by the driver, and, in step 58, the controller 37 uses the driving-purpose command code to control the controlled device, such as an external rear view mirror or a windscreen wiper, which corresponds to the driving- purpose command code. Otherwise, this indicates that the driving- purpose command was issued by the passenger. Then, in step 59, the driving-purpose command is ignored without further processing.
  • the microphones M1 , M2 for collecting mixed sounds are disposed on a rear view mirror within the vehicle.
  • sounds within a vehicle compartment 20 e.g., sounds coming from the driver S1 or the passenger or other sound sources S2
  • the microphones MI 1 M2 will pick up convoluted sounds.
  • mixed matrix A( ⁇ ) represents the impulse response of the vehicle compartment 20.
  • T represents delay of the impulse response.
  • ⁇ "( r ) represents the impulse response from the driver S1 to the microphone M1.
  • ⁇ 2i( r ) represents the impulse response from the driver S1 to the microphone M2.
  • a n( ⁇ ) represents the impulse response from the passenger or other sound sources S2 to the microphone M1.
  • ⁇ 22( r ) represents the impulse response from the passenger or other sound sources S2 to the microphone M2.
  • elements S1 ⁇ and S2 ⁇ of the matrix "*(*' represent the sound source signals of the driver S1 and the passenger or other sound sources S2, respectively, where t represents a point in time of the sound signal.
  • elements wxl (0 and m ⁇ ⁇ 2(t) of the matr j x X(t) represent the signals received by the microphones M1 and M2, respectively. mix ⁇ (t)
  • a mixed signal V' convoluted with the impulse response of the vehicle compartment 20 can be obtained from equation (4), where P represents the length of the impulse response.
  • a preferred embodiment of a method for controlling a vehicle using voice commands employs multiple adaptive decorrelation (MAD), and a BSS algorithm of frequency domain independent component analysis (FDICA) to perform step 52 so as to separate the mixed sounds for generating a driver command and a passenger command.
  • MAD multiple adaptive decorrelation
  • FDICA frequency domain independent component analysis
  • the BSS equation is as shown in the following equation (6), where Q represents the length of a filter.
  • the de-mixing matrix w ⁇ will be equal to an inverse matrix of the mixed matrix A( ⁇ ).
  • the de-mixing matrix w ⁇ is an approximation of an inverse matrix of the mixed matrix A( ⁇ ).
  • the de-mixing matrix w ⁇ can be used to estimate time delay samples between impulse responses.
  • the time delay samples between impulse responses ⁇ n and ⁇ 2i are equivalent to the time delay samples between impulse responses w ⁇ and ⁇ w ⁇ .
  • a time delay sample is equivalent to a time differential between the maximum peak values of two impulse responses.
  • the two impulse responses come from the same sound source. For example, if the time of the maximum peak value of ⁇ ⁇ occurs in the 10th sample, and the time of the maximum peak value of a ⁇ occurs in the 14th sample, the time delay samples are 4 samples.
  • step 56 further includes sub-steps 561 , 562, and 563.
  • the direction calculator 36 uses the de-mixing matrix W transmitted from the blind source separator 33 to calculate an inverse matrix W-1 thereof.
  • time delay is calculated.
  • the issuing direction of the driving-purpose command is calculated in a manner to be described hereinafter. Referring to Figure 6, according to the time delay samples, the issuing direction of the driving-purpose command can be derived using a hyperbolic equation.
  • a rear view mirror (not shown) in the vehicle will be rotated an angle ⁇ 2, which can be inputted into the system of the present invention.
  • Symbols (m, n) represent coordinates of the driver S1 or the passenger or other sound sources S2 in the x'y' coordinate system.
  • Symbol d represents the distance between the two microphones M1 , M2.
  • Angle ⁇ 1 is the angle to be derived.
  • equations (9) and (10) the coordinates of the sound source S(m,n) in the x'y' coordinate system are shown in the following equations (11) and (12):
  • angle ⁇ 1 can be calculated as tan-1(n/m). Then, by subtracting ⁇ 2 from ⁇ 1, the direction of the sound source S(m,n) can be obtained.
  • the actual angle of the driver S1 relative to the point of origin in the xy coordinate system can be pre-defined to be between 30 and 60 degrees, for instance.
  • the controller 37 will determine that the driving-purpose command was issued by the driver S1, and will then control the controlled device 4 which corresponds to the driving-purpose command. On the contrary, if the direction calculator 36 calculates the angle ⁇ 1- ⁇ 2 of the driving- purpose command to be outside the range of from 30 to 60 degrees, the controller 37 will determine that the driving-purpose command was issued by the passenger or other sound sources S2, and will then ignore the driving-purpose command without further processing.
  • CSP crosspower spectrum phase
  • the method and system for controlling a vehicle using voice commands employs the blind source separator 33 to separate mixed voice commands received by the microphones M1 , M2. Then, if the voice command thus separated is a driving-purpose command, the method and system of the present invention can further determine whether the driving-purpose command was issued by the driver S1 by calculating the issuing direction of the driving-purpose command. If the driving-purpose command was indeed issued by the driver S1 , the controlled device in the vehicle to which the driving-purpose command corresponds will be controlled. On the contrary, if the driving-purpose command was not issued by the driver S1 , the driving-purpose command will not be processed.
  • the present invention can be applied to method and system for controlling a vehicle using voice commands.

Abstract

A method for controlling a vehicle using voice commands includes: using a blind source separator to separate sounds collected by a plurality of microphones into a plurality of sound sources; identifying a voice command from sound source signals sent from the blind source separator according to predetermined voice command data in a command database, and determining whether the voice command is a driving-purpose command; if the voice command is a driving-purpose command, calculating an issuing direction of the driving-purpose command according to position-related information of the driving-purpose command; determining whether the driving-purpose command was issued by a driver according to the issuing direction of the driving-purpose command thus calculated; and controlling a corresponding controlled device in the vehicle if the driving-purpose command was issued by the driver. A system for controlling a vehicle using voice commands is also disclosed.

Description

DESCRIPTION
METHOD AND SYSTEM FOR CONTROLLING A VEHICLE USING VOICE COMMANDS
Technical Field
The invention relates to a method and system for controlling a vehicle, more particularly to a method and system for controlling a vehicle using voice commands.
Background Art
The driver of a vehicle and passengers therein generally use their hands and feet to control controlled devices (e.g., stepping on the accelerator pedal or brake) in the vehicle. However, with the advance of speech recognition technology, there are currently available actual examples of using speech to control vehicles. For instance, referring to Figure 1 , Japanese Patent Publication No. JP041 19400 discloses a speech recognition device for a vehicle, which includes a plurality of microphones 1 1 , a phase shift unit 13, an adder unit 14, a maximum amplitude detection unit 15, and a speech recognition unit 16. The microphones 1 1 are separately disposed in the vehicle, and are used to collect speech sounds coming from a certain seat. The phase shift unit 13 shifts the phase of an electrical signal obtained by each microphone 11 with a corresponding phase shifting amount according to the location of the respective microphone 1 1. The adder unit 14 is used to add a corresponding output signal transmitted from the phase shift unit 13 to each of the source electrical signals obtained by the microphones 11. The maximum amplitude detection unit 15 is used to detect signals having the maximum amplitude from the output signals of the adder unit 14, treats the detected signals as voice commands from the aforesaid seat, and outputs the detected signals to the speech recognition unit 16 for recognition purposes.
The aforesaid conventional speech recognition device indeed can identify a voice command coming from a certain seat for controlling the vehicle if only one person (e.g., the driver or a passenger) in the vehicle speaks. However, if several people in different seats produce speech sounds at the same time, the microphones 1 1 will receive mixed voice commands, and the aforesaid conventional speech recognition device is unable to process such mixed voice commands. Therefore, there is a need for a method and system for controlling a vehicle using voice commands that can cope with a scenario in which several speakers in different seats in the vehicle produce speech sounds at the same time.
Disclosure of Invention
Therefore, the main object of the present invention is to provide a method for controlling a vehicle using voice commands, which can be employed to separate mixed voice commands. Then, if a voice command thus separated is related to driving, the method of the present invention can be further employed to determine whether such a driving-purpose command was indeed issued by the driver of the vehicle.
Accordingly, a method for controlling a vehicle using voice commands comprises the following steps. Initially, a blind source separator is used to separate sounds collected by a plurality of microphones into a plurality of sound sources. Then, according to predetermined voice command data in a command database, a voice command is identified from sound source signals sent from the blind source separator, and a determination is made as to whether the voice command is a driving-purpose command. Next, if the voice command is a driving-purpose command, an issuing direction of the driving-purpose command is calculated according to position-related information of the driving-purpose command. Subsequently, according to the issuing direction of the driving- purpose command thus calculated, a determination is made as to whether the driving-purpose command was issued by a driver. Thereafter, upon determining that the driving-purpose command was issued by the driver, a controlled device in the vehicle to which the driving-purpose command corresponds is controlled accordingly.
In addition, another object of the present invention is to provide a system for controlling a vehicle using voice commands, which can be employed to separate mixed voice commands. Then, if a voice command thus separated is related to driving, the system of the present invention can be further employed to determine whether such a driving-purpose command was indeed issued by the driver of the vehicle.
Accordingly, a system for controlling a vehicle using voice commands of this invention can separate a plurality of sound sources collected by a plurality of microphones into a plurality of voice commands, and subsequently utilize the voice commands to control a plurality of controlled devices in the vehicle. The system includes a blind source separator, a command database, a speech recognizer, a direction calculator, and a controller. The blind source separator is used to separate sounds collected by the microphones into the sound sources. The speech recognizer is used to receive sound source signals sent from the blind source separator, and to identify the voice commands from the sound source signals according to predetermined voice command data in the command database. The direction calculator calculates an issuing direction of a voice command coming from each sound source according to the voice commands identified by the speech recognizer and position-related information of the sound sources sent from the blind source separator. The controller determines whether or not to control the corresponding controlled devices according to the voice commands identified by the speech recognizer and the issuing directions of the voice commands as calculated by the direction calculator.
Brief Description of Drawings
Other features and advantages of the present invention will become apparent in the following detailed description of the preferred embodiment with reference to the accompanying drawings, of which:
Figure 1 is a block diagram illustrating a conventional speech recognition device;
Figure 2 is a system block diagram showing a preferred embodiment of a system for controlling a vehicle using voice commands according to the present invention;
Figure 3 is a flowchart illustrating a method for controlling a vehicle using voice commands according to the present invention; Figure 4 is a schematic diagram showing the positions of microphones in the preferred embodiment of the present invention, as well as the relative relationship of the microphones with sounds emitted by the driver and a passenger; Figure 5 is a flowchart illustrating a preferred embodiment of a method for controlling a vehicle using voice commands according to the present invention; and
Figure 6 is a schematic view illustrating use of a hyperbolic equation to derive an issuing direction of a driving-purpose command.
Best Mode for Carrying Out the Invention
Referring to Figure 2, the preferred embodiment of a system for controlling a vehicle using voice commands according to this invention is capable of separating a plurality of sound sources collected by a plurality of microphones (e.g., two microphones, M1 and M2) into a plurality of voice commands, and subsequently utilizing these voice commands to control a plurality of controlled devices 4 in a vehicle (not shown). The system includes an amplifier 31 , an analog/digital converter 32, a blind source separator 33, a speech recognizer 34, a command database 35, a direction calculator 36, and a controller 37.
The microphones M1 , M2 are used to receive speech sounds uttered by at least one speaker (who may be the driver or a passenger) in the vehicle, and to convert the sound energy into electrical signals. As blind source separation (BSS) techniques are employed in this invention, the system can process the mixed speech sounds received by the microphones M1 , M2 when the speakers speak at the same time. The amplifier 31 is used to amplify the electrical signals sent from the microphones M1 , M2. The analog/digital converter 32 is used to convert the analog signals sent from the amplifier 31 into a set of discrete values to represent the magnitude of the sound energy. The blind source separator 33 is used to separate the mixed speech signals sent from the analog/digital converter 32, and to transmit the speech signals thus separated to the speech recognizer 34. The blind source separator 33 further sends information related to issuing directions of the sounds to the direction calculator 36. The speech recognizer 34 is used to receive the separated speech signals sent from the blind source separator 33. According to predetermined voice command data retrieved from the command database 35, the speech recognizer 34 identifies voice commands for controlling a corresponding one of the controlled devices 4 in the vehicle from the separated speech signals received thereby, and outputs a command code of the corresponding one of the controlled devices 4. In addition, the speech recognizer 34 can be further used to determine whether the identified voice command is a driving- purpose command (i.e., to control an external rear view mirror, windscreen wipers, etc.) or a general-purpose command (i.e., to control entertainment equipment, air-conditioning equipment, etc.). If the voice command code outputted to the controller 37 by the speech recognizer 34 belongs to a driving-purpose command code, the direction calculator 36 is activated. The direction calculator 36 is used to receive control signals sent from the speech recognizer 34, and uses the information related to the issuing directions of the sounds as sent from the blind source separator 33 to calculate the direction from which the driving-purpose command was issued for transmission to the controller 37.
The controller 37 is used to process the command codes sent from the speech recognizer 34. If the command code received by the controller 37 is a driving-purpose command code, a determination is made according to the calculation result transmitted from the direction calculator 36 as to whether the driving-purpose command was issued by the driver. If the driving-purpose command was issued by the driver, a control signal is sent to control the controlled device 4 which corresponds to that driving-purpose command code. If the command code received by the controller 37 is a general-purpose command code, a control signal is directly sent to control the controlled device 4 which corresponds to that general-purpose command code.
Referring to Figures 2 and 3, the method for controlling a vehicle using voice commands according to this invention includes the following steps. Initially, in step 51 , analog mixed sounds of speeches made by a passenger and the driver in the vehicle are inputted through the microphones M1 , M2, and the amplifier 31 and the analog/digital converter 32 are used in sequence to amplify the sounds and convert the sounds into digital signals.
Then, in step 52, the mixed speech signals sent from the analog/digital converter 32 are separated into a plurality of original sound sources using the blind source separator 33 for transmission to the speech recognizer 34, and information related to issuing directions of the sounds is transmitted to the direction calculator 36.
Next, in step 53, a speech signal is identified from the plurality of separated original sound sources sent from the blind source separator 33 using the speech recognizer 34.
Subsequently, in step 54, a determination is made as to whether the identified speech signal is a driving-purpose command according to the command database 35. If the speech signal is not a driving- purpose command, and is a general-purpose command, a general- purpose command code is directly transmitted to the controller 37, regardless of whether the general-purpose command was issued by the driver or the passenger. Furthermore, in step 55, the controller 37 is employed to process the general-purpose command code, and directly sends a control signal to control the controlled device 4, such as entertainment equipment or air-conditioning equipment, which corresponds to the general-purpose command code.
On the contrary, if the speech signal identified by the speech recognizer 34 in the determination step is a driving-purpose command, step 56 is performed to transmit the control signal to the direction calculator 36 for activating the latter to calculate the issuing direction of the driving-purpose command for subsequent transmission to the controller 37. At the same time, the speech recognizer 34 also outputs a driving-purpose command code to the controller 37. Then, in step 57, the controller 37 is used to determine whether the driving- purpose command came from the direction of the driver. If yes, this indicates that the driving-purpose command was issued by the driver, and, in step 58, the controller 37 uses the driving-purpose command code to control the controlled device, such as an external rear view mirror or a windscreen wiper, which corresponds to the driving- purpose command code. Otherwise, this indicates that the driving- purpose command was issued by the passenger. Then, in step 59, the driving-purpose command is ignored without further processing.
Referring to Figure 4, in the preferred embodiment of this invention, the microphones M1 , M2 for collecting mixed sounds are disposed on a rear view mirror within the vehicle. In general, sounds within a vehicle compartment 20 (e.g., sounds coming from the driver S1 or the passenger or other sound sources S2) will convolute with impulse responses of the vehicle compartment 20. Therefore, the microphones MI 1 M2 will pick up convoluted sounds. As shown in the following equation (1 ), mixed matrix A(τ) represents the impulse response of the vehicle compartment 20. T represents delay of the impulse response. α"(r) represents the impulse response from the driver S1 to the microphone M1. α2i(r) represents the impulse response from the driver S1 to the microphone M2. an(τ) represents the impulse response from the passenger or other sound sources S2 to the microphone M1. α22(r) represents the impulse response from the passenger or other sound sources S2 to the microphone M2.
Figure imgf000010_0001
As shown in the following equation (2), elements S1^ and S2^ of the matrix "*(*' represent the sound source signals of the driver S1 and the passenger or other sound sources S2, respectively, where t represents a point in time of the sound signal.
Si(O
S(t) = (2)
S2(t)_
As shown in the following equation (3), elements wxl(0 and mιχ2(t) of the matrjx X(t) represent the signals received by the microphones M1 and M2, respectively. mix\(t)
X(t) = (3) mix2(t)
Accordingly, a mixed signal V' convoluted with the impulse response of the vehicle compartment 20 can be obtained from equation (4), where P represents the length of the impulse response.
«ii (r) au(τ)JS\(t - τ) mix\(t)
∑ A(τ)S (t - r) = ∑ = X(t) (4) τ=\ r=l Lα21 (τ) aΩ(τ)lS2(t - τ)j mix2(t)
Referring to Figures 2 and 5, a preferred embodiment of a method for controlling a vehicle using voice commands according to the present invention employs multiple adaptive decorrelation (MAD), and a BSS algorithm of frequency domain independent component analysis (FDICA) to perform step 52 so as to separate the mixed sounds for generating a driver command and a passenger command.
Such a BSS algorithm can be used to yield a de-mixing matrix ™ (τ) shown in the following equation (5) so that W*X=S, where the symbol "*" represents a convolve operator.
Figure imgf000011_0001
Thus, the BSS equation is as shown in the following equation (6), where Q represents the length of a filter.
= S(t) ... (6)
Figure imgf000011_0002
Therefore, the following equations (7) and (8) can be obtained according to equations (4) and (6):
A = ~ r Wj/-l (7)
Figure imgf000012_0001
In the ideal case, the de-mixing matrix w^ will be equal to an inverse matrix of the mixed matrix A(τ). However, in a general case, the de-mixing matrix w^ is an approximation of an inverse matrix of the mixed matrix A(τ). Thus, the de-mixing matrix w^ can be used to estimate time delay samples between impulse responses. As shown in equation (8), the time delay samples between impulse responses αn and α2i are equivalent to the time delay samples between impulse responses w∞ and ~ w^ . A time delay sample is equivalent to a time differential between the maximum peak values of two impulse responses. The two impulse responses come from the same sound source. For example, if the time of the maximum peak value of απ occurs in the 10th sample, and the time of the maximum peak value of a^ occurs in the 14th sample, the time delay samples are 4 samples.
Therefore, in the preferred embodiment of a method for controlling a vehicle using voice commands according to the present invention as shown in Figure 5, step 56 further includes sub-steps 561 , 562, and 563. In sub-step 561 , the direction calculator 36 uses the de-mixing matrix W transmitted from the blind source separator 33 to calculate an inverse matrix W-1 thereof. Then, in sub-step 562, time delay is calculated. Subsequently, in sub-step 563, the issuing direction of the driving-purpose command is calculated in a manner to be described hereinafter. Referring to Figure 6, according to the time delay samples, the issuing direction of the driving-purpose command can be derived using a hyperbolic equation. As shown in Figure 6, generally speaking, a rear view mirror (not shown) in the vehicle will be rotated an angle Θ2, which can be inputted into the system of the present invention. Symbols (m, n) represent coordinates of the driver S1 or the passenger or other sound sources S2 in the x'y' coordinate system. Symbol d represents the distance between the two microphones M1 , M2. Symbol a represents delay distance, which is equal to (vχk)/Fs, where v is the speed of sound (=331.4+0.6χ temperature (°C)(m/sec)); k is the number of time delay samples; and Fs is the sampling rate. Angle Θ1 is the angle to be derived.
In Figure 5, assuming the numerical values to the left of the points of origin in the xy and x'y' coordinate systems are positive, an equation for line L in the xy coordinate system is x=h, where h represents a horizontal distance from the sound source S(m,n) to the center of the rear view mirror (i.e. , the points of origin of the xy and x'y1 coordinate systems), and the horizontal distance h is supplied by the manufacturer of the vehicle.
Accordingly, a linear equation of line L in the x'y' coordinate system is shown in equation (9).
x'cos02 +/sin02 = A (9)
An equation for the hyperbolic curve C in the x'y' coordinate system is shown in the following equation (10):
Figure imgf000013_0001
According to equations (9) and (10), the coordinates of the sound source S(m,n) in the x'y' coordinate system are shown in the following equations (11) and (12):
2h(d2 -a2)sin2 O1 -lha1 cos2 θ2 -{2h{d2 -a2)sinθ2 2{d2 -a2)sm2θ2cosθ2 -2a2 cos3 θ2
^4h2{d22)sin2 θ2 -((d2-a2)sin2θ2 -a2 cos2 θ2X4h2(d2 -a2)-a2(d22)cos2 θ2)) 2(d2 -a2)sin2 θ2 cos<92 -2a2 cos3 θ2
H)
2h(d2 -a2)smθ2 n =
2(d2 -a2)sϊn2 θ2 -2a2 cos2 θ2
τj4h2(d2 -a-)sin2θ2 ~((c/2 -a2)sin2θ2 -a2 cos2θ2)(Ah2(d2 -a2)-a2(d2 -g-)cos2θ2) l(d2 - a2) sin2 θ2 - 2a2 cos2 θ2
(12)
Thus, angle Θ1 can be calculated as tan-1(n/m). Then, by subtracting Θ2 from Θ1, the direction of the sound source S(m,n) can be obtained.
In the method and system for controlling a vehicle using voice commands according to the present invention, the actual angle of the driver S1 relative to the point of origin in the xy coordinate system can be pre-defined to be between 30 and 60 degrees, for instance.
Therefore, when the direction calculator 36 calculates the angle Θ1-Θ2 of a driving-purpose command to be between 30 and 60 degrees, the controller 37 will determine that the driving-purpose command was issued by the driver S1, and will then control the controlled device 4 which corresponds to the driving-purpose command. On the contrary, if the direction calculator 36 calculates the angle Θ1-Θ2 of the driving- purpose command to be outside the range of from 30 to 60 degrees, the controller 37 will determine that the driving-purpose command was issued by the passenger or other sound sources S2, and will then ignore the driving-purpose command without further processing. In addition, aside from the method of utilizing time delay samples and hyperbolic equations to calculate the issuing direction of a voice command, other methods such as beamforming and crosspower spectrum phase (CSP) suitable for calculating the issuing direction of the voice command also fall within the scope of the present invention.
In sum, the method and system for controlling a vehicle using voice commands according to the present invention employs the blind source separator 33 to separate mixed voice commands received by the microphones M1 , M2. Then, if the voice command thus separated is a driving-purpose command, the method and system of the present invention can further determine whether the driving-purpose command was issued by the driver S1 by calculating the issuing direction of the driving-purpose command. If the driving-purpose command was indeed issued by the driver S1 , the controlled device in the vehicle to which the driving-purpose command corresponds will be controlled. On the contrary, if the driving-purpose command was not issued by the driver S1 , the driving-purpose command will not be processed.
While the present invention has been described in connection with what is considered the most practical and preferred embodiment, it is understood that this invention is not limited to the disclosed embodiment but is intended to cover various arrangements included within the spirit and scope of the broadest interpretation so as to encompass all such modifications and equivalent arrangements. Industrial Applicability
The present invention can be applied to method and system for controlling a vehicle using voice commands.

Claims

1. A method for controlling a vehicle using voice commands, comprising the following steps:
(a) using a blind source separator to separate sounds collected by a plurality of microphones into a plurality of sound sources;
(b)according to predetermined voice command data in a command database, identifying a voice command from sound source signals sent from the blind source separator, and determining whether the voice command is a driving-purpose command;
(c) if the voice command is a driving-purpose command, calculating an issuing direction of the driving-purpose command according to position-related information of the driving-purpose command; (d) according to the issuing direction of the driving-purpose command thus calculated, determining whether the driving-purpose command was issued by a driver; and
(e) if the driving-purpose command was issued by the driver, controlling a controlled device in the vehicle to which the driving- purpose command corresponds.
2. The method for controlling a vehicle using voice commands as claimed in Claim 1 , further comprising a step of using an amplifier to amplify the sounds collected by the microphones before step (a).
3. The method for controlling a vehicle using voice commands as claimed in Claim 1 , further comprising a step of using an analog/digital converter to convert analog signals of the sounds collected by the microphones to digital signals before step (a).
4. The method for controlling a vehicle using voice commands as claimed in Claim 1 , wherein if the voice command is a general- purpose command, a controlled device in the vehicle to which the general-purpose command corresponds is directly controlled.
5. The method for controlling a vehicle using voice commands as claimed in Claim 1 , wherein if the driving-purpose command was not issued by the driver, the driving-purpose command is ignored.
6. The method for controlling a vehicle using voice commands as claimed in Claim 1 , wherein step (a) further includes using a de- mixing matrix to separate the sounds collected by the microphones into the sound sources, and step (c) further includes calculating in sequence an inverse matrix of the de-mixing matrix and a time delay so as to calculate the issuing direction of the driving-purpose command.
7. A system for controlling a vehicle using voice commands, which can separate a plurality of sound sources collected by a plurality of microphones into a plurality of voice commands, and which subsequently utilizes the voice commands to control a plurality of controlled devices in the vehicle, said system comprising: a blind source separator for separating sounds collected by the microphones into the sound sources; a command database; a speech recognizer for receiving sound source signals sent from the blind source separator, and for identifying the voice commands from the sound source signals according to predetermined voice command data in the command database; a direction calculator which calculates an issuing direction of a voice command coming from each sound source according to the voice commands identified by the speech recognizer and position- related information of the sound sources sent from the blind source separator; and a controller which, according to the voice commands identified by the speech recognizer and the issuing directions of the voice commands as calculated by the direction calculator, determines whether or not to control the corresponding controlled devices.
8. The system for controlling a vehicle using voice commands as claimed in Claim 7, further comprising an amplifier for amplifying the sounds collected by the microphones.
9. The system for controlling a vehicle using voice commands as claimed in Claim 7, further comprising an analog/digital converter for converting analog signals of the sounds collected by the microphones to digital signals.
PCT/JP2006/301375 2005-02-01 2006-01-23 Method and system for controlling a vehicle using voice commands WO2006082764A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN 200510006462 CN1815556A (en) 2005-02-01 2005-02-01 Method and system capable of operating and controlling vehicle using voice instruction
CN200510006462.5 2005-02-01

Publications (1)

Publication Number Publication Date
WO2006082764A1 true WO2006082764A1 (en) 2006-08-10

Family

ID=36370846

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2006/301375 WO2006082764A1 (en) 2005-02-01 2006-01-23 Method and system for controlling a vehicle using voice commands

Country Status (2)

Country Link
CN (1) CN1815556A (en)
WO (1) WO2006082764A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1901282A2 (en) * 2006-09-15 2008-03-19 Volkswagen Aktiengesellschaft Speech communications system for a vehicle and method of operating a speech communications system for a vehicle
US8700405B2 (en) 2010-02-16 2014-04-15 Honeywell International Inc Audio system and method for coordinating tasks
CN103731768A (en) * 2013-12-25 2014-04-16 深圳Tcl新技术有限公司 Sound pickup method and device
GB2529509A (en) * 2014-06-11 2016-02-24 Honeywell Int Inc Adaptive beam forming devices, methods, and systems
EP3264266A4 (en) * 2015-05-20 2018-03-28 Huawei Technologies Co. Ltd. Method for positioning sounding location, and terminal device
US11237594B2 (en) * 2011-05-10 2022-02-01 Kopin Corporation Headset computer that uses motion and voice commands to control information display and remote devices
JP7383761B2 (en) 2021-06-03 2023-11-20 阿波▲羅▼智▲聯▼(北京)科技有限公司 Audio processing method, device, electronic device, storage medium and computer program for vehicles

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102566961A (en) * 2010-12-31 2012-07-11 上海博泰悦臻电子设备制造有限公司 Voice executing method and voice executing device based on application program of vehicle-mounted device
DE102011109932B4 (en) * 2011-08-10 2014-10-02 Audi Ag Method for controlling functional devices in a vehicle during voice command operation
WO2013101066A1 (en) * 2011-12-29 2013-07-04 Intel Corporation Direct grammar access
CN102800315A (en) * 2012-07-13 2012-11-28 上海博泰悦臻电子设备制造有限公司 Vehicle-mounted voice control method and system
CN103287362B (en) * 2013-06-25 2015-11-25 安科智慧城市技术(中国)有限公司 A kind of vehicle-mounted voice control system, method and electronlmobil
CN104978958A (en) * 2014-04-14 2015-10-14 美的集团股份有限公司 Voice control method and system
CN105302082A (en) * 2014-06-08 2016-02-03 上海能感物联网有限公司 Controller apparatus for on-site automatic navigation and car driving by non-specific person foreign language speech
CN105302799A (en) * 2014-06-08 2016-02-03 上海能感物联网有限公司 Controller apparatus of field automatic navigating and automobile driving by using foreign natural-language text
CN105302079A (en) * 2014-06-08 2016-02-03 上海能感物联网有限公司 Controller apparatus for controlling on-site car driving by Chinese speech
CN105292016A (en) * 2014-06-08 2016-02-03 上海能感物联网有限公司 Automobile system driven under control of on-site Chinese speech
CN105161101A (en) * 2014-06-08 2015-12-16 苗码信息科技(上海)股份有限公司 Method of achieving automatic navigation and driving automobile through on-site speaker-independent foreign language voice
CN104966514A (en) * 2015-04-30 2015-10-07 北京车音网科技有限公司 Speech recognition method and vehicle-mounted device
CN106297794A (en) * 2015-05-22 2017-01-04 西安中兴新软件有限责任公司 The conversion method of a kind of language and characters and equipment
CN105976815A (en) * 2016-04-22 2016-09-28 乐视控股(北京)有限公司 Vehicle voice recognition method and vehicle voice recognition device
US20170316783A1 (en) * 2016-04-28 2017-11-02 GM Global Technology Operations LLC Speech recognition systems and methods using relative and absolute slot data
US10319371B2 (en) * 2016-05-04 2019-06-11 GM Global Technology Operations LLC Disambiguation of vehicle speech commands
CN106878281B (en) * 2017-01-11 2020-03-31 上海蔚来汽车有限公司 In-vehicle positioning device and method based on mixed audio and in-vehicle equipment control system
CN108859840B (en) * 2018-07-23 2019-10-25 爱驰汽车有限公司 Electric car charge control method, system, equipment and storage medium
CN113763956A (en) * 2019-03-12 2021-12-07 百度在线网络技术(北京)有限公司 Interaction method and device applied to vehicle
CN110850829B (en) * 2019-11-19 2021-11-26 思必驰科技股份有限公司 Weather-based intelligent control method and system for vehicle-mounted equipment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0472356A1 (en) * 1990-08-16 1992-02-26 Fujitsu Ten Limited Speech recognition apparatus for a vehicle, using a microphone arrangement to determine the seat from which a command is generated
EP1429314A1 (en) * 2002-12-13 2004-06-16 Sony International (Europe) GmbH Correction of energy as input feature for speech processing

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0472356A1 (en) * 1990-08-16 1992-02-26 Fujitsu Ten Limited Speech recognition apparatus for a vehicle, using a microphone arrangement to determine the seat from which a command is generated
EP1429314A1 (en) * 2002-12-13 2004-06-16 Sony International (Europe) GmbH Correction of energy as input feature for speech processing

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
D.SCHOBBEN, K.TORKKOLA AND P.SMARAGDIS: "evaluation of blind signal separation methods", PROC.INT.WORKSHOP INDEPENDENT COMPONENT ANALYSIS BLIND SIGNAL SEPARATION, AUSSOIS, FRANCE, JAN 11-15, 1999, 1999, XP002383440, Retrieved from the Internet <URL:http://citeseer.ist.psu.edu/cache/papers/cs/3480/http:zSzzSzsound.media.mit.eduzSz~pariszSzparis-ica99.pdf/schobben99evaluation.pdf> [retrieved on 20060530] *
KONDO K ET AL: "SPEECH RECOGNITION IN CARS BY SPEAKER LOCALIZATION USING MICROPHONE ARRAY", SYSTEMS & COMPUTERS IN JAPAN, WILEY, HOBOKEN, NJ, US, vol. 34, no. 8, July 2003 (2003-07-01), pages 1 - 12, XP001178501, ISSN: 0882-1666 *
NEDELKO GRBICGRBIC ET AL: "Blind Signal Separation Using Overcomplete Subband Representation", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 9, no. 5, July 2001 (2001-07-01), XP011054112, ISSN: 1063-6676 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1901282A3 (en) * 2006-09-15 2008-05-21 Volkswagen Aktiengesellschaft Speech communications system for a vehicle and method of operating a speech communications system for a vehicle
US8214219B2 (en) 2006-09-15 2012-07-03 Volkswagen Of America, Inc. Speech communications system for a vehicle and method of operating a speech communications system for a vehicle
EP1901282A2 (en) * 2006-09-15 2008-03-19 Volkswagen Aktiengesellschaft Speech communications system for a vehicle and method of operating a speech communications system for a vehicle
US8700405B2 (en) 2010-02-16 2014-04-15 Honeywell International Inc Audio system and method for coordinating tasks
US9642184B2 (en) 2010-02-16 2017-05-02 Honeywell International Inc. Audio system and method for coordinating tasks
US11237594B2 (en) * 2011-05-10 2022-02-01 Kopin Corporation Headset computer that uses motion and voice commands to control information display and remote devices
US11947387B2 (en) 2011-05-10 2024-04-02 Kopin Corporation Headset computer that uses motion and voice commands to control information display and remote devices
CN103731768A (en) * 2013-12-25 2014-04-16 深圳Tcl新技术有限公司 Sound pickup method and device
US9451362B2 (en) 2014-06-11 2016-09-20 Honeywell International Inc. Adaptive beam forming devices, methods, and systems
GB2529509B (en) * 2014-06-11 2018-04-11 Honeywell Int Inc Adaptive beam forming devices, methods, and systems
US10062379B2 (en) 2014-06-11 2018-08-28 Honeywell International Inc. Adaptive beam forming devices, methods, and systems
GB2529509A (en) * 2014-06-11 2016-02-24 Honeywell Int Inc Adaptive beam forming devices, methods, and systems
US10410650B2 (en) 2015-05-20 2019-09-10 Huawei Technologies Co., Ltd. Method for locating sound emitting position and terminal device
EP3264266A4 (en) * 2015-05-20 2018-03-28 Huawei Technologies Co. Ltd. Method for positioning sounding location, and terminal device
JP7383761B2 (en) 2021-06-03 2023-11-20 阿波▲羅▼智▲聯▼(北京)科技有限公司 Audio processing method, device, electronic device, storage medium and computer program for vehicles

Also Published As

Publication number Publication date
CN1815556A (en) 2006-08-09

Similar Documents

Publication Publication Date Title
WO2006082764A1 (en) Method and system for controlling a vehicle using voice commands
EP1923866B1 (en) Sound source separating device, speech recognizing device, portable telephone, sound source separating method, and program
EP1703471A1 (en) Automatic recognition of vehicle operation noises
US6889189B2 (en) Speech recognizer performance in car and home applications utilizing novel multiple microphone configurations
US7206255B2 (en) Signal processing apparatus and signal processing method
US8996383B2 (en) Motor-vehicle voice-control system and microphone-selecting method therefor
EP3301948A1 (en) System and method for localization and acoustic voice interface
JP5543023B2 (en) Object sound enhancement device and car navigation system
US20050159945A1 (en) Noise cancellation system, speech recognition system, and car navigation system
WO2016103709A1 (en) Voice processing device
CN110691299A (en) Audio processing system, method, apparatus, device and storage medium
EP3163573A1 (en) Speech collector in car cabin
JP2001075594A (en) Voice recognition system
CN112397065A (en) Voice interaction method and device, computer readable storage medium and electronic equipment
WO2016103710A1 (en) Voice processing device
WO2015086895A1 (en) Spatial audio processing apparatus
EP1493993A1 (en) Method and device for controlling a speech dialog system
US20040128127A1 (en) Method for processing speech using absolute loudness
CN111489750A (en) Sound processing apparatus and sound processing method
KR101704510B1 (en) Method for enhancement of speech of interest, an apparatus for enhancement of speech of interest and a vehicle equipped with the apparatus
JP2005227512A (en) Sound signal processing method and its apparatus, voice recognition device, and program
JP2005354223A (en) Sound source information processing apparatus, sound source information processing method, and sound source information processing program
CN109243457A (en) Voice-based control method, device, equipment and storage medium
JP2965301B2 (en) Sound collector
JP2005157086A (en) Speech recognition device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 06701510

Country of ref document: EP

Kind code of ref document: A1

WWW Wipo information: withdrawn in national office

Ref document number: 6701510

Country of ref document: EP