US20070244705A1 - Electronic instrument - Google Patents


Publication number
US20070244705A1
Authority
US
United States
Prior art keywords
command information
control
unit
speech recognition
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US11/785,241
Other versions
US7853448B2 (en)
Inventor
Shusuke Narita
Susumu Tokoshima
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Funai Electric Co Ltd
Original Assignee
Funai Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Funai Electric Co Ltd
Assigned to FUNAI ELECTRIC CO., LTD. Assignment of assignors' interest (see document for details). Assignors: NARITA, SHUSUKE; TOKOSHIMA, SUSUMU
Publication of US20070244705A1
Application granted
Publication of US7853448B2
Expired - Fee Related
Adjusted expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue

Definitions

  • the present invention relates to an electronic instrument capable of performing various controls by speech recognition.
  • the electronic instrument includes a digital broadcast receiving apparatus that searches a broadcast program based on a keyword, for example, inputted by voice by a user and displays a result of the search based on the keyword (refer to JP-Tokukai-2002-290859A as Patent Document 1).
  • a program reservation system in which a user can select a desired category from among categories for guiding the programs, for example, such as “Weather Forecast”, “News”, and “Baseball”, which are displayed as a list on TV display unit, and further can select a name of a desired program by voice from among programs displayed as a list after confirming programs names, broadcasting times, broadcast channels, and broadcast contents relating to programs which coincide with the category selected by voice (refer to JP-Tokukai-H09-322086A as Patent Document 2).
  • Although the apparatuses of Patent Documents 1 to 3 have a function to display results of the speech recognition, the function merely makes the results easier to grasp visually, and cannot prevent in advance a malfunction caused by erroneous speech recognition.
  • An object of the present invention is to provide an electronic instrument capable of preventing in advance the malfunction caused by erroneous speech recognition.
  • in an electronic instrument which performs speech recognition by an acoustic model pattern for speech information inputted from an outside, and which is controlled by command information obtained based on a result of the speech recognition, the electronic instrument comprises:
  • a display control unit for displaying a control content corresponding to the command information based on the result of the speech recognition, on a display unit;
  • a control unit for performing the control based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time, and for canceling the control based on the command information based on the result of the speech recognition when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time;
  • a shortening unit for shortening the standby time when a ratio of the number of executions in which one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a first set value
  • a command information storage unit for storing the one kind of command information and a plurality of the acoustic model patterns so as to refer the one kind of command information to the plurality of the acoustic model patterns;
  • a command information deletion unit for deleting one acoustic model pattern relating to the one kind of command information from the command information storage unit when a ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a second set value.
  • the control content corresponding to the command information based on the result of the speech recognition by the acoustic model pattern can be displayed on the display unit by the display control unit.
  • the control for the control content displayed on the display unit by the display control unit can be instructed to be cancelled by the instruction unit.
  • the control can be performed by the control unit based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time.
  • the control based on the command information based on the result of the speech recognition can be cancelled by the control unit when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time.
  • in the command information storage unit, the one kind of command information and the plurality of acoustic model patterns can be stored so as to refer the one kind of command information to the plurality of acoustic model patterns. Furthermore, the standby time can be shortened by the shortening unit when the ratio of the number of executions in which the one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the first set value.
  • the one acoustic model pattern relating to the one kind of command information can be deleted from the command information storage unit by the command information deletion unit when the ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the second set value.
  • when the one acoustic model pattern relating to the command information has a high possibility of erroneous recognition, the one acoustic model pattern can be deleted from the command information storage unit. In such a way, it is possible to prevent in advance the malfunction of the electronic instrument, which is caused by the erroneous recognition, and it is possible to customize the command information storage unit.
  • in an electronic instrument which performs speech recognition by an acoustic model pattern for speech information inputted from an outside, and which is controlled by command information obtained based on a result of the speech recognition, the electronic instrument comprises:
  • a display control unit for displaying a control content corresponding to the command information based on the result of the speech recognition, on a display unit;
  • a control unit for performing the control based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time, and for canceling the control based on the command information based on the result of the speech recognition when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time.
  • the control content corresponding to the command information based on the result of the speech recognition by the acoustic model pattern can be displayed on the display unit by the display control unit.
  • the control for the control content displayed on the display unit by the display control unit can be instructed to be cancelled by the instruction unit.
  • the control can be performed by the control unit based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time.
  • the control based on the command information based on the result of the speech recognition can be cancelled by the control unit when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time.
  • the electronic instrument further comprises:
  • a shortening unit for shortening the standby time when a ratio of the number of executions in which one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a first set value.
  • the standby time can be shortened by the shortening unit when the ratio of the number of executions in which the one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays that the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the first set value.
  • the electronic instrument further comprises:
  • a command information storage unit for storing one kind of command information and a plurality of the acoustic model patterns so as to refer the one kind of command information to the plurality of the acoustic model patterns
  • a command information deletion unit for deleting one acoustic model pattern relating to the one kind of command information from the command information storage unit when a ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a second set value.
  • in the command information storage unit, the one kind of command information and the plurality of acoustic model patterns can be stored so as to refer the one kind of command information to the plurality of acoustic model patterns.
  • the one acoustic model pattern relating to the one kind of command information can be deleted from the command information storage unit by the command information deletion unit when the ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the second set value.
  • when the one acoustic model pattern relating to the command information has a high possibility of erroneous recognition, the one acoustic model pattern can be deleted from the command information storage unit. In such a way, it is possible to prevent in advance the malfunction of the electronic instrument, which is caused by the erroneous recognition, and it is possible to customize the command information storage unit.
  • FIG. 1 is a block diagram showing a configuration of main portions of a digital television set according to the present invention.
  • FIG. 2 is a view showing an example of a command table in the present invention.
  • FIG. 3 is a view showing a display example by display control unit of the present invention.
  • FIG. 4 is a flowchart showing operation processing of an instrument body portion in the present invention.
  • FIG. 5 is a flowchart showing command table update processing in the present invention.
  • the description will be made of the embodiment by taking a digital television set as an example of an electronic instrument.
  • the electronic instrument is not limited to the digital television set, and for example, may be electrical appliances such as a videotape recorder and an air conditioner.
  • a digital television set 100 of this embodiment comprises an instrument body portion 1 that receives television broadcast signals (hereinafter, referred to as broadcast signals), converts the received broadcast signals into predetermined output signals, and outputs video/audio data, and a microphone-equipped remote control device (hereinafter, referred to as a microphone-equipped remote controller) 2 as an instruction unit.
  • the instrument body portion 1 comprises an antenna 3 that receives the broadcast signals, a tuner 4 that selects a broadcast signal of a predetermined broadcast channel from the broadcast signals received by the antenna 3, a demodulation unit 5 that performs demodulation and error correction processing for the broadcast signal outputted from the tuner 4, a descrambler 6 that descrambles a scramble signal for preventing unauthorized viewing, which is added to the broadcast signal demodulated by the demodulation unit 5, a demultiplexer 7 that acquires the video/audio data by separating and extracting respective data from the broadcast signal descrambled by the descrambler 6, a decoder 8 that performs decompression processing for the video/audio data acquired by the demultiplexer 7, an on-screen character display (OSD) processing unit 9 as a display control unit for adding OSD data to the video data decompressed by the decoder 8, a receiver 10 as a display unit for outputting the video/audio data processed by the decoder
  • a microphone is used as the speech information acquisition unit 11 , and the speech information acquisition unit 11 acquires speech directly as the speech information.
  • the speech information acquisition unit 11 functions as an instruction unit by acquiring, from a user, speech information instructing that the control for the control content displayed when a central processing unit (CPU) 121 executes a display control program 123 c be cancelled.
  • the speech information acquisition unit 11 may be designed not only to acquire the speech directly as the speech information but also to acquire a speech signal as the speech information, which is obtained after converting the speech into the speech signal.
  • control unit 12 comprises the CPU 121 , a random access memory (RAM) 122 , a storage unit 123 , and the like.
  • the CPU 121 performs a variety of control operations in accordance with a command table and a variety of processing programs, which are stored in the storage unit 123 .
  • the RAM 122 comprises a program storage area for expanding the processing programs and the like, which are executed by the CPU 121 , a data storage area that stores input data, processing results generated when the above-described processing programs are executed, and the like.
  • the storage unit 123 stores a system program executable by the instrument body portion 1, the variety of processing programs executable by the system program, data for use in the case of executing this variety of processing programs, data of results of arithmetic processing executed by the CPU 121, and the like. Note that the programs are stored in the storage unit 123 in a form of computer-readable program codes.
  • in the storage unit 123, a command table 123 a, a speech recognition program 123 b, a display control program 123 c, a control program 123 d, a shortening program 123 e, a command information deletion program 123 f, and the like, are stored.
  • the command table 123 a is a table that stores command information including speech information for controlling the instrument body portion 1 .
  • the command table 123 a stores one kind of the command information and a plurality of acoustic model patterns so as to refer one kind of the command information to a plurality of acoustic model patterns. Moreover, the command table 123 a stores a plurality of acoustic model patterns relating to each command information, the number of executions in which the command information corresponding to the acoustic model pattern is executed, and the number of cancellations in which the corresponding command information is cancelled so as to refer them to one another.
  • the command table 123 a stores each command information and the command to be displayed, which indicates the control contents of the command information, so as to refer them to each other.
  • the storage unit 123 stores the command table 123 a , and functions as a command information storage unit.
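The relationships held by the command table 123 a can be pictured with a small data structure. The following is only a sketch under stated assumptions: the class names `PatternStats` and `CommandEntry`, the field names, and the counter values are hypothetical and are not taken from FIG. 2.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class PatternStats:
    """Counters kept for one acoustic model pattern (hypothetical layout)."""
    executions: int = 0      # times the recognized command was executed
    cancellations: int = 0   # times the recognized command was cancelled


@dataclass
class CommandEntry:
    """One kind of command information, its display text, and its patterns."""
    display_text: str
    patterns: Dict[int, PatternStats] = field(default_factory=dict)


# Illustrative contents only; the real table contents are not reproduced here.
command_table: Dict[str, CommandEntry] = {
    "ON": CommandEntry(
        display_text="Result of speech recognition: Are you sure the speech is 'ON'?",
        patterns={
            1: PatternStats(executions=12, cancellations=1),
            2: PatternStats(executions=3, cancellations=19),
        },
    ),
}
```

Keying the counters by acoustic model pattern keeps the per-pattern cancellation count available for the deletion decision, while the display text stays attached to the command itself.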
  • the speech recognition program 123 b is a program that allows the CPU 121 to realize a function to perform speech recognition processing for the speech information acquired by the speech information acquisition unit 11 .
  • the speech recognition is realized, for example, by using speech recognition software named Julius, and by accumulating statistical data of enormous speech patterns and language patterns.
  • a basic principle of the speech recognition is as follows. Specifically, frequency patterns of acoustic models (phonemes, each of which substantially corresponds to one alphabetical character, and syllables, each of which corresponds to a kana character of Japanese) are held. Then, while a word dictionary is referred to, each of the model patterns is matched with a signal of a syllable row obtained by cutting out a monosyllabic portion of a speech wave signal of the input speech. In such a way, the speech recognition is performed.
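The matching principle described above can be illustrated with a deliberately simplified toy. This sketch assumes each cut-out syllable portion has already been reduced to a short feature vector, and it picks the nearest stored pattern by Euclidean distance; the names `closest_unit` and `decode` and all values are hypothetical. Real recognizers such as Julius rely on statistical models rather than this nearest-template comparison.

```python
import math
from typing import Dict, Optional, Sequence, Set, Tuple

Vector = Tuple[float, ...]


def closest_unit(frame: Vector, templates: Dict[str, Vector]) -> str:
    """Return the unit whose stored pattern is nearest to the given frame."""
    return min(templates, key=lambda unit: math.dist(frame, templates[unit]))


def decode(frames: Sequence[Vector],
           templates: Dict[str, Vector],
           dictionary: Set[str]) -> Optional[str]:
    """Match each frame to a unit, then accept the row only if it is a known word."""
    word = "".join(closest_unit(frame, templates) for frame in frames)
    return word if word in dictionary else None


# Toy two-dimensional "frequency patterns" (illustrative values only).
templates = {"O": (1.0, 0.0), "N": (0.0, 1.0), "F": (1.0, 1.0)}
recognized = decode([(0.9, 0.1), (0.2, 0.8)], templates, {"ON", "OFF"})
unknown = decode([(0.1, 0.9), (0.1, 0.9)], templates, {"ON", "OFF"})
```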
  • the CPU 121 performs the speech recognition for the speech information acquired by the speech information acquisition unit 11 by using one acoustic model pattern of the plurality of preset acoustic model patterns, and extracts command information that coincides with a result of the speech recognition from the command table 123 a . Moreover, when the command information that coincides with the result is not present in the command table 123 a as a result of the speech recognition by the one acoustic model pattern, the CPU 121 changes the acoustic model pattern, performs the speech recognition one more time, and extracts command information that coincides with the result of the speech recognition from the command table 123 a.
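The retry behaviour described above can be sketched as a loop over acoustic model patterns. This is an illustration only: `recognize_command` and the stand-in `fake_recognize` are hypothetical names, and trying every remaining pattern in order is an assumption that generalizes the single retry described in the text.

```python
from typing import Callable, Dict, Iterable, Optional, Tuple


def recognize_command(
    speech: bytes,
    pattern_ids: Iterable[int],
    recognize: Callable[[bytes, int], Optional[str]],
    command_table: Dict[str, object],
) -> Optional[Tuple[str, int]]:
    """Try each acoustic model pattern until the result matches a table entry.

    Returns (command information, pattern id) or None when nothing matches.
    """
    for pattern_id in pattern_ids:
        result = recognize(speech, pattern_id)
        if result is not None and result in command_table:
            return result, pattern_id
    return None


def fake_recognize(speech: bytes, pattern_id: int) -> Optional[str]:
    """Stand-in recognizer: pattern 1 mishears the input, pattern 2 hears 'ON'."""
    return {1: "OWN", 2: "ON"}.get(pattern_id)


match = recognize_command(b"raw audio", [1, 2], fake_recognize, {"ON": None})
no_match = recognize_command(b"raw audio", [1], fake_recognize, {"ON": None})
```

Returning the pattern id together with the command makes it possible to attribute later executions and cancellations to the pattern that produced the result.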
  • the display control program 123 c is a program that allows the CPU 121 to realize a function to display the control content corresponding to the command information based on the result of the speech recognition, on the receiver 10 .
  • the CPU 121 extracts a command to be displayed from the command table 123 a , which corresponds to the one command information obtained based on the result of the speech recognition, for example, as shown in FIG. 2 . Then, the CPU 121 processes the command to be displayed in the OSD processing unit 9 , and displays the command to be displayed on the receiver 10 as the display unit.
  • the CPU 121 executes the display control program 123 c , and functions as the display control unit.
  • the control program 123 d is a program that allows the CPU 121 to realize a function to control the instrument body portion 1 by using the command information based on the speech recognition result.
  • the CPU 121 performs the control based on the command information based on the result of the speech recognition by executing the control program 123 d after the predetermined standby time elapses. Meanwhile, when the speech information acquisition unit 11 instructs, by the speech information, that the control for the control content is cancelled, the CPU 121 cancels the control based on the command information based on the result of the speech recognition by the control program 123 d.
  • the CPU 121 executes the control program 123 d , and functions as a control unit.
  • the shortening program 123 e is a program that allows the CPU 121 to realize a function to shorten the predetermined standby time since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the receiver 10 by executing the display control program 123 c.
  • the CPU 121 shortens the standby time by executing the shortening program 123 e.
  • the CPU 121 executes the shortening program 123 e , and functions as a shortening unit.
  • the standby time may be set so as to be shortened in the following manner. Specifically, when the ratio of the number of executions in which the one kind of command information based on the result of the speech recognition is executed with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by the display control program 123 c is 85% or more, the standby time is set to five seconds. When the ratio is 90% or more, the standby time is shortened to three seconds. When the ratio is 95% or more, the standby time is shortened to zero seconds.
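The tiers above translate into a small lookup. In this sketch the function name `standby_seconds` and the default of ten seconds for ratios below 85% are assumptions, since the text does not state the initial standby time; the thresholds and the five, three, and zero second values come from the passage.

```python
def standby_seconds(executions: int, displays: int, default: float = 10.0) -> float:
    """Pick the standby time from the execution-to-display ratio.

    Integer arithmetic is used so that a ratio of exactly 85%, 90% or 95%
    is never lost to floating-point rounding.
    """
    if displays <= 0:
        return default                      # nothing displayed yet (assumed default)
    if executions * 100 >= 95 * displays:   # 95% or more
        return 0.0
    if executions * 100 >= 90 * displays:   # 90% or more
        return 3.0
    if executions * 100 >= 85 * displays:   # 85% or more
        return 5.0
    return default
```

For example, a command executed on 19 of 20 displays sits exactly at 95% and would be carried out without any waiting.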
  • the command information deletion program 123 f is a program that allows the CPU 121 to realize a function to delete one acoustic model pattern relating to the one kind of command information stored in the command table 123 a.
  • the CPU 121 deletes the one acoustic model pattern relating to the one kind of command information from the command table 123 a by executing the command information deletion program 123 f.
  • the CPU 121 executes the command information deletion program 123 f , and functions as a command information deletion unit.
  • In Step S 1, the instrument body portion 1 is in a standby state for acquiring the speech information. When the CPU 121 determines that the speech information is acquired by the speech information acquisition unit 11 (Step S 1: Yes), the CPU 121 proceeds to Step S 2. Meanwhile, when the CPU 121 determines that the speech information is not acquired (Step S 1: No), the instrument body portion 1 returns to the speech information acquisition standby state.
  • In Step S 2, the CPU 121 performs the speech recognition for the acquired speech information by executing the speech recognition program 123 b.
  • the CPU 121 performs the speech recognition by using an acoustic model pattern “1”, and extracts the command information “ON” that coincides with the result of the speech recognition, from the command table 123 a.
  • In Step S 3, the CPU 121 extracts a command to be displayed, which corresponds to the command information based on the result of the speech recognition, from the command table 123 a by executing the display control program 123 c, and displays the command to be displayed on the receiver 10 by processing the command to be displayed in the OSD processing unit 9.
  • the CPU 121 extracts a command to be displayed “Result of speech recognition: Are you sure the speech is ‘ON’?”, which corresponds to the command information, and as shown in FIG. 3, displays the command to be displayed “Result of speech recognition: Are you sure the speech is ‘ON’?” on the receiver 10.
  • In Step S 4, by executing the control program 123 d, the CPU 121 determines whether or not the predetermined standby time has elapsed since the control content corresponding to the command information based on the result of the speech recognition started to be displayed on the receiver 10. Then, in the case that it is determined that the predetermined standby time has elapsed (Step S 4: Yes), the CPU 121 proceeds to Step S 5, and executes the control for the instrument body portion 1 in accordance with the command information obtained based on the result of the speech recognition. Then, the CPU 121 proceeds to Step S 9.
  • In Step S 4, in the case that it is determined that the predetermined standby time has not elapsed (Step S 4: No), the CPU 121 proceeds to Step S 6.
  • the instrument body portion 1 enters the standby state for acquiring the speech information from the user.
  • In Step S 6, in the case that it is determined that the speech information is acquired by the speech information acquisition unit 11 (Step S 6: Yes), the CPU 121 proceeds to Step S 7. Then, the CPU 121 performs the speech recognition for the speech information acquired from the outside by executing the speech recognition program 123 b, and determines whether or not the obtained command information is a cancellation command. Meanwhile, in Step S 6, in the case that it is determined that the speech information is not acquired (Step S 6: No), the CPU 121 returns to Step S 4.
  • the user when the user desires to cancel the control corresponding to the display “Result of speech recognition: Are you sure the speech is ‘ON’?”, for example, as shown in FIG. 3 , the user inputs speech information “Cancel” to the speech information acquisition unit 11 .
  • the CPU 121 performs the speech recognition for the speech information “Cancel”.
  • In Step S 7, in the case that it is determined that there is command information corresponding to the cancellation command as a result of the speech recognition (Step S 7: Yes), the CPU 121 proceeds to Step S 8. Meanwhile, in Step S 7, in the case that it is determined that there is no command information corresponding to the cancellation command as the result of the speech recognition (Step S 7: No), the CPU 121 returns to Step S 4.
  • In Step S 8, by executing the control program 123 d, the CPU 121 cancels the control in accordance with the command information obtained based on the result of the speech recognition. Then, the CPU 121 proceeds to Step S 9.
  • In Step S 9, the CPU 121 executes the shortening program 123 e and the command information deletion program 123 f, and performs command table update processing which will be described later. In such a way, the CPU 121 finishes the processing.
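Steps S 3 to S 8 amount to a display, wait, and cancel loop, which can be sketched as follows. Everything here is a hypothetical illustration: `confirm_or_cancel`, its callback parameters, and the fake clock are assumptions, and a real device would block on the microphone instead of polling in a tight loop.

```python
import itertools
import time
from typing import Callable, Optional


def confirm_or_cancel(
    display: Callable[[str], None],
    poll_command: Callable[[], Optional[str]],
    execute: Callable[[], None],
    message: str,
    standby_seconds: float,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Return True if the control was executed, False if it was cancelled."""
    display(message)                            # Step S3: show the control content
    start = clock()
    while clock() - start < standby_seconds:    # Step S4: standby time not yet elapsed
        command = poll_command()                # Step S6: was new speech acquired?
        if command == "Cancel":                 # Step S7: is it a cancellation command?
            return False                        # Step S8: cancel the control
    execute()                                   # Step S5: perform the control
    return True


def fake_clock() -> Callable[[], float]:
    """Clock for the demo that advances one 'second' each time it is read."""
    ticks = itertools.count()
    return lambda: next(ticks)


shown, ran = [], []
text = "Result of speech recognition: Are you sure the speech is 'ON'?"

# No speech arrives during the standby time, so the control is executed.
executed = confirm_or_cancel(shown.append, lambda: None,
                             lambda: ran.append("ON"), text, 3, clock=fake_clock())

# The user says "Cancel" before the standby time elapses.
replies = iter([None, "Cancel"])
second_run = confirm_or_cancel(shown.append, lambda: next(replies),
                               lambda: ran.append("ON"), text, 5, clock=fake_clock())
```

With a standby time of zero the loop body never runs, so the control is executed immediately after the display, which matches the shortest tier of the shortening scheme.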
  • In Step S 11, when the control based on the command information obtained based on the result of the speech recognition is executed by executing the control program 123 d (Step S 11: Yes), the CPU 121 proceeds to Step S 12. Then, the CPU 121 adds “1” to the number of control executions, and proceeds to Step S 13.
  • the CPU 121 adds “1” to the number of control executions.
  • the number of executions becomes “13” from “12”.
  • In Step S 11, when the control based on the command information obtained based on the result of the speech recognition is cancelled by executing the control program 123 d (Step S 11: No), the CPU 121 proceeds to Step S 15. Then, the CPU 121 adds “1” to the number of control cancellations, and proceeds to Step S 16.
  • the CPU 121 adds “1” to the number of control cancellations.
  • the number of cancellations becomes “20” from “19”.
  • In Step S 13, in the case that it is determined that the ratio of the number of executions exceeds the first set value (Step S 13: Yes), the CPU 121 proceeds to Step S 14.
  • the CPU 121 shortens the predetermined standby time by executing the shortening program. Meanwhile, in Step S 13 , in the case that it is determined that the ratio does not exceed the first set value (Step S 13 : No), the CPU 121 finishes the processing.
  • the standby time is shortened, for example, to two seconds from five seconds.
  • In Step S 16, in the case that it is determined that the ratio of the number of cancellations exceeds the second set value (Step S 16: Yes), the CPU 121 proceeds to Step S 17.
  • the CPU 121 deletes the one acoustic model pattern relating to the command information from the command table 123 a by executing the command information deletion program 123 f.
  • In Step S 16, in the case that it is determined that the ratio does not exceed the second set value (Step S 16: No), the CPU 121 finishes the processing.
  • the CPU 121 deletes the acoustic model pattern 2 relating to the command information from the command table 123 a.
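The command table update processing of Steps S 11 to S 17 can be sketched as below. Several points are assumptions rather than statements from the text: the number of displays is taken to be executions plus cancellations (each display in FIG. 4 ends in one or the other), both ratios are computed per acoustic model pattern, and the set values 0.85 and 0.5 as well as the names `PatternStats` and `update_command_table` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class PatternStats:
    executions: int = 0
    cancellations: int = 0


def update_command_table(
    patterns: Dict[int, PatternStats],
    pattern_id: int,
    executed: bool,
    first_set_value: float = 0.85,   # assumed threshold for shortening
    second_set_value: float = 0.5,   # assumed threshold for deletion
) -> str:
    """Update the counters and report what the update led to."""
    stats = patterns[pattern_id]
    if executed:                                   # Step S11: Yes
        stats.executions += 1                      # Step S12
    else:                                          # Step S11: No
        stats.cancellations += 1                   # Step S15
    displays = stats.executions + stats.cancellations
    if executed and stats.executions / displays > first_set_value:        # Step S13
        return "shorten"                           # Step S14: shorten the standby time
    if not executed and stats.cancellations / displays > second_set_value:  # Step S16
        del patterns[pattern_id]                   # Step S17: delete the pattern
        return "deleted"
    return "unchanged"


# Illustrative counters chosen to mirror the "12 to 13" and "19 to 20" examples.
patterns = {1: PatternStats(executions=12, cancellations=1),
            2: PatternStats(executions=3, cancellations=19)}
after_execution = update_command_table(patterns, 1, executed=True)
after_cancellation = update_command_table(patterns, 2, executed=False)
```

Returning a label instead of changing the standby time directly keeps this sketch independent of how the shortened value is chosen.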
  • the control content corresponding to the command information based on the result of the speech recognition by the acoustic model pattern can be displayed on the receiver 10 .
  • in the speech information acquisition unit 11, speech information instructing that the control be cancelled is acquired. As a result, it is possible to instruct that the control for the control content displayed on the receiver 10 by executing the display control program 123 c is cancelled.
  • the control can be performed by the CPU 121 executing the control program 123 d, based on the command information based on the result of the speech recognition, after the predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the receiver 10 by executing the display control program 123 c, provided that the speech information acquisition unit 11 does not acquire, within the predetermined standby time, speech information instructing that the control for the control content be cancelled. Meanwhile, when the speech information acquisition unit 11 acquires such speech information within the predetermined standby time, it is possible to cancel the control based on the command information based on the result of the speech recognition.
  • the command table 123 a the one kind of command information and the plurality of acoustic model patterns can be stored so as to refer the one kind of command information to the plurality of acoustic model patterns. Furthermore, the standby time can be shortened by executing the shortening program 123 e when the ratio of the number of executions in which the one kind of command information based on the result of the speech recognition is executed with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by executing the display control program 123 c , exceeds the first set value.
  • the one acoustic model pattern relating to the one kind of command information can be deleted from the command table 123 a by executing the command information deletion program 123 f using the CPU 121 when the ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by executing the display control program 123 c , exceeds the second set value.
  • the one acoustic model pattern relating to the command information in which a possibility of the erroneous recognition is high it is possible to delete the one acoustic model pattern from the command table 123 a .
  • the instruction unit is not limited to the case of inputting the speech information that the control is cancelled, from the outside.

Abstract

An electronic instrument includes: a display control unit for displaying a control content corresponding to the command information based on the result of the speech recognition; an instruction unit for instructing that a control for the control content displayed by the display control unit, is cancelled; a control unit for performing the control based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time, and for canceling the control based on the command information based on the result of the speech recognition when the instruction unit instructs that the control is cancelled within the predetermined standby time.

Description

    BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The present invention relates to an electronic instrument capable of performing various controls by speech recognition.
  • 2. Description of the Related Art
  • In recent years, as electronic instruments such as televisions and radios have become more multifunctional and more sophisticated, there has been demand for electronic instruments that support these various functions and are still easy to operate. At present, electronic instruments that use speech recognition technology for such operation are in use.
  • The electronic instrument includes a digital broadcast receiving apparatus that searches a broadcast program based on a keyword, for example, inputted by voice by a user and displays a result of the search based on the keyword (refer to JP-Tokukai-2002-290859A as Patent Document 1).
  • Moreover, there is a program reservation system, in which a user can select by voice a desired category from among categories for guiding the programs, for example, such as “Weather Forecast”, “News”, and “Baseball”, which are displayed as a list on a TV display unit, and further can select a name of a desired program by voice from among programs displayed as a list after confirming program names, broadcasting times, broadcast channels, and broadcast contents relating to programs which coincide with the selected category (refer to JP-Tokukai-H09-322086A as Patent Document 2).
  • Furthermore, there is a speech recognition apparatus in which when the electronic instrument is controlled based on a speech signal inputted from a microphone, it is made easy to recognize a mode of the control by dynamically displaying a character and the like on a monitor (refer to JP-Tokukai-2000-322086A as Patent Document 3).
  • However, although the apparatuses disclosed in Patent Documents 1 to 3 have a function to display results of the speech recognition, the function merely makes it easy to grasp the results of the speech recognition by means of visual recognition, and cannot prevent in advance a malfunction caused by erroneous recognition of the speech recognition.
  • SUMMARY OF THE INVENTION
  • An object of the present invention is to provide an electronic instrument capable of preventing in advance the malfunction caused by the erroneous recognition of the speech recognition.
  • In accordance with a first aspect of the invention, in an electronic instrument which performs speech recognition by an acoustic model pattern for speech information inputted from an outside, and which is controlled by command information obtained based on a result of the speech recognition, the electronic instrument comprises:
  • a display control unit for displaying a control content corresponding to the command information based on the result of the speech recognition, on a display unit;
  • an instruction unit for instructing that a control for the control content displayed on the display unit by the display control unit, is cancelled;
  • a control unit for performing the control based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time, and for canceling the control based on the command information based on the result of the speech recognition when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time;
  • a shortening unit for shortening the standby time when a ratio of the number of executions in which one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a first set value;
  • a command information storage unit for storing the one kind of command information and a plurality of the acoustic model patterns so as to refer the one kind of command information to the plurality of the acoustic model patterns; and
  • a command information deletion unit for deleting one acoustic model pattern relating to the one kind of command information from the command information storage unit when a ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a second set value.
  • According to the first aspect of the invention, the control content corresponding to the command information based on the result of the speech recognition by the acoustic model pattern can be displayed on the display unit by the display control unit. Moreover, the control for the control content displayed on the display unit by the display control unit can be instructed to be cancelled by the instruction unit. Furthermore, the control can be performed by the control unit based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time. Meanwhile, the control based on the command information based on the result of the speech recognition can be cancelled by the control unit when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time.
  • Hence, it is possible to control the electronic instrument after the user confirms whether or not the control content corresponding to the command information obtained based on the result of the speech recognition is one intended by the user. In such a way, a malfunction of the electronic instrument, which is caused by erroneous recognition, can be prevented in advance.
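  • The confirm-then-execute behavior described above can be illustrated with a minimal sketch. The names `PendingCommand` and `resolve` are hypothetical and do not appear in the specification, and treating the end of the standby time as an exclusive boundary is an assumption, since the text only says "within the predetermined standby time".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PendingCommand:
    """A recognized command whose control content is currently displayed."""
    name: str        # command information, e.g. "ON"
    shown_at: float  # time (seconds) at which the display started
    standby: float   # predetermined standby time (seconds)


def resolve(pending: PendingCommand, cancel_at: Optional[float]) -> str:
    """Decide the outcome of a displayed command.

    cancel_at is the time at which a cancel instruction was received,
    or None if no cancel instruction was given at all.
    """
    deadline = pending.shown_at + pending.standby
    # Assumption: a cancel counts only if it arrives before the deadline.
    if cancel_at is not None and pending.shown_at <= cancel_at < deadline:
        return "cancelled"
    # No cancel within the standby time: the control is performed.
    return "executed"
```

  • For example, with a five-second standby time, a cancel instruction two seconds after the display starts cancels the control, while one arriving after six seconds is too late and the control has already been performed.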
  • Moreover, by the command information storage unit, the one kind of command information and the plurality of acoustic model patterns can be stored so as to refer the one kind of command information to the plurality of acoustic model patterns. Furthermore, the standby time can be shortened by the shortening unit when the ratio of the number of executions in which the one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the first set value.
  • Hence, with regard to the command information in which a success rate of the speech recognition is high, it is possible to shorten the standby time since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit. Thus, it is possible to rapidly perform the control corresponding to the command information obtained based on the result of the speech recognition.
  • Furthermore, the one acoustic model pattern relating to the one kind of command information can be deleted from the command information storage unit by the command information deletion unit when the ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the second set value.
  • Hence, with regard to the one acoustic model pattern relating to the command information in which a possibility of the erroneous recognition is high, it is possible to delete the one acoustic model pattern from the command information storage unit. In such a way, it is possible to prevent in advance the malfunction of the electronic instrument, which is caused by the erroneous recognition, and it is possible to customize the command information storage unit.
  • In accordance with a second aspect of the invention, in an electronic instrument which performs speech recognition by an acoustic model pattern for speech information inputted from an outside, and which is controlled by command information obtained based on a result of the speech recognition, the electronic instrument comprises:
  • a display control unit for displaying a control content corresponding to the command information based on the result of the speech recognition, on a display unit;
  • an instruction unit for instructing that a control for the control content displayed on the display unit by the display control unit, is cancelled;
  • a control unit for performing the control based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time, and for canceling the control based on the command information based on the result of the speech recognition when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time.
  • According to the second aspect of the invention, the control content corresponding to the command information based on the result of the speech recognition by the acoustic model pattern can be displayed on the display unit by the display control unit. Moreover, the control for the control content displayed on the display unit by the display control unit can be instructed to be cancelled by the instruction unit. Furthermore, the control can be performed by the control unit based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time. Meanwhile, the control based on the command information based on the result of the speech recognition can be cancelled by the control unit when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time.
  • Hence, it is possible to control the electronic instrument after the user confirms whether or not the control content corresponding to the command information obtained based on the result of the speech recognition is one intended by the user. In such a way, a malfunction of the electronic instrument, which is caused by erroneous recognition, can be prevented in advance.
  • Preferably, the electronic instrument further comprises:
  • a shortening unit for shortening the standby time when a ratio of the number of executions in which one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a first set value.
  • According to the above invention, an effect similar to that of the second aspect of the invention can be obtained. Further, the standby time can be shortened by the shortening unit when the ratio of the number of executions in which the one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the first set value.
  • Hence, with regard to the command information in which a success rate of the speech recognition is high, it is possible to shorten the standby time since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit. Thus, it is possible to rapidly perform the control corresponding to the command information obtained based on the result of the speech recognition.
  • Preferably, the electronic instrument further comprises:
  • a command information storage unit for storing one kind of command information and a plurality of the acoustic model patterns so as to refer the one kind of command information to the plurality of the acoustic model patterns; and
  • a command information deletion unit for deleting one acoustic model pattern relating to the one kind of command information from the command information storage unit when a ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a second set value.
  • According to the above invention, an effect similar to that of the second aspect of the invention can be obtained. Further, by the command information storage unit, the one kind of command information and the plurality of acoustic model patterns can be stored so as to refer the one kind of command information to the plurality of acoustic model patterns. Moreover, the one acoustic model pattern relating to the one kind of command information can be deleted from the command information storage unit by the command information deletion unit when the ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds the second set value.
  • Hence, with regard to the one acoustic model pattern relating to the command information in which a possibility of the erroneous recognition is high, it is possible to delete the one acoustic model pattern from the command information storage unit. In such a way, it is possible to prevent in advance the malfunction of the electronic instrument, which is caused by the erroneous recognition, and it is possible to customize the command information storage unit.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The present invention will become fully understood from the detailed description given hereinafter and the accompanying drawings given by way of illustration only, and thus are not intended as a definition of the limits of the present invention, wherein:
  • FIG. 1 is a block diagram showing a configuration of main portions of a digital television set according to the present invention;
  • FIG. 2 is a view showing an example of a command table in the present invention;
  • FIG. 3 is a view showing a display example by the display control unit of the present invention;
  • FIG. 4 is a flowchart showing operation processing of an instrument body portion in the present invention; and
  • FIG. 5 is a flowchart showing command table update processing in the present invention.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
  • A description will be made below in detail of an embodiment of the present invention while referring to the drawings.
  • Note that the description will be made of the embodiment by taking a digital television set as an example of an electronic instrument. However, the electronic instrument is not limited to the digital television set, and for example, may be electrical appliances such as a videotape recorder and an air conditioner.
  • First, a description will be made of a configuration of the entire instrument and a configuration of main portions thereof while referring to FIG. 1.
  • For example, a digital television set 100 of this embodiment comprises an instrument body portion 1 that receives television broadcast signals (hereinafter, referred to as broadcast signals), converts the received broadcast signals into predetermined output signals, and outputs video/audio data, and a microphone-equipped remote control device (hereinafter, referred to as a microphone-equipped remote controller) 2 as an instruction unit.
  • Next, the description will be made of the configuration of the main portions of the digital television set 100.
  • The instrument body portion 1 comprises an antenna 3 that receives the broadcast signals, a tuner 4 that selects a broadcast signal of a predetermined broadcast channel from the broadcast signals received by the antenna 3, a demodulation unit 5 that performs demodulation and error correction processing for the broadcast signal outputted from the tuner 4, a descrambler 6 that descrambles a scramble signal for preventing illegal watching, which is added to the broadcast signal demodulated by the demodulation unit 5, a demultiplexer 7 that acquires the video/audio data by separating and extracting respective data from the broadcast signal descrambled by the descrambler 6, a decoder 8 that performs decompression processing for the video/audio data acquired by the demultiplexer 7, an on-screen character display (OSD) processing unit 9 as a display control unit for adding OSD data to the video data decompressed by the decoder 8, a receiver 10 as a display unit for outputting the video/audio data processed by the decoder 8 and the OSD processing unit 9, a speech information acquisition unit 11 that acquires speech information, a control unit 12 that controls the entirety of the instrument body portion 1 in a centralized manner, and a control bus 13 that interconnects the respective units.
  • For example, a microphone is used as the speech information acquisition unit 11, and the speech information acquisition unit 11 acquires speech directly as the speech information.
  • The speech information acquisition unit 11 functions as an instruction unit by acquiring, from a user, speech information instructing that the control for the control content displayed when a central processing unit (CPU) 121 executes a display control program 123 c be cancelled.
  • The speech information acquisition unit 11 may be designed not only to acquire the speech directly as the speech information but also to acquire a speech signal as the speech information, which is obtained after converting the speech into the speech signal.
  • For example, as shown in FIG. 1, the control unit 12 comprises the CPU 121, a random access memory (RAM) 122, a storage unit 123, and the like.
  • The CPU 121 performs a variety of control operations in accordance with a command table and a variety of processing programs, which are stored in the storage unit 123.
  • The RAM 122 comprises a program storage area for expanding the processing programs and the like, which are executed by the CPU 121, a data storage area that stores input data, processing results generated when the above-described processing programs are executed, and the like.
  • The storage unit 123 stores a system program executable by the instrument body portion 1, the variety of processing programs executable by the system program, data for use in the case of executing this variety of processing programs, data of results of arithmetic processing executed by the CPU 121, and the like. Note that the programs are stored in the storage unit 123 in a form of computer-readable program codes.
  • Specifically, for example, as shown in FIG. 1, in the storage unit 123, a command table 123 a, a speech recognition program 123 b, a display control program 123 c, a control program 123 d, a shortening program 123 e, a command information deletion program 123 f, and the like, are stored.
  • The command table 123 a is a table that stores command information including speech information for controlling the instrument body portion 1.
  • Specifically, for example, as shown in FIG. 2, the command table 123 a stores one kind of the command information and a plurality of acoustic model patterns so as to refer one kind of the command information to a plurality of acoustic model patterns. Moreover, the command table 123 a stores a plurality of acoustic model patterns relating to each command information, the number of executions in which the command information corresponding to the acoustic model pattern is executed, and the number of cancellations in which the corresponding command information is cancelled so as to refer them to one another.
  • Furthermore, the command table 123 a stores each command information and the command to be displayed, which indicates the control contents of the command information, so as to refer them to each other.
  • The storage unit 123 stores the command table 123 a, and functions as a command information storage unit.
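  • As a rough illustration of the associations the command table 123 a is described as holding, the following sketch models one entry in Python. The actual layout of FIG. 2 is not reproduced here, so the class and field names (`PatternStats`, `CommandEntry`, `display_text`) are assumptions made only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class PatternStats:
    """Per-pattern counters kept alongside one kind of command information."""
    executions: int = 0     # times the command was executed via this pattern
    cancellations: int = 0  # times the command was cancelled via this pattern


@dataclass
class CommandEntry:
    """One kind of command information and everything referred to it."""
    display_text: str  # the "command to be displayed" for this command
    patterns: Dict[str, PatternStats] = field(default_factory=dict)


# Hypothetical table contents: one command reachable through two
# acoustic model patterns, each with its own counters.
command_table: Dict[str, CommandEntry] = {
    "ON": CommandEntry(
        display_text="Result of speech recognition: "
                     "Are you sure the speech is 'ON'?",
        patterns={"1": PatternStats(), "2": PatternStats()},
    ),
}
```

  • Keeping the counters per acoustic model pattern, rather than per command, is what allows a single unreliable pattern to be removed later without discarding the command itself.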
  • The speech recognition program 123 b is a program that allows the CPU 121 to realize a function to perform speech recognition processing for the speech information acquired by the speech information acquisition unit 11.
  • Here, the speech recognition is realized, for example, by using speech recognition software named Julius, and by accumulating statistical data of enormous speech patterns and language patterns. A basic principle of the speech recognition is as follows. Specifically, frequency patterns of acoustic models (phonemes, each of which substantially corresponds to one alphabetical character, and syllables, each of which corresponds to a kana character of Japanese) are held. Then, while a word dictionary is referred to, each of the model patterns is matched with a signal of a syllable row obtained by cutting out a monosyllabic portion of a speech wave signal of the input speech. In such a way, the speech recognition is performed.
  • Specifically, the CPU 121 performs the speech recognition for the speech information acquired by the speech information acquisition unit 11 by using one acoustic model pattern of the plurality of preset acoustic model patterns, and extracts command information that coincides with a result of the speech recognition from the command table 123 a. Moreover, when the command information that coincides with the result is not present in the command table 123 a as a result of the speech recognition by the one acoustic model pattern, the CPU 121 changes the acoustic model pattern, performs the speech recognition one more time, and extracts command information that coincides with the result of the speech recognition from the command table 123 a.
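  • The retry behavior described above might be sketched as follows. The recognizer is passed in as a plain callable because the interface of the actual recognition engine is not described in the text; `recognize_command` and its parameters are hypothetical names. Note also that the text speaks of changing the pattern and recognizing "one more time", whereas this sketch simply tries every supplied pattern in order, which is a generalization.

```python
from typing import Callable, Iterable, Optional, Set, Tuple


def recognize_command(
    audio: bytes,
    pattern_ids: Iterable[str],
    known_commands: Set[str],
    recognize: Callable[[bytes, str], Optional[str]],
) -> Optional[Tuple[str, str]]:
    """Try each acoustic model pattern until the result is in the table.

    Returns (command, pattern_id) for the first pattern whose recognition
    result coincides with known command information, or None otherwise.
    """
    for pattern_id in pattern_ids:
        result = recognize(audio, pattern_id)
        if result in known_commands:
            # Remember which pattern produced the match so that its
            # execution/cancellation counters can be updated later.
            return result, pattern_id
    return None
```

  • Returning the pattern identifier together with the command is a design choice of the sketch: the deletion logic described later depends on knowing which pattern was responsible for each recognition.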
  • The display control program 123 c is a program that allows the CPU 121 to realize a function to display the control content corresponding to the command information based on the result of the speech recognition, on the receiver 10.
  • Specifically, as shown in FIG. 3, by executing the display control program 123 c, the CPU 121 extracts a command to be displayed from the command table 123 a, which corresponds to the one command information obtained based on the result of the speech recognition, for example, as shown in FIG. 2. Then, the CPU 121 processes the command to be displayed in the OSD processing unit 9, and displays the command to be displayed on the receiver 10 as the display unit.
  • The CPU 121 executes the display control program 123 c, and functions as the display control unit.
  • The control program 123 d is a program that allows the CPU 121 to realize a function to control the instrument body portion 1 by using the command information based on the speech recognition result.
  • Specifically, for example, when the speech information acquisition unit 11 does not instruct that the control for the control content corresponding to the command information based on the result of the speech recognition is cancelled by the speech information within a predetermined standby time since the control content starts to be displayed on the receiver 10 by executing the display control program 123 c, the CPU 121 performs the control based on the command information based on the result of the speech recognition by executing the control program 123 d after the predetermined standby time elapses. Meanwhile, when by the speech information, the speech information acquisition unit 11 instructs that the control for the control content is cancelled, the CPU 121 cancels the control based on the command information based on the result of the speech recognition by the control program 123 d.
  • The CPU 121 executes the control program 123 d, and functions as a control unit.
  • The shortening program 123 e is a program that allows the CPU 121 to realize a function to shorten the predetermined standby time since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the receiver 10 by executing the display control program 123 c.
  • Specifically, when the ratio of the number of executions in which one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by the display control program 123 c exceeds a first set value, the CPU 121 shortens the standby time by executing the shortening program 123 e.
  • The CPU 121 executes the shortening program 123 e, and functions as a shortening unit.
  • Note that it is possible to provide the first set value in stages. For example, the standby time may be set so as to be shortened in the following manner. Specifically, when the ratio of the number of executions in which the one kind of command information based on the result of the speech recognition is executed with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by the display control program 123 c is 85% or more, the standby time is set to five seconds. When the ratio is 90% or more, the standby time is shortened to three seconds. When the ratio is 95% or more, the standby time is shortened to zero seconds.
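  • The staged example above can be written as a small lookup. The thresholds and the five-, three-, and zero-second values come from the text; the unshortened default of ten seconds and the names `DEFAULT_STANDBY_S`, `STAGES`, and `standby_time` are assumptions, since the initial standby time is not stated here.

```python
# Assumption: the text does not give the unshortened standby time.
DEFAULT_STANDBY_S = 10.0

# (minimum execution ratio, standby time in seconds), highest stage first.
STAGES = [(0.95, 0.0), (0.90, 3.0), (0.85, 5.0)]


def standby_time(executions: int, displays: int) -> float:
    """Return the standby time for one kind of command information."""
    if displays <= 0:
        # Nothing displayed yet, so there is no ratio to evaluate.
        return DEFAULT_STANDBY_S
    ratio = executions / displays
    for threshold, seconds in STAGES:
        if ratio >= threshold:
            return seconds
    return DEFAULT_STANDBY_S
```

  • For instance, a command executed 92 times out of 100 displays would fall in the "90% or more" stage and receive a three-second standby time.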
  • The command information deletion program 123 f is a program that allows the CPU 121 to realize a function to delete one acoustic model pattern relating to the one kind of command information stored in the command table 123 a.
  • Specifically, when a ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by executing the display control program 123 c exceeds a second set value, the CPU 121 deletes the one acoustic model pattern relating to the one kind of command information from the command table 123 a by executing the command information deletion program 123 f.
  • The CPU 121 executes the command information deletion program 123 f, and functions as a command information deletion unit.
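  • A minimal sketch of the deletion rule follows. Two points are assumptions rather than statements from the text: the second set value is given no number in this passage, so 0.5 is a placeholder, and the number of displays is approximated as executions plus cancellations on the premise that every display ends in one of the two. The name `prune_patterns` is hypothetical.

```python
from typing import Dict, List

# Assumption: placeholder value; the text does not state the second set value.
SECOND_SET_VALUE = 0.5


def prune_patterns(
    patterns: Dict[str, Dict[str, int]],
    threshold: float = SECOND_SET_VALUE,
) -> List[str]:
    """Delete patterns whose cancellation ratio exceeds the threshold.

    patterns maps a pattern id to its "executions" and "cancellations"
    counters for one kind of command information. The dict is modified
    in place and the ids of the deleted patterns are returned.
    """
    removed = []
    for pattern_id in list(patterns):  # copy keys: we delete while iterating
        stats = patterns[pattern_id]
        # Assumption: each display ends in an execution or a cancellation.
        displays = stats["executions"] + stats["cancellations"]
        if displays > 0 and stats["cancellations"] / displays > threshold:
            del patterns[pattern_id]
            removed.append(pattern_id)
    return removed
```

  • With counters of 9 executions and 1 cancellation for pattern "1", and 2 executions and 8 cancellations for pattern "2", only pattern "2" would be deleted under the placeholder threshold.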
  • Next, a description will be made of an operation of the instrument body portion 1 of the present invention by using FIG. 4.
  • First, when, in Step S1, the instrument body portion 1 is in a standby state for acquiring the speech information, and the CPU 121 determines that the speech information is acquired by the speech information acquisition unit 11 (Step S1: Yes), the CPU 121 proceeds to Step S2. Meanwhile, when the CPU 121 determines that the speech information is not acquired (Step S1: No), the instrument body portion 1 returns to the speech information acquisition standby state.
  • Subsequently, in Step S2, the CPU 121 performs the speech recognition for the acquired speech information by executing the speech recognition program 123 b.
  • Specifically, for example, when the speech information “ON” is acquired by the speech information acquisition unit 11, the CPU 121 performs the speech recognition on the speech information by using an acoustic model pattern “1”, and extracts the command information “ON” that coincides with the result of the speech recognition, from the command table 123 a.
  • Subsequently, in Step S3, the CPU 121 executes the display control program 123 c to extract, from the command table 123 a, the command to be displayed that corresponds to the command information based on the result of the speech recognition, and displays the command to be displayed on the receiver 10 after processing it in the OSD processing unit 9.
  • Specifically, for example, when the command information “ON” is extracted as the result of the speech recognition, the CPU 121 extracts the command to be displayed “Result of speech recognition: Are you sure the speech is ‘ON’?”, which corresponds to the command information, and displays it on the receiver 10 as shown in FIG. 3.
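  • The lookup in Steps S2 and S3 can be pictured with a minimal sketch. The names `PatternEntry`, `COMMAND_TABLE`, `extract_command` and `confirmation_text`, and the table contents, are hypothetical and chosen only for illustration; the description does not specify how the command table 123 a is laid out in memory.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PatternEntry:
    """One row of a hypothetical command table: an acoustic model pattern
    and the command information it is associated with."""
    pattern_id: int
    command: str


# Assumed contents: two acoustic model patterns are associated with "ON".
COMMAND_TABLE = [
    PatternEntry(pattern_id=1, command="ON"),
    PatternEntry(pattern_id=2, command="ON"),
    PatternEntry(pattern_id=3, command="OFF"),
]


def extract_command(matched_pattern_id: int) -> Optional[str]:
    """Step S2 (sketch): return the command information for the pattern
    that the recognizer matched, or None if the pattern is not in the table."""
    for entry in COMMAND_TABLE:
        if entry.pattern_id == matched_pattern_id:
            return entry.command
    return None


def confirmation_text(command: str) -> str:
    """Step S3 (sketch): build the confirmation text shown on the display."""
    return f"Result of speech recognition: Are you sure the speech is ‘{command}’?"
```

  • The actual acoustic matching is outside the scope of this sketch; only the association between patterns, command information and the displayed text is shown.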
  • Subsequently, in Step S4, by executing the control program 123 d, the CPU 121 determines whether or not the predetermined standby time has elapsed since the control content corresponding to the command information based on the result of the speech recognition started to be displayed on the receiver 10. When it is determined that the predetermined standby time has elapsed (Step S4: Yes), the CPU 121 proceeds to Step S5 and executes the control for the instrument body portion 1 in accordance with the command information obtained based on the result of the speech recognition. Then, the CPU 121 proceeds to Step S9.
  • Meanwhile, in Step S4, in the case that it is determined that the predetermined standby time has not elapsed (Step S4: No), the CPU 121 proceeds to Step S6. The instrument body portion 1 enters the standby state for acquiring the speech information from the user.
  • Subsequently, in Step S6, in the case that it is determined that the speech information is acquired by the speech information acquisition unit 11 (Step S6: Yes), the CPU 121 proceeds to Step S7. Then, the CPU 121 performs the speech recognition for the speech information acquired from the outside by executing the speech recognition program 123 b, and determines whether or not the obtained command information is a cancellation command. Meanwhile, in Step S6, in the case that it is determined that the speech information is not acquired (Step S6: No), the CPU 121 returns to Step S4.
  • Specifically, when the user desires to cancel the control corresponding to the display “Result of speech recognition: Are you sure the speech is ‘ON’?”, for example, as shown in FIG. 3, the user inputs speech information “Cancel” to the speech information acquisition unit 11. The CPU 121 performs the speech recognition for the speech information “Cancel”.
  • Subsequently, in Step S7, in the case that it is determined that there is command information corresponding to the cancellation command as a result of the speech recognition (Step S7: Yes), the CPU 121 proceeds to Step S8. Meanwhile, in Step S7, in the case that it is determined that there is no command information corresponding to the cancellation command as the result of the speech recognition (Step S7: No), the CPU 121 returns to Step S4.
  • Subsequently, in Step S8, by executing the control program 123 d, the CPU 121 cancels the control in accordance with the command information obtained based on the result of the speech recognition. Then, the CPU 121 proceeds to Step S9.
  • Subsequently, in Step S9, the CPU 121 executes the shortening program 123 e and the command information deletion program 123 f, and performs the command table update processing described later. The CPU 121 then finishes the processing.
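  • The confirmation loop of Steps S3 to S8 can be sketched as follows. This is an illustration under stated assumptions, not the implementation of the embodiment: `confirm_then_execute`, `poll_speech`, `now` and `execute` are hypothetical names, speech recognition is reduced to a callable that returns already recognized command information (or `None`), and the clock is passed in so that the logic can be exercised without waiting.

```python
from typing import Callable, Optional

CANCEL = "Cancel"  # command information treated as the cancellation command


def confirm_then_execute(
    command: str,
    standby_seconds: float,
    poll_speech: Callable[[], Optional[str]],
    now: Callable[[], float],
    execute: Callable[[str], None],
) -> bool:
    """Return True if the control was executed, False if it was cancelled."""
    started = now()  # Step S3: the control content starts to be displayed
    while now() - started < standby_seconds:  # Step S4: No (time not elapsed)
        heard = poll_speech()                 # Step S6: new speech information?
        if heard == CANCEL:                   # Step S7: cancellation command?
            return False                      # Step S8: cancel the control
        # Step S6: No or Step S7: No, so return to Step S4
    execute(command)                          # Step S4: Yes, then Step S5
    return True
```

  • In a real device `now` could be `time.monotonic` and `poll_speech` a non-blocking read from the recognizer; the returned flag is what the update processing of Step S9 would consume.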
  • Next, by using FIG. 5, a description will be made of the update processing for the command table 123 a in Step S9 in FIG. 4.
  • First, in Step S11, when the control based on the command information obtained based on the result of the speech recognition is executed by executing the control program 123 d (Step S11: Yes), the CPU 121 proceeds to Step S12. Then, the CPU 121 adds “1” to the number of control executions, and proceeds to Step S13.
  • Specifically, for example, as shown in FIG. 2, when the control based on the command “ON” corresponding to an acoustic model pattern 1 is executed, the CPU 121 adds “1” to the number of control executions, so that the number of executions changes from “12” to “13”.
  • Meanwhile, in Step S11, when the control based on the command information obtained based on the result of the speech recognition is cancelled by executing the control program 123 d (Step S11: No), the CPU 121 proceeds to Step S15. Then, the CPU 121 adds “1” to the number of control cancellations, and proceeds to Step S16.
  • Specifically, for example, as shown in FIG. 2, when the control based on the command “ON” corresponding to an acoustic model pattern 2 is cancelled, the CPU 121 adds “1” to the number of control cancellations, so that the number of cancellations changes from “19” to “20”.
  • Subsequently, in Step S13, in the case that it is determined that the ratio of the number of executions to the number of displays exceeds the first set value (Step S13: Yes), the CPU 121 proceeds to Step S14 and shortens the predetermined standby time by executing the shortening program 123 e. Meanwhile, in Step S13, in the case that it is determined that the ratio does not exceed the first set value (Step S13: No), the CPU 121 finishes the processing.
  • Specifically, for example, as shown in FIG. 2, when it is determined that the ratio of the number of control executions based on the command “ON” corresponding to the acoustic model pattern 1 exceeds the first set value, the standby time is shortened, for example, to two seconds from five seconds.
  • Subsequently, in Step S16, in the case that it is determined that the ratio of the number of cancellations to the number of displays exceeds the second set value (Step S16: Yes), the CPU 121 proceeds to Step S17 and deletes the one acoustic model pattern relating to the command information from the command table 123 a by executing the command information deletion program 123 f. Meanwhile, in Step S16, in the case that it is determined that the ratio does not exceed the second set value (Step S16: No), the CPU 121 finishes the processing.
  • Specifically, for example, as shown in FIG. 2, when it is determined that the ratio of the number of control cancellations for the command “ON” corresponding to the acoustic model pattern 2 exceeds the second set value, the CPU 121 deletes the acoustic model pattern 2 relating to the command information from the command table 123 a.
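  • The update processing of FIG. 5 can be sketched as below. Several points are assumptions made only for illustration: the class and field names are hypothetical, the number of displays is incremented inside `update` (the description does not say where it is counted), a single standby time is kept for the whole table, and the threshold values and display counts in the usage note are invented. Only the execution and cancellation counts (12 to 13, 19 to 20) and the five-second and two-second standby times come from the description.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class PatternStats:
    """Counters kept for one acoustic model pattern (hypothetical layout)."""
    displays: int = 0
    executions: int = 0
    cancellations: int = 0


@dataclass
class CommandTable:
    first_set_value: float    # threshold on executions / displays
    second_set_value: float   # threshold on cancellations / displays
    standby_seconds: float = 5.0
    shortened_seconds: float = 2.0
    patterns: Dict[int, PatternStats] = field(default_factory=dict)

    def update(self, pattern_id: int, executed: bool) -> None:
        """Step S9 (sketch): update counters after one confirmation cycle."""
        stats = self.patterns[pattern_id]
        stats.displays += 1  # assumption: one display per confirmation cycle
        if executed:                                   # Step S11: Yes
            stats.executions += 1                      # Step S12
            ratio = stats.executions / stats.displays
            if ratio > self.first_set_value:           # Step S13
                self.standby_seconds = self.shortened_seconds  # Step S14
        else:                                          # Step S11: No
            stats.cancellations += 1                   # Step S15
            ratio = stats.cancellations / stats.displays
            if ratio > self.second_set_value:          # Step S16
                del self.patterns[pattern_id]          # Step S17
```

  • For instance, with assumed set values of 0.8 and 0.5, a pattern 1 that moves from 12 to 13 executions over 15 displays would shorten the standby time, and a pattern 2 that moves from 19 to 20 cancellations over 31 displays would be removed from the table.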
  • In the digital television set 100 according to the present invention described above, when the CPU 121 executes the display control program 123 c, the control content corresponding to the command information based on the result of the speech recognition by the acoustic model pattern can be displayed on the receiver 10. Moreover, when the speech information acquisition unit 11 acquires speech information indicating that the control is to be cancelled, it is possible to instruct that the control for the control content displayed on the receiver 10 be cancelled. Furthermore, when no such cancellation instruction is given within the predetermined standby time, measured from when the control content starts to be displayed on the receiver 10, the CPU 121 executes the control program 123 d and performs the control based on the command information once the standby time has elapsed. Meanwhile, when the cancellation instruction is given within the predetermined standby time, the control based on the command information based on the result of the speech recognition can be cancelled.
  • Hence, it is possible to control the electronic instrument after the user confirms whether or not the control content corresponding to the command information obtained based on the result of the speech recognition is one intended by the user. In such a way, a malfunction of the electronic instrument, which is caused by erroneous recognition, can be prevented in advance.
  • Moreover, the command table 123 a can store the one kind of command information and the plurality of acoustic model patterns so that the one kind of command information is associated with the plurality of acoustic model patterns. Furthermore, the standby time can be shortened by executing the shortening program 123 e when the ratio of the number of executions, in which the one kind of command information based on the result of the speech recognition is executed, to the number of displays, in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by executing the display control program 123 c, exceeds the first set value.
  • Hence, for command information whose speech recognition success rate is high, it is possible to shorten the standby time measured from when the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the receiver 10. Thus, the control corresponding to the command information obtained based on the result of the speech recognition can be performed rapidly.
  • Furthermore, when the ratio of the number of cancellations, in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled, to the number of displays, in which the control content corresponding to the one kind of command information is displayed on the receiver 10 by executing the display control program 123 c, exceeds the second set value, the CPU 121 can delete the one acoustic model pattern relating to the one kind of command information from the command table 123 a by executing the command information deletion program 123 f.
  • Hence, with regard to the one acoustic model pattern relating to the command information in which a possibility of the erroneous recognition is high, it is possible to delete the one acoustic model pattern from the command table 123 a. In such a way, it is possible to prevent in advance the malfunction of the instrument body portion 1, which is caused by the erroneous recognition, and it is possible to customize the command table 123 a.
  • Note that the present invention is not limited to the above-described embodiment, and a variety of modifications and design changes may be made without departing from the gist of the present invention.
  • For example, the instruction unit is not limited to one that receives, from the outside, speech information indicating that the control is to be cancelled. An instruction unit in which an operation key for cancelling the control is provided on a remote controller or the like may be adopted, so that the control is cancelled by operating the key.
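  • One way to picture this modification is to treat the instruction unit as an interface with interchangeable sources of the cancellation instruction. The sketch below is an assumption for illustration only; `InstructionUnit`, `SpeechInstructionUnit`, `RemoteKeyInstructionUnit` and `should_cancel` are hypothetical names that do not appear in the description.

```python
from typing import Optional, Protocol


class InstructionUnit(Protocol):
    """Anything that can report whether cancellation has been instructed."""

    def cancel_requested(self) -> bool: ...


class SpeechInstructionUnit:
    """Cancellation instructed by recognized speech such as "Cancel"."""

    def __init__(self) -> None:
        self.last_command: Optional[str] = None

    def cancel_requested(self) -> bool:
        return self.last_command == "Cancel"


class RemoteKeyInstructionUnit:
    """Cancellation instructed by an operation key on a remote controller."""

    def __init__(self) -> None:
        self.cancel_key_pressed = False

    def cancel_requested(self) -> bool:
        return self.cancel_key_pressed


def should_cancel(unit: InstructionUnit) -> bool:
    """The control side only asks the question; it does not care about the source."""
    return unit.cancel_requested()
```

  • With this arrangement the standby-time logic stays unchanged whichever kind of instruction unit is adopted.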
  • The entire disclosure of Japanese Patent Application No. 2006-113635 filed on Apr. 17, 2006 is incorporated herein by reference in its entirety.

Claims (4)

1. An electronic instrument which performs speech recognition by an acoustic model pattern for speech information inputted from an outside, and which is controlled by command information obtained based on a result of the speech recognition, the electronic instrument comprising:
a display control unit for displaying a control content corresponding to the command information based on the result of the speech recognition, on a display unit;
an instruction unit for instructing that a control for the control content displayed on the display unit by the display control unit, is cancelled;
a control unit for performing the control based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time, and for canceling the control based on the command information based on the result of the speech recognition when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time;
a shortening unit for shortening the standby time when a ratio of the number of executions in which one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a first set value;
a command information storage unit for storing the one kind of command information and a plurality of the acoustic model patterns so as to refer the one kind of command information to the plurality of the acoustic model patterns; and
a command information deletion unit for deleting one acoustic model pattern relating to the one kind of command information from the command information storage unit when a ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which the control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a second set value.
2. An electronic instrument which performs speech recognition by an acoustic model pattern for speech information inputted from an outside, and which is controlled by command information obtained based on a result of the speech recognition, the electronic instrument comprising:
a display control unit for displaying a control content corresponding to the command information based on the result of the speech recognition, on a display unit;
an instruction unit for instructing that a control for the control content displayed on the display unit by the display control unit, is cancelled;
a control unit for performing the control based on the command information based on the result of the speech recognition after a predetermined standby time elapses since the control content corresponding to the command information based on the result of the speech recognition starts to be displayed on the display unit by the display control unit when the instruction unit does not instruct that the control for the control content is cancelled within the predetermined standby time, and for canceling the control based on the command information based on the result of the speech recognition when the instruction unit instructs that the control for the control content is cancelled within the predetermined standby time.
3. The electronic instrument as claimed in claim 2, further comprising:
a shortening unit for shortening the standby time when a ratio of the number of executions in which one kind of the command information based on the result of the speech recognition is executed with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a first set value.
4. The electronic instrument as claimed in claim 2, further comprising:
a command information storage unit for storing one kind of command information and a plurality of the acoustic model patterns so as to refer the one kind of command information to the plurality of the acoustic model patterns; and
a command information deletion unit for deleting one acoustic model pattern relating to the one kind of command information from the command information storage unit when a ratio of the number of cancellations in which the one kind of command information based on the result of the speech recognition by the one acoustic model pattern is cancelled with respect to the number of displays in which a control content corresponding to the one kind of command information is displayed on the display unit by the display control unit, exceeds a second set value.
US11/785,241 2006-04-17 2007-04-16 Electronic instrument for speech recognition with standby time shortening and acoustic model deletion Expired - Fee Related US7853448B2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2006-113635 2006-04-17
JP2006113635A JP2007286356A (en) 2006-04-17 2006-04-17 Electronic equipment

Publications (2)

Publication Number Publication Date
US20070244705A1 (en) 2007-10-18
US7853448B2 US7853448B2 (en) 2010-12-14

Family

ID=38024530

Family Applications (1)

Application Number Title Priority Date Filing Date
US11/785,241 Expired - Fee Related US7853448B2 (en) 2006-04-17 2007-04-16 Electronic instrument for speech recognition with standby time shortening and acoustic model deletion

Country Status (3)

Country Link
US (1) US7853448B2 (en)
EP (1) EP1847987A1 (en)
JP (1) JP2007286356A (en)

Also Published As

Publication number Publication date
EP1847987A1 (en) 2007-10-24
US7853448B2 (en) 2010-12-14
JP2007286356A (en) 2007-11-01

Legal Events

Date Code Title Description
AS Assignment

Owner name: FUNAI ELECTRIC CO., LTD., JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NARITA, SHUSUKE;TOKOSHIMA, SUSUMU;REEL/FRAME:019430/0114

Effective date: 20070404

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FEPP Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.)

LAPS Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20181214