WO2011057507A1 - Method and apparatus for emphasizing video conference on-site atmosphere - Google Patents

Method and apparatus for emphasizing video conference on-site atmosphere Download PDF

Info

Publication number
WO2011057507A1
WO2011057507A1 PCT/CN2010/075229 CN2010075229W WO2011057507A1 WO 2011057507 A1 WO2011057507 A1 WO 2011057507A1 CN 2010075229 W CN2010075229 W CN 2010075229W WO 2011057507 A1 WO2011057507 A1 WO 2011057507A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
ambience
audio data
atmosphere
processing
Prior art date
Application number
PCT/CN2010/075229
Other languages
French (fr)
Chinese (zh)
Inventor
黄杰华
Original Assignee
华为终端有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为终端有限公司 filed Critical 华为终端有限公司
Publication of WO2011057507A1 publication Critical patent/WO2011057507A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Definitions

  • the present invention claims priority to Chinese Patent Application No. 200910221646. filed on Nov. 11, 2009, the entire contents of which is hereby incorporated by reference.
  • BACKGROUND OF THE INVENTION 1 Field of the Invention The present invention relates to the field of image processing, and more particularly to a method and apparatus for highlighting a live atmosphere of a video conference.
  • a video conference system refers to an individual or a group of two or more different places, and transmits sound, image, and document data to each other through a transmission line and a multimedia device (such as a video camera) to achieve instant and interactive communication to complete
  • the system of conference purposes the system is a typical image communication system.
  • the image and sound signals are encoded into digital signals, which are then decoded and displayed as visual and audible information at the receiving end, which is intuitive and has a large amount of information compared with the conference call.
  • the current video conferencing system can not reflect the atmosphere theme of the conference site. For example, it is now necessary to use the video conferencing system to open a sad-themed memorial service, while the existing video conferencing system can only capture the live video captured by the camera. Passed over, the picture is only a real reflection of the scene, and people can only perceive the sadness of the field according to the content in the picture, and can not intuitively perceive the sadness.
  • Embodiments of the present invention provide a method and apparatus for highlighting a live atmosphere of a video conference, which is used to clearly highlight the live atmosphere of the conference, so that the participant can intuitively feel the atmosphere of the conference.
  • An embodiment of the present invention provides a method for highlighting a live atmosphere of a video conference, the method comprising: receiving video and audio data of each venue; performing image and sound processing on the video and audio data to highlight an atmosphere of the current video conference; The video and audio data after the image sound processing is fed back to each venue.
  • the embodiment of the present invention further provides an apparatus for highlighting a live atmosphere of a video conference, comprising: a receiving unit, configured to receive video and audio data of each venue; and an ambience processing unit, configured to perform image and sound processing on the video and audio data.
  • a receiving unit configured to receive video and audio data of each venue
  • an ambience processing unit configured to perform image and sound processing on the video and audio data.
  • the sending unit is configured to feed back the video and audio data processed by the image sound to each venue.
  • the image processing technology performs image sound processing on the video and audio data of each venue by image processing technology to highlight the atmosphere of the current video conference, so that the participants can intuitively feel the live atmosphere of the conference from the video and audio.
  • FIG. 1 is a structural diagram of a video conference system according to Embodiment 1 of the present invention.
  • FIG. 2 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 1 of the present invention
  • FIG. 3 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 2 of the present invention
  • FIG. 5 is a schematic structural diagram of a method for highlighting a live atmosphere of a video conference according to Embodiment 4 of the present invention
  • FIG. 6 is a schematic structural diagram of a device for highlighting a live atmosphere of a video conference according to Embodiment 4 of the present invention
  • FIG. 1 is a structural diagram of a video conference system according to Embodiment 1 of the present invention.
  • the system includes: a multi-point control unit 101 (MCU), site cameras 102, 103, and 104, wherein the site cameras 102, 103, and 104 are distributed in three different venues, and they are transmitted through multiple transmission lines.
  • the point control unit 101 is connected, for example, via an IP network.
  • the multipoint control unit 101 can forward the video and audio data from each site to other sites, such as forwarding the data collected by the field camera 102 to the site cameras 103 and 104.
  • the multipoint control unit 101 can forward the data during the forwarding process. Perform certain processing, such as splicing each picture to enhance the conference experience.
  • FIG. 2 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 1 of the present invention, where the method includes the following steps:
  • S201 Receive video and audio data of each site, specifically, the multi-point control unit receives video and audio data from each site, and the process of receiving data belongs to the prior art, and is not described herein.
  • S202 Perform image and sound processing on the video and audio data to highlight the atmosphere of the current video conference.
  • the multi-point control unit needs to perform special image and sound processing on the received video and audio data, so that the participants can intuitively feel the atmosphere of the conference from the video and audio.
  • the above-described image sound processing may include various processing techniques such as rhythm control processing, color rendering processing, line optimization processing, background fusion processing, special effect generation superimposition processing, or texture simulation processing.
  • the rhythm control process is to change the frame rate of the received video and audio data, and change the progress of the entire video and audio data processing to make the playback speed faster or slower.
  • Speeding up can highlight a cheerful or rushing live atmosphere, while slowing down can highlight a heavy or serious live atmosphere. For example, converting 25 frames per second of video and audio sequences to 30 frames per second for a cheerful, rushing, and tense feeling; or converting 30 frames per second of video and audio sequences to 25 frames per second.
  • processing in order to make people feel heavy or serious atmosphere, of course, embodiments of the present invention are not limited to such a frame rate conversion.
  • the color rendering process is to enhance the color and color of the received video and audio data: for example, for a positive and positive scene atmosphere, color enhancement, and more warm colors, giving a feeling of sunshine upwards; Negative live atmosphere, using color fade, and more use of cool colors, giving a dark and cloudy feeling.
  • the line optimization process detects the received video data, finds large outlines in the image, and has obvious edges. For example, for a positive positive scene atmosphere, the curve of the contour lines is optimized to make the shape of the object in the image appear more Graceful; for the negative atmosphere, the contour lines are optimized in a straight line, so that the shape and contour of the objects in the image appear more monotonous, to highlight dignity and seriousness.
  • the background fusion process is to select some backgrounds of the theme atmosphere in advance.
  • the selection range includes different themes, different industries, different seasons, different meeting places, and some theme of different meeting sizes. Then, according to the actual meeting, select the materials that are suitable for the meeting. , the fusion of the background and the actual video and audio received.
  • Special effects generation overlay processing is based on the needs of some special applications, to generate some special effects, to highlight the atmosphere of the meeting. For example: When you open a meeting with a sad theme, you can make an image of tears, superimposed on some objects, such as tables and cups superimposed on the meeting scene.
  • the texture simulation process first detects the texture area in the image, and then processes the texture area according to the current environmental parameters, temperature, humidity, brightness, etc., so that other people in the venue can see the texture of the table and the meeting site. People see the same.
  • the image and sound processing performed by the multipoint control unit of the embodiment of the present invention is not limited to the above-mentioned several modes, and is not limited to the simultaneous use of several of the above processing modes.
  • S203 The video and audio data processed by the image sound is fed back to each venue.
  • the multipoint control unit 101 receives the video and audio data of the conference camera 102, and after performing the image sound processing in step S202, retransmits the processed image.
  • the site cameras 102, 103, and 104 the same processing for the data of the received sites 103 and 104.
  • the image processing technology performs image sound processing on the video and audio data of each venue by image processing technology to highlight the atmosphere of the current video conference, so that the participants can intuitively feel the live atmosphere of the conference from the video and audio.
  • Embodiment 2
  • FIG. 3 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 2 of the present invention. It should be noted that the embodiment of the present invention also describes the present invention from the perspective of a multipoint control unit. Including the following steps:
  • S301 Set an ambient ambience mode of the current video conference, where the ambient ambience mode includes an ambience preset value.
  • an environment atmosphere mode conforming to the atmosphere of the conference is set in the multi-point control unit, and the environment atmosphere mode can be pre-stored in the storage unit in the multi-point control unit, and can be selected when needed to be called. One of them will be run.
  • These ambient climate models include A series of ambience presets, a sad atmosphere mode can include a series of presets as follows:
  • the preset value of this group is divided into five parts, which correspond to the following five processing methods: rhythm control processing, color rendering processing, line optimization processing, background fusion processing, special effect generation superposition processing, and their meanings are:
  • the rhythm control processing uses a slow playback processing method that converts a 30 frame/second sequence into 25 frames/second.
  • Color rendering processing adopts color fade processing
  • Line optimization processing is used to optimize the line of the corridor line
  • the background fusion process uses a preset background numbered 4;
  • Special effects generation overlay processing uses a preset special effect numbered 5.
  • S 302 Receive video and audio data and ambient parameters of each venue.
  • the originally moderated conference becomes more and more tense due to some disputes. Therefore, if the original ambience default value is used, it will not match the actual situation.
  • the multi-point control unit receives the video and audio data of each site, and also receives the ambience parameters sent by each site.
  • each site can use the ambience parameter and video.
  • the audio data is sent to the multi-point control unit together, and the ambient parameter can also be sent through the auxiliary stream, which can reflect the change of the atmosphere in each venue.
  • the transmission of the ambient parameters can be performed by the person in charge of the recording in each venue.
  • step S 303 According to the ambience parameter, it is determined whether the ambience preset value needs to be corrected: if necessary, the atmosphere The preset value is corrected, and if not, the process proceeds to step S304.
  • the ambience parameter may include information about whether the ambience preset value needs to be modified, and how to modify the information.
  • the multi-point control unit only needs to perform corresponding operations according to the information.
  • the ambience parameter may also be The above information is directly included, but the multi-point control unit is required to perform corresponding processing to obtain the above information, for example, a certain comparison between the ambient parameter and the ambience preset value.
  • S304 Perform image and sound processing on the video and audio data according to the ambience preset value to highlight an atmosphere of the current video conference.
  • step S301 the audio and video processing of the video and audio data according to the ambience preset value is similar to that described in step S301, and the specific content of the image sound processing is similar to that of the first embodiment, so it is no longer I will go into details.
  • S 305 Perform an effect evaluation on the video-audio-processed video and audio to determine whether it is necessary to update the ambience preset value.
  • the embodiment performs the effect evaluation on the processed video and audio in this step.
  • the effect evaluation can be achieved by comparing the processed video and audio with a preset template.
  • the multi-point control unit judges whether it is necessary to update the ambience preset value according to the evaluation result, if necessary, modifies the ambience preset value, and returns to step S304; if not, proceeds to step S306.
  • S 306 The video and audio data processed by the image sound is fed back to each venue.
  • the image processing technology performs image and sound processing on the video and audio data of each site by the image processing technology to highlight the atmosphere of the current video conference, so that the participant can intuitively feel the atmosphere of the conference from the video and audio.
  • the embodiment of the present invention not only detects the situation of each venue in real time. In other words, the ambience preset value is changed according to the specific situation, and the processed video and audio are also evaluated for effects, so that the embodiment of the present invention is more in line with the actual atmosphere of the conference and the effect of highlighting the conference atmosphere is more obvious.
  • Embodiment 3 Embodiment 3
  • FIG. 4 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 3 of the present invention, where the method includes the following steps:
  • Each venue selects its own ambient atmosphere mode according to the atmosphere of the venue, and the ambient atmosphere mode includes an atmosphere preset value.
  • Each site performs pre-image sound processing on the video and audio data collected according to the respective preset values of the atmosphere.
  • steps S401 and S402 are similar to the steps S 301 and S 304 in the second embodiment, respectively, except that the steps S 301 and S 304 in the second embodiment are all performed by the multipoint control unit, and in this embodiment,
  • the operation of the step is performed by each site end. Specifically, the operation may be performed by a video recording device in each site, or may be performed by a separate device connected to the video recording device. of.
  • each site will send the processed video and audio data to the multipoint control unit.
  • the multipoint control unit performs uniform adaptation optimization on the video and audio data processed by the prior image sound to highlight the overall atmosphere of the current video conference.
  • S405 The multi-point control unit feeds back the video and audio data after the unified adaptation optimization processing to each site.
  • each site may also perform an effect evaluation on the video and audio data after the previous image sound processing to determine whether it is necessary to modify the preset value of the atmosphere, if necessary After the modification, the audio and preset data are processed again, and the previous image sound processing is performed again. If no modification is needed, the video and audio data are sent to the multi-point control unit.
  • the embodiment of the present invention highlights the atmosphere of the current video conference by performing pre-image sound processing on the respective video and audio data by each site, so that the participant can intuitively feel the atmosphere of the conference from the video and audio.
  • the burden on the multipoint controller is greatly reduced.
  • FIG. 5 is a structural diagram of a device for highlighting a live atmosphere of a video conference according to Embodiment 4 of the present invention.
  • the device includes: a receiving unit 510, an ambience processing unit 520, and a sending unit 530, which are sequentially connected.
  • the receiving unit 510 is configured to receive video and audio data of each site.
  • the receiving unit 51 0 can receive video and audio data from each site through the Internet, a dedicated line network, or a direct cable, etc., specifically, Receive video and audio data from the camera unit of each venue.
  • the ambience processing unit 520 is configured to perform image sound processing on the video and audio data to highlight an atmosphere of the current video conference.
  • the apparatus of an embodiment requires special image and sound processing of the received video and audio data so that the participant can intuitively feel the atmosphere of the meeting from the video and audio.
  • the specific image sound processing methods can be various, such as described in the first embodiment: rhythm control processing, color rendering processing, line optimization processing, background fusion processing, special effect generation overlay processing or texture simulation processing, etc.
  • the processing principle and process are similar to those in the first embodiment, and will not be described again.
  • the sending unit 530 is configured to feed back the video and audio data processed by the image sound to each venue.
  • the receiving unit 510 is further configured to receive video and audio data that has undergone prior image sound processing, which is similar to the above image sound processing, such as rhythm control processing, color rendering processing. , line optimization processing, background fusion processing, special effect generation overlay processing or texture simulation processing.
  • prior image sound processing is performed by each site end.
  • the operation may be performed by a video recording device in each site, or may be an independent connection between the video recording device and the video recording device. The device to complete the operation.
  • the ambience processing unit 520 is also used to perform uniform adaptation of the video and audio data that has undergone prior image sound processing to highlight the overall ambience of the current video conference.
  • FIG. 6 is a structural diagram of a device for highlighting a live atmosphere of a video conference according to Embodiment 5 of the present invention.
  • the device includes: a receiving unit 610, an ambience processing unit 620, a sending unit 630, a mode setting unit 640, and a tampering unit 650. , a judging unit 660 and an updating unit 670.
  • the receiving unit 610 is configured to receive video and audio data of each site and the ambience parameters of each site. Since the atmosphere of the video conference may change according to the progress of the conference, if the original ambience preset value is always used, The actual situation does not match.
  • the ambience parameter received by the receiving unit 610 in this embodiment can reflect the change of the atmosphere in each venue. In practical applications, the sending of the ambience parameter can be performed by the person in charge of Nie Lu in each venue.
  • each venue may send the ambience parameter together with the video and audio data to the multipoint control unit, and the ambience parameter may also be sent through the auxiliary stream.
  • the mode setting unit 640 is configured to set an ambient ambience mode of the current video conference, where the ambient ambience mode includes an ambience preset value, and the ambient ambience mode may be pre-stored in a storage unit of the device, and may be selected when needed to be called. One of them will be run.
  • the ambience processing unit 620 may include an ambience processing sub-unit for performing audiovisual processing on the video and audio data received by the receiving unit 610 according to the ambience preset value to highlight the ambience of the current video conference.
  • the modifying unit 650 is configured to modify the ambience preset value according to the ambience parameter received by the receiving unit 610.
  • the ambience parameter may include information about whether the ambience preset value needs to be modified, and how to modify the information, and the modifying unit 650 only needs to According to the information, the corresponding operation may be performed.
  • the ambience parameter may not directly include the above information, but the modification unit 650 is required to perform corresponding processing to obtain the above information, for example, the ambience parameter and the ambience preset value. Make a certain comparison.
  • the determining unit 660 is configured to perform an effect evaluation on the video and audio data subjected to the image sound processing to determine whether it is necessary to update the ambience preset value. In order to ensure the image sound passing through the atmosphere processing unit 620 The processed video and audio conform to the atmosphere of the current conference site, or the processing effect is satisfactory.
  • the determining unit 660 of the embodiment performs an effect evaluation on the processed video and audio, and the effect evaluation can be performed by using the processed video. And audio is compared with a preset template.
  • the updating unit 670 is configured to update the ambience preset value according to the effect evaluation of the judging unit 660, so that the video and audio ambience processed by the ambience processing unit 620 is better.
  • the image processing technology performs image and sound processing on the video and audio data of each site by the image processing technology to highlight the atmosphere of the current video conference, so that the participant can intuitively feel the atmosphere of the conference from the video and audio.
  • the embodiment of the present invention not only detects the situation of each site in real time, but also changes the preset value of the atmosphere according to the specific situation, and also evaluates the effect of the processed video and audio, so that the embodiment of the present invention is more in line with the actual atmosphere of the conference and The effect of highlighting the atmosphere of the meeting is even more obvious.
  • This may be accomplished by a computer program instructing the associated hardware, which may be stored in a computer readable storage medium, which, when executed, may include the flow of an embodiment of the methods described above.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Abstract

A method and apparatus for emphasizing video conference on-site atmosphere is provided in the embodiment of the present invention. The method includes the following steps: receiving video and audio data from each assembly room; performing image and voice processing for said video and audio data to emphasize the current video conference atmosphere; feeding back said video and audio data the image and voice processing has been performed for to each assembly room. The image and voice processing is performed for the video and audio data from each assembly room by means of the image processing technology to emphasize the current video conference atmosphere in the embodiment of the present invention, such that conferees can feel the on-site atmosphere of the conference visually from the video and audio.

Description

突出视频会议现场氛围的方法和装置 本申请要求于 2009 年 11 月 11 日提交中国专利局、 申请号为 200910221646. 1 的中国专利申请的优先权, 其全部内容通过引用结合在本申 请中。 技术领域 本发明是涉及图像处理领域 ,尤其是涉及一种突出视频会议现场氛围的方 法和装置。 背景技术 视频会议系统是指两个或两个以上不同地方的个人或群体 ,通过传输线路 及多媒体设备(比如摄像机), 将声音、 影像及文件资料互相传送, 达到即时 且互动的沟通, 以完成会议目的之系统, 该系统是一种典型的图像通信系统。 在视频会议系统的发送端,将图像和声音信号编码成数字信号,在接收端再把 它解码并显示为视觉、 听觉可获取的信息, 与电话会议相比, 具有直观性强, 信息量大等特点。  The present invention claims priority to Chinese Patent Application No. 200910221646. filed on Nov. 11, 2009, the entire contents of which is hereby incorporated by reference. BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to the field of image processing, and more particularly to a method and apparatus for highlighting a live atmosphere of a video conference. BACKGROUND A video conference system refers to an individual or a group of two or more different places, and transmits sound, image, and document data to each other through a transmission line and a multimedia device (such as a video camera) to achieve instant and interactive communication to complete The system of conference purposes, the system is a typical image communication system. At the transmitting end of the video conferencing system, the image and sound signals are encoded into digital signals, which are then decoded and displayed as visual and audible information at the receiving end, which is intuitive and has a large amount of information compared with the conference call. Features.
但是, 目前的视频会议系统无法体现出会议现场的氛围主题, 比如, 现在 需要利用视频会议系统开一个以悲伤为主题的追悼会,而现有的视频会议系统 只能将摄像机拍摄到的现场画面传过来, 该画面只是现场的真实体现, 而人们 也只能根据画面中的内容, 由经验来感知出现场的悲伤, 并不能直观地感知到 悲伤。  However, the current video conferencing system can not reflect the atmosphere theme of the conference site. For example, it is now necessary to use the video conferencing system to open a sad-themed memorial service, while the existing video conferencing system can only capture the live video captured by the camera. Passed over, the picture is only a real reflection of the scene, and people can only perceive the sadness of the field according to the content in the picture, and can not intuitively perceive the sadness.
因此,总的来说现有技术中的视频会议系统还无法明确突出会议的现场氛 围。 发明内容 本发明实施例提供了一种突出视频会议现场氛围的方法和装置,用于明确 突出会议的现场氛围, 从而使与会者可以直观地感受到该会议的现场气氛。 Therefore, in general, the video conferencing system in the prior art cannot clearly highlight the atmosphere of the meeting. Wai. SUMMARY OF THE INVENTION Embodiments of the present invention provide a method and apparatus for highlighting a live atmosphere of a video conference, which is used to clearly highlight the live atmosphere of the conference, so that the participant can intuitively feel the atmosphere of the conference.
本发明实施例提出了一种突出视频会议现场氛围的方法,该方法包括:接 收各会场的视频和音频数据;对所述视频和音频数据进行图像声音处理以突出 当前视频会议的氛围;将经过图像声音处理后的所述视频和音频数据反馈给各 会场。  An embodiment of the present invention provides a method for highlighting a live atmosphere of a video conference, the method comprising: receiving video and audio data of each venue; performing image and sound processing on the video and audio data to highlight an atmosphere of the current video conference; The video and audio data after the image sound processing is fed back to each venue.
本发明实施例还提出了一种突出视频会议现场氛围的装置, 包括:接收单 元, 用于接收各会场的视频和音频数据; 氛围处理单元, 用于对所述视频和音 频数据进行图像声音处理以突出当前视频会议的氛围; 发送单元, 用于将经过 图像声音处理后的所述视频和音频数据反馈给各会场。  The embodiment of the present invention further provides an apparatus for highlighting a live atmosphere of a video conference, comprising: a receiving unit, configured to receive video and audio data of each venue; and an ambience processing unit, configured to perform image and sound processing on the video and audio data. To highlight the atmosphere of the current video conference; the sending unit is configured to feed back the video and audio data processed by the image sound to each venue.
本发明实施例通过图像处理技术对各会场的视频和音频数据进行图像声 音处理来突出了当前视频会议的氛围 ,从而使与会者可以直观地从该视频和音 频内感受到该会议的现场气氛。  The image processing technology performs image sound processing on the video and audio data of each venue by image processing technology to highlight the atmosphere of the current video conference, so that the participants can intuitively feel the live atmosphere of the conference from the video and audio.
附图说明 DRAWINGS
图 1为本发明实施例一提供的一种视频会议系统结构图;  1 is a structural diagram of a video conference system according to Embodiment 1 of the present invention;
图 2为本发明实施例一提供的一种突出视频会议现场氛围的方法流程图; 图 3为本发明实施例二提供的一种突出视频会议现场氛围的方法流程图; 图 4为本发明实施例三提供的一种突出视频会议现场氛围的方法流程图; 图 5为本发明实施例四提供的一种突出视频会议现场氛围的装置结构图; 图 6为本发明实施例五提供的一种突出视频会议现场氛围的装置结构图。 具体实施方式 FIG. 2 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 1 of the present invention; FIG. 3 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 2 of the present invention; FIG. 5 is a schematic structural diagram of a method for highlighting a live atmosphere of a video conference according to Embodiment 4 of the present invention; FIG. 6 is a schematic structural diagram of a device for highlighting a live atmosphere of a video conference according to Embodiment 4 of the present invention; A device structure diagram highlighting the atmosphere of a video conference. detailed description
实施例一  Embodiment 1
首先对本发明实施例的工作环境进行简单的介绍,如图 1所示为本发明实 施例一提供的一种视频会议系统结构图。 该系统包括: 多点控制单元 101 ( Mul t ipoint Control Uni t , MCU )、 会场摄像机 102、 103和 104, 其中会场 摄像机 102、 103和 104分布在三个不同的会场, 它们通过传输线路和多点控 制单元 101相连, 比如通过 IP网络相连。 多点控制单元 101可以将来自各会 场的视频和音频数据转发至其他会场,比如将会场摄像机 102采集的数据转发 给会场摄像机 103和 104 , 当然多点控制单元 101在转发过程中可以对该数据 进行一定的处理, 比如将各个画面予以拼接以增强会议体验等。  The working environment of the embodiment of the present invention is briefly introduced. FIG. 1 is a structural diagram of a video conference system according to Embodiment 1 of the present invention. The system includes: a multi-point control unit 101 (MCU), site cameras 102, 103, and 104, wherein the site cameras 102, 103, and 104 are distributed in three different venues, and they are transmitted through multiple transmission lines. The point control unit 101 is connected, for example, via an IP network. The multipoint control unit 101 can forward the video and audio data from each site to other sites, such as forwarding the data collected by the field camera 102 to the site cameras 103 and 104. Of course, the multipoint control unit 101 can forward the data during the forwarding process. Perform certain processing, such as splicing each picture to enhance the conference experience.
如图 2 所示为本发明实施例一提供的一种突出视频会议现场氛围的方法 流程图, 该方法包括如下步驟:  FIG. 2 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 1 of the present invention, where the method includes the following steps:
S201 :接收各会场的视频和音频数据,具体来说是多点控制单元接收来自 各会场的视频和音频数据,该接收数据过程属于现有技术,在此就不再进行赘 述了。  S201: Receive video and audio data of each site, specifically, the multi-point control unit receives video and audio data from each site, and the process of receiving data belongs to the prior art, and is not described herein.
S202:对所述视频和音频数据进行图像声音处理以突出当前视频会议的氛 围。 为了能让视频和音频突出当前视频会议的氛围, 多点控制单元需要对接收 到的视频和音频数据进行特殊的图像声音处理,使与会者可以直观地从该视频 和音频中感受到会议氛围。  S202: Perform image and sound processing on the video and audio data to highlight the atmosphere of the current video conference. In order to make the video and audio highlight the atmosphere of the current video conference, the multi-point control unit needs to perform special image and sound processing on the received video and audio data, so that the participants can intuitively feel the atmosphere of the conference from the video and audio.
作为本发明的一个实施例,上述图像声音处理可以包括多种处理技术: 比 如节奏控制处理、 色彩渲染处理、 线条优化处理、 背景融合处理、 特殊效果生 成叠加处理或质感模拟处理。 其中, 节奏控制处理是对接收到的视频和音频数据,通过改变其帧率的方 法, 改变整个视频和音频数据处理的进度, 使其播放速度加快或者变緩。 加快 播放可以突出欢快或者急促的现场氛围,而减緩播放则可以突出沉重或者严肃 的现场氛围。 比如将 25帧 /秒的视频和音频序列转换为 30帧 /秒来处理,从而 带给人欢快、 急促及紧张的感觉; 或者将 30 帧 /秒的视频和音频序列转换为 25帧 /秒来处理, 以让人感觉到沉重或者严肃的氛围, 当然本发明实施例并不 限于该种帧率的变换。 As an embodiment of the present invention, the above-described image sound processing may include various processing techniques such as rhythm control processing, color rendering processing, line optimization processing, background fusion processing, special effect generation superimposition processing, or texture simulation processing. Among them, the rhythm control process is to change the frame rate of the received video and audio data, and change the progress of the entire video and audio data processing to make the playback speed faster or slower. Speeding up can highlight a cheerful or rushing live atmosphere, while slowing down can highlight a heavy or serious live atmosphere. For example, converting 25 frames per second of video and audio sequences to 30 frames per second for a cheerful, rushing, and tense feeling; or converting 30 frames per second of video and audio sequences to 25 frames per second. Processing, in order to make people feel heavy or serious atmosphere, of course, embodiments of the present invention are not limited to such a frame rate conversion.
色彩渲染处理是对接收到的视频和音频数据进行色彩增强或者色彩淡化 处理: 比如对于正面积极的现场氛围, 采用色彩增强, 并且多采用暖色调, 给 人一种阳光向上的感觉; 而对于负面消极的现场氛围, 则采用色彩淡化, 并且 多采用冷色调, 给人一种阴暗阴霾的感觉。  The color rendering process is to enhance the color and color of the received video and audio data: for example, for a positive and positive scene atmosphere, color enhancement, and more warm colors, giving a feeling of sunshine upwards; Negative live atmosphere, using color fade, and more use of cool colors, giving a dark and cloudy feeling.
线条优化处理是对接收到的视频数据进行检测,找出图像中大轮廓, 明显 的边缘, 比如对于正面积极的现场氛围, 对轮廓线条作曲线优化处理, 使图像 中物体的形状轮廊显得更优美; 而对于消极的氛围, 则对轮廓线条作直线优化 处理, 使图像中物体的形状轮廓显得更单调, 来突出端庄和严肃。  The line optimization process detects the received video data, finds large outlines in the image, and has obvious edges. For example, for a positive positive scene atmosphere, the curve of the contour lines is optimized to make the shape of the object in the image appear more Graceful; for the negative atmosphere, the contour lines are optimized in a straight line, so that the shape and contour of the objects in the image appear more monotonous, to highlight dignity and seriousness.
背景融合处理是事先选取一些主题氛围的背景, 选取范围包括不同氛围 下, 不同行业, 不同季节, 不同会议地点, 以及不同的会议规模大小的一些题 材, 然后根据实际会议, 选取与之适应的素材, 进行背景与接收到的实际视频 和音频的融合。  The background fusion process is to select some backgrounds of the theme atmosphere in advance. The selection range includes different themes, different industries, different seasons, different meeting places, and some theme of different meeting sizes. Then, according to the actual meeting, select the materials that are suitable for the meeting. , the fusion of the background and the actual video and audio received.
特殊效果生成叠加处理是根据一些特殊应用场合的需求,生成一些特殊的 效果, 用于突出会议氛围。 比如: 在开一些悲伤主题的会议时, 可以制作眼泪 的图像, 叠加在一些物体上, 例如叠加在会议场景中的桌子和杯子上等。 质感模拟处理是首先检测出图像中的纹理区域, 然后根据当前环境参数, 温度, 湿度, 明暗程度等等, 对该纹理区域做处理, 使其它会场的人, 看到桌 子的质感感受和会议现场的人看到的一样。 Special effects generation overlay processing is based on the needs of some special applications, to generate some special effects, to highlight the atmosphere of the meeting. For example: When you open a meeting with a sad theme, you can make an image of tears, superimposed on some objects, such as tables and cups superimposed on the meeting scene. The texture simulation process first detects the texture area in the image, and then processes the texture area according to the current environmental parameters, temperature, humidity, brightness, etc., so that other people in the venue can see the texture of the table and the meeting site. People see the same.
当然, 需要指出的是, 本发明实施例的多点控制单元所进行的图像声音处 理并不限于上述列举的几种方式, 同时也不限于同时使用几种上述处理方式。  Of course, it should be noted that the image and sound processing performed by the multipoint control unit of the embodiment of the present invention is not limited to the above-mentioned several modes, and is not limited to the simultaneous use of several of the above processing modes.
S203: 将经过图像声音处理后的所述视频和音频数据反馈给各会场。 以图 1所示的视频会议场景为例来说, 多点控制单元 101在收到会场摄像机 102的 视频和音频数据, 并对其经过步驟 S202的图像声音处理后, 将经过处理的图 像再发送给会场摄像机 102、 103和 104 , 对接收的会场 103和 104的数据也 是做同样的处理  S203: The video and audio data processed by the image sound is fed back to each venue. Taking the video conference scene shown in FIG. 1 as an example, the multipoint control unit 101 receives the video and audio data of the conference camera 102, and after performing the image sound processing in step S202, retransmits the processed image. Give the site cameras 102, 103, and 104 the same processing for the data of the received sites 103 and 104.
本发明实施例通过图像处理技术对各会场的视频和音频数据进行图像声 音处理来突出了当前视频会议的氛围 ,从而使与会者可以直观地从该视频和音 频内感受到该会议的现场气氛。 实施例二  The image processing technology performs image sound processing on the video and audio data of each venue by image processing technology to highlight the atmosphere of the current video conference, so that the participants can intuitively feel the live atmosphere of the conference from the video and audio. Embodiment 2
如图 3 所示为本发明实施例二提供的一种突出视频会议现场氛围的方法 流程图, 需要指出的是, 本发明实施例也是从多点控制单元的角度对本发明所 作的描述, 该方法包括如下步骤:  FIG. 3 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 2 of the present invention. It should be noted that the embodiment of the present invention also describes the present invention from the perspective of a multipoint control unit. Including the following steps:
S301 : 设定当前视频会议的环境氛围模式, 所述环境氛围模式包括氛围预 设值。在视频会议开始之前, 首先在多点控制单元内设置符合这次会议氛围的 环境氛围模式,这些环境氛围模式可以预先存储在多点控制单元内的存储单元 之中, 当需要调用时, 可以选择其中一种予以运行。 这些环境氛围模式中包括 了一系列的氛围预设值, 一种悲伤的氛围模式可以包括如下一系列预设值:S301: Set an ambient ambience mode of the current video conference, where the ambient ambience mode includes an ambience preset value. Before the start of the video conference, firstly, an environment atmosphere mode conforming to the atmosphere of the conference is set in the multi-point control unit, and the environment atmosphere mode can be pre-stored in the storage unit in the multi-point control unit, and can be selected when needed to be called. One of them will be run. These ambient climate models include A series of ambience presets, a sad atmosphere mode can include a series of presets as follows:
30-25 ; 0; 0; 4 ; 5。 该组预设值分为 5个部分, 其对应了以下 5种处理方式: 节奏控制处理、 色彩渲染处理、 线条优化处理、 背景融合处理、 特殊效果生成 叠加处理, 其含义分别是: 30-25; 0; 0; 4; 5. The preset value of this group is divided into five parts, which correspond to the following five processing methods: rhythm control processing, color rendering processing, line optimization processing, background fusion processing, special effect generation superposition processing, and their meanings are:
节奏控制处理采用将 30帧 /秒序列转换为 25帧 /秒来处理的减緩播放处理 方式;  The rhythm control processing uses a slow playback processing method that converts a 30 frame/second sequence into 25 frames/second.
色彩渲染处理采用色彩淡化处理方式;  Color rendering processing adopts color fade processing;
线条优化处理釆用对轮廊线条作直线优化处理;  Line optimization processing is used to optimize the line of the corridor line;
背景融合处理采用预设的编号为 4的背景;  The background fusion process uses a preset background numbered 4;
特殊效果生成叠加处理采用预设的编号为 5的特殊效果。  Special effects generation overlay processing uses a preset special effect numbered 5.
当然, 上述只是列举了一种可能的预设值设置, 本发明实施例并不限于此 种形式, 其它各种预设值的形式也是可以的。  Of course, the above is only a list of possible preset values. The embodiment of the present invention is not limited to this form, and other forms of various preset values are also possible.
S 302 : 接收各会场的视频和音频数据及氛围参数。  S 302: Receive video and audio data and ambient parameters of each venue.
由于视频会议的氛围随着会议的进行可能会发生某种变化,比如本来緩和 的会议由于某些争论逐步变得紧张,因此如果一直采用原先的氛围预设值的话 会与实际情况不符。  As the atmosphere of the video conference may change with the progress of the conference, for example, the originally moderated conference becomes more and more tense due to some disputes. Therefore, if the original ambience default value is used, it will not match the actual situation.
针对上述情况,本步骤中多点控制单元除了接收各会场的视频和音频数据 夕卜, 同时还接收各会场发来的氛围参数, 在本实施例中, 各会场可以将该氛围 参数和视频及音频数据一起发送至多点控制单元,同时也可以通过辅流来发送 该氛围参数, 该氛围参数可以体现各会场内氛围的变化。 在实际应用中, 该氛 围参数的发送可以由各会场内负责摄录的人员进行操作。  In this case, in this step, the multi-point control unit receives the video and audio data of each site, and also receives the ambience parameters sent by each site. In this embodiment, each site can use the ambience parameter and video. The audio data is sent to the multi-point control unit together, and the ambient parameter can also be sent through the auxiliary stream, which can reflect the change of the atmosphere in each venue. In practical applications, the transmission of the ambient parameters can be performed by the person in charge of the recording in each venue.
S 303: 根据氛围参数判断是否需要修正氛围预设值: 如果需要, 则对氛围 预设值进行修正, 如果不需要, 则进入步驟 S 304。 S 303: According to the ambience parameter, it is determined whether the ambience preset value needs to be corrected: if necessary, the atmosphere The preset value is corrected, and if not, the process proceeds to step S304.
该氛围参数内可以包括是否需要对氛围预设值进行修改的信息,以及如何 修改的信息, 多点控制单元只需依照该些信息进行相应的操作即可, 当然, 该 氛围参数内也可以不直接包括上述信息,而是需要多点控制单元进行相应的处 理后, 才能得到上述信息, 比如将氛围参数和氛围预设值进行一定的比对等。  The ambience parameter may include information about whether the ambience preset value needs to be modified, and how to modify the information. The multi-point control unit only needs to perform corresponding operations according to the information. Of course, the ambience parameter may also be The above information is directly included, but the multi-point control unit is required to perform corresponding processing to obtain the above information, for example, a certain comparison between the ambient parameter and the ambience preset value.
S 304 :根据所述氛围预设值对所述视频和音频数据进行图像声音处理以突 出当前视频会议的氛围。  S304: Perform image and sound processing on the video and audio data according to the ambience preset value to highlight an atmosphere of the current video conference.
本步驟中根据氛围预设值对视频和音频数据进行图像声音处理和步驟 S 301 中描述的相类似, 且图像声音处理的具体内容也和实施例一相类似, 因 此在此就不再对其进行赘述了。  In this step, the audio and video processing of the video and audio data according to the ambience preset value is similar to that described in step S301, and the specific content of the image sound processing is similar to that of the first embodiment, so it is no longer I will go into details.
S 305 :对所述经过图像声音处理后的视频和音频进行效果评价判断是否需 要更新氛围预设值。  S 305: Perform an effect evaluation on the video-audio-processed video and audio to determine whether it is necessary to update the ambience preset value.
为了保证经过步骤 S 304的图像声音处理后的视频和音频符合当前会场的 氛围,或者说处理效果让人满意, 本实施例在本步骤中对该处理后的视频和音 频进行了效果评价,该效果评价可以通过将处理后的视频和音频与一预设的模 板进行比对来实现。 效果评价后, 多点控制单元会根据评价结果判断是否需要 更新氛围预设值, 如果需要, 则修改上述氛围预设值, 并返回步骤 S 304 ; 如 果不需要, 则进入步骤 S 306。  In order to ensure that the video and audio processed by the image sound processing in step S304 conform to the atmosphere of the current venue, or the processing effect is satisfactory, the embodiment performs the effect evaluation on the processed video and audio in this step. The effect evaluation can be achieved by comparing the processed video and audio with a preset template. After the effect evaluation, the multi-point control unit judges whether it is necessary to update the ambience preset value according to the evaluation result, if necessary, modifies the ambience preset value, and returns to step S304; if not, proceeds to step S306.
S 306: 将经过图像声音处理后的所述视频和音频数据反馈给各会场。 本发明实施例通过图像处理技术对各会场的视频和音频数据进行图像声 音处理来突出了当前视频会议的氛围,从而使与会者可以直观地从该视频和音 频内感受到该会议的现场气氛, 另外本发明实施例不但实时检测各会场的情 况,根据具体情况来改变氛围预设值, 而且还对经过处理的视频和音频进行效 果评价,从而使得本发明实施例更符合会议的实际氛围情况以及突出会议氛围 的效果更加明显。 实施例三 S 306: The video and audio data processed by the image sound is fed back to each venue. The image processing technology performs image and sound processing on the video and audio data of each site by the image processing technology to highlight the atmosphere of the current video conference, so that the participant can intuitively feel the atmosphere of the conference from the video and audio. In addition, the embodiment of the present invention not only detects the situation of each venue in real time. In other words, the ambience preset value is changed according to the specific situation, and the processed video and audio are also evaluated for effects, so that the embodiment of the present invention is more in line with the actual atmosphere of the conference and the effect of highlighting the conference atmosphere is more obvious. Embodiment 3
如图 4 所示为本发明实施例三提供的一种突出视频会议现场氛围的方法 流程图, 该方法包括如下步驟:  FIG. 4 is a flowchart of a method for highlighting a live atmosphere of a video conference according to Embodiment 3 of the present invention, where the method includes the following steps:
S401 : 各会场根据各自的会场氛围选择各自的环境氛围模式, 所述环境氛 围模式包括氛围预设值。  S401: Each venue selects its own ambient atmosphere mode according to the atmosphere of the venue, and the ambient atmosphere mode includes an atmosphere preset value.
S402 :各会场根据各自的氛围预设值对其采集的视频和音频数据进行在先 图像声音处理。  S402: Each site performs pre-image sound processing on the video and audio data collected according to the respective preset values of the atmosphere.
上述步驟 S401和 S402分别和实施例二中步驟 S 301及 S 304相类似,所不 同的是, 实施例二中步骤 S 301及 S 304都是由多点控制单元完成的, 而本实施 例中则是由各个会场端完成该步驟的操作的,具体来说, 可以是由各个会场内 的摄录设备完成该操作,或者也可以是由一和摄录设备相连的独立的设备来完 成该操作的。  The above steps S401 and S402 are similar to the steps S 301 and S 304 in the second embodiment, respectively, except that the steps S 301 and S 304 in the second embodiment are all performed by the multipoint control unit, and in this embodiment, The operation of the step is performed by each site end. Specifically, the operation may be performed by a video recording device in each site, or may be performed by a separate device connected to the video recording device. of.
当完成该在先图像声音处理步骤后 ,各会场会将经过处理后的视频和音频 数据发送至多点控制单元。  After the previous image sound processing step is completed, each site will send the processed video and audio data to the multipoint control unit.
S404 :多点控制单元对所述经过在先图像声音处理的视频和音频数据进行 统一适配优化以突出当前视频会议的整体氛围。  S404: The multipoint control unit performs uniform adaptation optimization on the video and audio data processed by the prior image sound to highlight the overall atmosphere of the current video conference.
在视频会议中, 各个会场由于视频、 音频协议, 格式, 网络带宽等等都不 一定相同, 因此, 为了让所有会场都能够正常观看其他会场的视频数据, 需要 将各个会场的数据进行一定的转换, 这种转换过程就叫做 "适配优化"。In a video conference, the video, audio protocol, format, network bandwidth, and so on are not necessarily the same. Therefore, in order to allow all sites to view video data of other sites, you need The data of each venue is converted to a certain extent. This conversion process is called "adaptation optimization".
S405 :多点控制单元将经过统一适配优化处理后的所述视频和音频数据反馈给 各会场。 S405: The multi-point control unit feeds back the video and audio data after the unified adaptation optimization processing to each site.
作为本发明的一个实施例, 各会场在完成步骤 S402后, 也可以对该经过 在先图像声音处理后的视频和音频数据进行效果评价,来判断是否需要对氛围 预设值进行修改, 如果需要修改, 则修改氛围预设值后对该视频和音频数据再 次进行在先图像声音处理, 如果不需要修改, 则将该视频和音频数据发送给多 点控制单元。  As an embodiment of the present invention, after completing step S402, each site may also perform an effect evaluation on the video and audio data after the previous image sound processing to determine whether it is necessary to modify the preset value of the atmosphere, if necessary After the modification, the audio and preset data are processed again, and the previous image sound processing is performed again. If no modification is needed, the video and audio data are sent to the multi-point control unit.
本发明实施例通过各会场对各自的视频和音频数据进行在先图像声音处 理来突出了当前视频会议的氛围 ,从而使与会者可以直观地从该视频和音频内 感受到该会议的现场气氛,另外由于图像声音处理大部分是在会场端分担完成 的, 因此也大大减轻了多点控制器的负担。 实施例四  The embodiment of the present invention highlights the atmosphere of the current video conference by performing pre-image sound processing on the respective video and audio data by each site, so that the participant can intuitively feel the atmosphere of the conference from the video and audio. In addition, since the image sound processing is mostly done at the site end, the burden on the multipoint controller is greatly reduced. Embodiment 4
如图 5 为本发明实施例四提供的一种突出视频会议现场氛围的装置结构 图, 该装置包括: 接收单元 510、 氛围处理单元 520和发送单元 530 , 它们之 间依次相连。  FIG. 5 is a structural diagram of a device for highlighting a live atmosphere of a video conference according to Embodiment 4 of the present invention. The device includes: a receiving unit 510, an ambience processing unit 520, and a sending unit 530, which are sequentially connected.
接收单元 510用于接收各会场的视频和音频数据,在本实施例中,接收单 元 51 0可以通过互联网、专线网络或者直连线缆等从各会场接收视频和音频数 据, 具体来说, 是接收各会场的摄像单元发出的视频和音频数据。  The receiving unit 510 is configured to receive video and audio data of each site. In this embodiment, the receiving unit 51 0 can receive video and audio data from each site through the Internet, a dedicated line network, or a direct cable, etc., specifically, Receive video and audio data from the camera unit of each venue.
氛围处理单元 520 用于对所述视频和音频数据进行图像声音处理以突出 当前视频会议的氛围。 为了能让视频和音频突出当前视频会议的氛围, 本发明 实施例的装置需要对接收到的视频和音频数据进行特殊的图像声音处理,使与 会者可以直观地从该视频和音频中感受到会议氛围。而具体的图像声音处理方 式可以多种多样, 比如实施例一中所介绍的: 节奏控制处理、 色彩渲染处理、 线条优化处理、 背景融合处理、 特殊效果生成叠加处理或质感模拟处理等, 其 具体的处理原理及过程和实施例一中相类似, 就不再赘述了。 The ambience processing unit 520 is configured to perform image sound processing on the video and audio data to highlight an atmosphere of the current video conference. In order to enable video and audio to highlight the atmosphere of the current video conference, the present invention The apparatus of an embodiment requires special image and sound processing of the received video and audio data so that the participant can intuitively feel the atmosphere of the meeting from the video and audio. The specific image sound processing methods can be various, such as described in the first embodiment: rhythm control processing, color rendering processing, line optimization processing, background fusion processing, special effect generation overlay processing or texture simulation processing, etc. The processing principle and process are similar to those in the first embodiment, and will not be described again.
发送单元 530 用于将经过图像声音处理后的所述视频和音频数据反馈给 各会场。  The sending unit 530 is configured to feed back the video and audio data processed by the image sound to each venue.
作为本发明的一个实施例,接收单元 510还用于接收经过了在先图像声音 处理的视频和音频数据,该在先图像声音处理同上述图像声音处理相类似, 比 如节奏控制处理、 色彩渲染处理、 线条优化处理、 背景融合处理、 特殊效果生 成叠加处理或质感模拟处理等。所不同的是, 该在先图像声音处理由各个会场 端予以完成, 具体来说, 可以是由各个会场内的摄录设备完成该操作, 或者也 可以是由一和摄录设备相连的独立的设备来完成该操作的。  As an embodiment of the present invention, the receiving unit 510 is further configured to receive video and audio data that has undergone prior image sound processing, which is similar to the above image sound processing, such as rhythm control processing, color rendering processing. , line optimization processing, background fusion processing, special effect generation overlay processing or texture simulation processing. The difference is that the prior image sound processing is performed by each site end. Specifically, the operation may be performed by a video recording device in each site, or may be an independent connection between the video recording device and the video recording device. The device to complete the operation.
氛围处理单元 520 还用于对经过了在先图像声音处理的视频和音频数据 进行统一适配优化以突出当前视频会议的整体氛围。  The ambience processing unit 520 is also used to perform uniform adaptation of the video and audio data that has undergone prior image sound processing to highlight the overall ambience of the current video conference.
这样, 由各个会场端分担大部分的图像声音处理,可以大大减轻本发明实 施例的负担。  Thus, most of the image and sound processing is shared by the respective venues, and the burden on the embodiment of the present invention can be greatly alleviated.
本发明实施例通过图像处理技术对各会场的视频和音频数据进行图像声 音处理来突出了当前视频会议的氛围 ,从而使与会者可以直观地从该视频和音 频内感受到该会议的现场气氛。 实施例五 如图 6 为本发明实施例五提供的一种突出视频会议现场氛围的装置结构 图, 该装置包括: 接收单元 610、 氛围处理单元 620、 发送单元 630、 模式设 定单元 640、 爹改单元 650、 判断单元 660和更新单元 670。 The image processing technology performs image and sound processing on the video and audio data of each site by image processing technology to highlight the atmosphere of the current video conference, so that the participant can intuitively feel the live atmosphere of the conference from the video and audio. Embodiment 5 FIG. 6 is a structural diagram of a device for highlighting a live atmosphere of a video conference according to Embodiment 5 of the present invention. The device includes: a receiving unit 610, an ambience processing unit 620, a sending unit 630, a mode setting unit 640, and a tampering unit 650. , a judging unit 660 and an updating unit 670.
接收单元 610用于接收各会场的视频和音频数据以及各会场的氛围参数, 由于视频会议的氛围随着会议的进行可能会发生某种变化,因此如果一直采用 原先的氛围预设值的话会与实际情况不符。本实施例中接收单元 610所接收的 氛围参数就可以体现各会场内氛围的变化,在实际应用中,该氛围参数的发送 可以由各会场内负责聂录的人员进行操作。  The receiving unit 610 is configured to receive video and audio data of each site and the ambience parameters of each site. Since the atmosphere of the video conference may change according to the progress of the conference, if the original ambience preset value is always used, The actual situation does not match. The ambience parameter received by the receiving unit 610 in this embodiment can reflect the change of the atmosphere in each venue. In practical applications, the sending of the ambience parameter can be performed by the person in charge of Nie Lu in each venue.
在本实施例中 ,各会场可以将该氛围参数和视频及音频数据一起发送至多 点控制单元, 同时也可以通过辅流来发送该氛围参数。  In this embodiment, each venue may send the ambience parameter together with the video and audio data to the multipoint control unit, and the ambience parameter may also be sent through the auxiliary stream.
模式设定单元 640用于设定当前视频会议的环境氛围模式,该环境氛围模 式包括氛围预设值, 这些环境氛围模式可以预先存储在本装置的存储单元之 中, 当需要调用时, 可以选择其中一种予以运行。 氛围处理单元 620可以包括 氛围处理子单元,用于根据该氛围预设值对接收单元 610所接收的视频和音频 数据进行图像声音处理以突出当前视频会议的氛围。  The mode setting unit 640 is configured to set an ambient ambience mode of the current video conference, where the ambient ambience mode includes an ambience preset value, and the ambient ambience mode may be pre-stored in a storage unit of the device, and may be selected when needed to be called. One of them will be run. The ambience processing unit 620 may include an ambience processing sub-unit for performing audiovisual processing on the video and audio data received by the receiving unit 610 according to the ambience preset value to highlight the ambience of the current video conference.
修改单元 650用于根据接收单元 610所接收的氛围参数修改上述氛围预设 值, 上述氛围参数内可以包括是否需要对氛围预设值进行修改的信息, 以及如 何修改的信息, 修改单元 650只需依照这些信息进行相应的操作即可, 当然, 该氛围参数内也可以不直接包括上述信息,而是需要修改单元 650进行相应的 处理后,才能得到上述信息,比如将氛围参数和氛围预设值进行一定的比对等。  The modifying unit 650 is configured to modify the ambience preset value according to the ambience parameter received by the receiving unit 610. The ambience parameter may include information about whether the ambience preset value needs to be modified, and how to modify the information, and the modifying unit 650 only needs to According to the information, the corresponding operation may be performed. Of course, the ambience parameter may not directly include the above information, but the modification unit 650 is required to perform corresponding processing to obtain the above information, for example, the ambience parameter and the ambience preset value. Make a certain comparison.
判断单元 660 用于对经过图像声音处理后的视频和音频数据进行效果评 价判断是否需要更新氛围预设值。为了保证经过氛围处理单元 620的图像声音 处理后的视频和音频符合当前会场的氛围, 或者说处理效果让人满意, 本实施 例的判断单元 660对该处理后的视频和音频进行了效果评价,该效果评价可以 通过将处理后的视频和音频与一预设的模板进行比对来实现。 The determining unit 660 is configured to perform an effect evaluation on the video and audio data subjected to the image sound processing to determine whether it is necessary to update the ambience preset value. In order to ensure the image sound passing through the atmosphere processing unit 620 The processed video and audio conform to the atmosphere of the current conference site, or the processing effect is satisfactory. The determining unit 660 of the embodiment performs an effect evaluation on the processed video and audio, and the effect evaluation can be performed by using the processed video. And audio is compared with a preset template.
更新单元 670用于根据判断单元 660的效果评价更新所述氛围预设值,使 得氛围处理单元 620处理后的视频和音频氛围效果更好。  The updating unit 670 is configured to update the ambience preset value according to the effect evaluation of the judging unit 660, so that the video and audio ambience processed by the ambience processing unit 620 is better.
本发明实施例通过图像处理技术对各会场的视频和音频数据进行图像声 音处理来突出了当前视频会议的氛围 ,从而使与会者可以直观地从该视频和音 频内感受到该会议的现场气氛, 另外本发明实施例不但实时检测各会场的情 况,根据具体情况来改变氛围预设值, 而且还对经过处理的视频和音频进行效 果评价,从而使得本发明实施例更符合会议的实际氛围情况以及突出会议氛围 的效果更加明显。 可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一计算机 可读取存储介质中, 该程序在执行时, 可包括如上述各方法的实施例的流程。 其中, 所述的存储介质可为磁碟、 光盘、 只读存储记忆体(Read-Only Memory, ROM )或随机存储记忆体 ( Random Acces s Memory, RAM )等。  The image processing technology performs image and sound processing on the video and audio data of each site by the image processing technology to highlight the atmosphere of the current video conference, so that the participant can intuitively feel the atmosphere of the conference from the video and audio. In addition, the embodiment of the present invention not only detects the situation of each site in real time, but also changes the preset value of the atmosphere according to the specific situation, and also evaluates the effect of the processed video and audio, so that the embodiment of the present invention is more in line with the actual atmosphere of the conference and The effect of highlighting the atmosphere of the meeting is even more obvious. This may be accomplished by a computer program instructing the associated hardware, which may be stored in a computer readable storage medium, which, when executed, may include the flow of an embodiment of the methods described above. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
以上所述的具体实施方式,对本发明的目的、技术方案和有益效果进行了 进一步详细说明 , 所应理解的是, 以上所述仅为本发明的具体实施方式而已 , 并不用于限定本发明的保护范围, 凡在本发明的精神和原则之内, 所做的任何 修改、 等同替换、 改进等, 均应包含在本发明的保护范围之内。  The specific embodiments of the present invention have been described in detail with reference to the preferred embodiments of the present invention. The scope of the protection, any modifications, equivalents, improvements, etc., made within the spirit and scope of the invention are intended to be included within the scope of the invention.

Claims

权 利 要 求 书 Claim
1.一种突出视频会议现场氛围的方法, 其特征在于, 所述方法包括: 接收各会场的视频和音频数据; A method for highlighting a live atmosphere of a video conference, the method comprising: receiving video and audio data of each venue;
对所述视频和音频数据进行图像声音处理以突出当前视频会议的氛围; 将经过图像声音处理后的所述视频和音频数据反馈给各会场。  Performing image and sound processing on the video and audio data to highlight the atmosphere of the current video conference; and feeding the video and audio data processed by the image sound to each venue.
2.如权利要求 1所述的方法,其特征在于, 所述接收各会场的视频和音频 数据之前还包括:  The method according to claim 1, wherein before receiving the video and audio data of each site, the method further comprises:
设定当前视频会议的环境氛围模式 , 所述环境氛围模式包括氛围预设值; 所述对所述视频和音频数据进行图像声音处理以突出当前视频会议的氛 围包括:  Setting an ambient ambience mode of the current video conference, the ambient ambience mode including an ambience preset value; and performing image sound processing on the video and audio data to highlight an atmosphere of the current video conference includes:
根据所述氛围预设值对所述视频和音频数据进行图像声音处理以突出当 前视频会议的氛围。  The video and audio data are subjected to image sound processing according to the ambience preset value to highlight the atmosphere of the current video conference.
3.如权利要求 2所述的方法, 其特征在于, 还包括:  3. The method of claim 2, further comprising:
接收各会场的氛围参数, 根据所述氛围参数修改所述氛围预设值。  Receiving the ambience parameter of each site, and modifying the ambience preset value according to the ambience parameter.
4.如权利要求 2所述的方法, 其特征在于, 还包括:  4. The method of claim 2, further comprising:
对所述经过图像声音处理后的视频和音频数据进行效果评价判断是否需 要更新氛围预设值, 如果需要更新氛围预设值, 则根据更新后的氛围预设值再 对所述视频和音频数据进行图像声音处理。  Performing an effect evaluation on the video and audio data after the image sound processing to determine whether it is necessary to update the ambience preset value, and if the ambience preset value needs to be updated, re-pairing the video and audio data according to the updated ambience preset value. Perform image sound processing.
5.如权利要求 1所述的方法,其特征在于, 所述接收各会场的视频和音频 数据包括:  The method of claim 1, wherein the receiving video and audio data of each venue comprises:
接收各会场的经过在先图像声音处理的视频和音频数据;  Receiving video and audio data of each site that has undergone prior image sound processing;
所述对所述视频和音频数据进行图像声音处理以突出当前视频会议的氛 围包括: Performing image and sound processing on the video and audio data to highlight the current video conference atmosphere The surrounding includes:
对所述经过在先图像声音处理的视频和音频数据进行统一适配优化以突 出当前视频会议的整体氛围。  The video and audio data processed by the prior image sound is uniformly adapted to highlight the overall atmosphere of the current video conference.
6. 如权利要求 5所述的方法, 其特征在于, 所述接收各会场的经过在先 图像声音处理的视频和音频数据包括:  The method according to claim 5, wherein the receiving the video and audio data processed by the prior image sound of each site comprises:
接收各会场根据各自的氛围预设值来进行在先图像声音处理的视频和音 频数据。  The video and audio data of the prior image sound processing are received by each site according to the respective preset values of the atmosphere.
7.如权利要求 1至 6任一所述的方法,其特征在于, 所述图像声音处理包 括: 节奏控制处理、 色彩渲染处理、 线条优化处理、 背景融合处理、 特殊效果 生成叠加处理或质感模拟处理。  The method according to any one of claims 1 to 6, wherein the image sound processing comprises: rhythm control processing, color rendering processing, line optimization processing, background fusion processing, special effect generation superimposition processing, or texture simulation deal with.
8.—种突出视频会议现场氛围的装置, 其特征在于, 包括:  8. A device for highlighting the atmosphere of a video conference, characterized in that it comprises:
接收单元, 用于接收各会场的视频和音频数据;  a receiving unit, configured to receive video and audio data of each site;
氛围处理单元,用于对所述视频和音频数据进行图像声音处理以突出当前 视频会议的氛围;  An ambience processing unit, configured to perform image and sound processing on the video and audio data to highlight an atmosphere of the current video conference;
发送单元,用于将经过图像声音处理后的所述视频和音频数据反馈给各会 场。  And a sending unit, configured to feed back the video and audio data processed by the image sound to each venue.
9.如权利要求 8所述的装置, 其特征在于, 还包括:  The device according to claim 8, further comprising:
模式设定单元,用于设定当前视频会议的环境氛围模式, 所述环境氛围模 式包括氛围预设值;  a mode setting unit, configured to set an ambient ambience mode of the current video conference, where the ambient ambience mode includes an ambience preset value;
所述氛围处理单元包括: 氛围处理子单元, 用于根据所述氛围预设值对所 述视频和音频数据进行图像声音处理以突出当前视频会议的氛围。  The ambience processing unit includes: an ambience processing sub-unit, configured to perform image sound processing on the video and audio data according to the ambience preset value to highlight an atmosphere of the current video conference.
10.如权利要求 9所述的装置, 其特征在于, 所述接收单元还用于接收各会场的氛围参数; 10. Apparatus according to claim 9 wherein: The receiving unit is further configured to receive an ambient parameter of each site;
所述装置还包括:  The device also includes:
修改单元, 用于根据所述氛围参数修改所述氛围预设值。  And a modifying unit, configured to modify the ambience preset value according to the ambience parameter.
11.如权利要求 9所述的装置, 其特征在于, 还包括: 判断单元,用于对所述经过图像声音处理后的视频和音频数据进行效果评 价判断是否需要更新氛围预设值;  The device according to claim 9, further comprising: a determining unit, configured to perform an effect evaluation on the video and audio data after the image and sound processing, whether it is necessary to update an ambience preset value;
更新单元, 用于根据所述效果评价更新所述氛围预设值。  And an updating unit, configured to update the ambience preset value according to the effect evaluation.
12.如权利要求 8所述的装置, 其特征在于, 所述接收单元还用于接收经过了在先图像声音处理的视频和音频数据; 所述氛围处理单元还用于对经过了在先图像声音处理的视频和音频数据 进行统一适配优化以突出当前视频会议的整体氛围。  The device according to claim 8, wherein the receiving unit is further configured to receive video and audio data that has undergone prior image sound processing; the ambience processing unit is further configured to: pass the previous image The sound-processed video and audio data are uniformly adapted to highlight the overall atmosphere of the current video conference.
PCT/CN2010/075229 2009-11-11 2010-07-19 Method and apparatus for emphasizing video conference on-site atmosphere WO2011057507A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200910221646.1 2009-11-11
CN2009102216461A CN102065266B (en) 2009-11-11 2009-11-11 Method and device for highlighting scene atmosphere of video conference

Publications (1)

Publication Number Publication Date
WO2011057507A1 true WO2011057507A1 (en) 2011-05-19

Family

ID=43991191

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/075229 WO2011057507A1 (en) 2009-11-11 2010-07-19 Method and apparatus for emphasizing video conference on-site atmosphere

Country Status (2)

Country Link
CN (1) CN102065266B (en)
WO (1) WO2011057507A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103856742B (en) * 2012-12-07 2018-05-11 华为技术有限公司 Processing method, the device and system of audiovisual information
CN103903074B (en) * 2012-12-24 2018-10-30 华为技术有限公司 A kind of information processing method and device of video exchange
CN103051862B (en) * 2013-01-29 2016-03-02 陈进 Based on video call system and its implementation of Virtual Space
CN112327720B (en) * 2020-11-20 2022-09-20 北京瞰瞰智域科技有限公司 Atmosphere management method and system
CN112488650A (en) * 2020-11-26 2021-03-12 万翼科技有限公司 Conference atmosphere adjusting method, electronic equipment and related products

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006033657A (en) * 2004-07-21 2006-02-02 Ics:Kk Chairman leadership type video conferencing system and method
CN1968119A (en) * 2006-09-07 2007-05-23 华为技术有限公司 Method for resource sharing among MCUs in videoconference system
US20070299912A1 (en) * 2006-06-26 2007-12-27 Microsoft Corporation, Corporation In The State Of Washington Panoramic video in a live meeting client
CN101534413A (en) * 2009-04-14 2009-09-16 深圳华为通信技术有限公司 System, method and apparatus for remote representation

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006033657A (en) * 2004-07-21 2006-02-02 Ics:Kk Chairman leadership type video conferencing system and method
US20070299912A1 (en) * 2006-06-26 2007-12-27 Microsoft Corporation, Corporation In The State Of Washington Panoramic video in a live meeting client
CN1968119A (en) * 2006-09-07 2007-05-23 华为技术有限公司 Method for resource sharing among MCUs in videoconference system
CN101534413A (en) * 2009-04-14 2009-09-16 深圳华为通信技术有限公司 System, method and apparatus for remote representation

Also Published As

Publication number Publication date
CN102065266B (en) 2012-07-25
CN102065266A (en) 2011-05-18

Similar Documents

Publication Publication Date Title
EP2348671B1 (en) Conference terminal, conference server, conference system and method for data processing
US9148625B2 (en) Transition control in a videoconference
US8760485B2 (en) System and method for displaying participants in a videoconference between locations
US8289367B2 (en) Conferencing and stage display of distributed conference participants
WO2018209879A1 (en) Method and device for automatically selecting camera image, and audio and video system
WO2011057507A1 (en) Method and apparatus for emphasizing video conference on-site atmosphere
KR20160125972A (en) Displaying a presenter during a video conference
WO2011057511A1 (en) Method, apparatus and system for implementing audio mixing
CN105763832A (en) Video interaction and control method and device
WO2015070558A1 (en) Video shooting control method and device
WO2012055335A1 (en) Conference control method, apparatus and system thereof
WO2011026382A1 (en) Method, device and system for presenting virtual conference site of video conference
JP2004128614A (en) Image display controller and image display control program
WO2014044059A1 (en) Method, device and system for video conference recording and playing
WO2014094461A1 (en) Method, device and system for processing video/audio information in video conference
WO2014172907A1 (en) Video conference processing method and device
US20120127268A1 (en) Method and apparatus for controlling broadcasting network and home network for 4d broadcasting service
CN107027004B (en) Main unit of audio-video linkage microphone
US11405587B1 (en) System and method for interactive video conferencing
KR102424150B1 (en) An automatic video production system
CN208862988U (en) A kind of intelligent sound box and video conferencing system
WO2013067898A1 (en) Method and terminal for transmitting information
WO2019242726A1 (en) Conference control method and multipoint control unit
CN107197147A (en) The method of controlling operation thereof and device of a kind of panorama camera
JP2024500956A (en) Systems and methods for expanded views in online meetings

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10829471

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10829471

Country of ref document: EP

Kind code of ref document: A1