US20010055429A1 - Image synthesis and communication apparatus - Google Patents

Image synthesis and communication apparatus

Info

Publication number
US20010055429A1
US20010055429A1
Authority
US
United States
Prior art keywords
image
search range
feature point
images
section
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US09/162,024
Other versions
US6434276B2 (en)
Inventor
Masashi Hirosawa
Yasuhisa Nakamura
Hiroshi Akagi
Shiro Ida
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Assigned to SHARP KABUSHIKI KAISHA. Assignors: AKAGI, HIROSHI; HIROSAWA, MASASHI; IDA, SHIRO; NAKAMURA, YASUHISA
Publication of US20010055429A1
Application granted
Publication of US6434276B2
Legal status: Expired - Fee Related


Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/537Motion estimation other than block-based
    • H04N19/54Motion estimation other than block-based using feature points or meshes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/04Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa
    • H04N1/19Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays
    • H04N1/195Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays the array comprising a two-dimensional array or a combination of two-dimensional arrays
    • H04N1/19594Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays the array comprising a two-dimensional array or a combination of two-dimensional arrays using a television camera or a still video camera
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/387Composing, repositioning or otherwise geometrically modifying originals
    • H04N1/3876Recombination of partial images to recreate the original image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/2624Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects for obtaining an image which is composed of whole input images, e.g. splitscreen
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/04Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa
    • H04N1/19Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays
    • H04N1/195Scanning arrangements, i.e. arrangements for the displacement of active reading or reproducing elements relative to the original or reproducing medium, or vice versa using multi-element arrays the array comprising a two-dimensional array or a combination of two-dimensional arrays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/04Scanning arrangements
    • H04N2201/0402Arrangements not specific to a particular one of the scanning methods covered by groups H04N1/04 - H04N1/207
    • H04N2201/0414Scanning an image in a series of overlapping zones

Definitions

  • the present invention relates to an image synthesis and communication apparatus which continuously captures images by using an imaging apparatus such as a video camera, detects a motion amount of an object image on an imaging plane, synthesizes a wide-angle and high-definition image on the basis of the detected motion amount, and transmits and receives data required for the synthesization.
  • an imaging apparatus such as a video camera
  • the representative point method is used mainly in compensation of motion such as a shake of a video camera.
  • FIG. 19 diagrammatically shows the representative point method.
  • a representative point is set at a fixed position in the image of the previous frame, and, for an image of a current frame, a correlation calculation and an accumulative addition operation are performed on corresponding pixels while conducting a two-dimensional shifting operation. An amount from which the highest calculation value is obtained is detected as a motion amount.
  • Japanese Unexamined Patent Publication JP-A 6-86149 (1994) discloses a technique in which a Laplacian filter or the like is applied at a preset representative point, a luminance gradient is obtained, and a calculation is performed by using the obtained value, thereby enhancing the accuracy.
  • a motion amount may be detected also by using the block matching method in which a correlation calculation is performed on duplicated portions of images to determine a position where synthesization is to be conducted.
  • FIG. 20 diagrammatically shows the block matching method.
  • a specific region to be referred to is set in the image of the previous frame, a correlation calculation is performed on the image of the current frame while conducting a two-dimensional shifting operation, and a motion amount is obtained in the same manner as the representative point method.
  • in the representative point method, it is required only to obtain correlations from several points and then perform an accumulative addition.
  • in the block matching method, by contrast, an accumulative addition must be performed on all points in the specific region, and hence processing of a higher speed is required.
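  • For illustration, the following is a minimal sketch of the two prior-art searches, assuming grayscale images held as 2-D numpy arrays and a search range of ±m by ±n pixels; the function names, the parameter values, and the use of an absolute-difference correlation (smaller is better) are illustrative assumptions, not taken from the patent text.

    import numpy as np

    def motion_by_points(prev, curr, points, m=8, n=8):
        """Representative-point style search: correlate only a few pixels.

        points are (x, y) positions assumed to lie at least m (laterally)
        and n (vertically) pixels inside both images, so that the shifted
        lookups stay in bounds."""
        best, best_xy = None, (0, 0)
        for dy in range(-n, n + 1):
            for dx in range(-m, m + 1):
                acc = 0.0
                for x, y in points:  # accumulate over a few points only
                    acc += abs(float(prev[y, x]) - float(curr[y + dy, x + dx]))
                if best is None or acc < best:
                    best, best_xy = acc, (dx, dy)
        return best_xy

    def motion_by_block(prev, curr, x0, y0, w, h, m=8, n=8):
        """Block-matching style search over a w-by-h region whose upper left
        corner is (x0, y0); the region plus the search margin is assumed to
        fit inside both images."""
        block = prev[y0:y0 + h, x0:x0 + w].astype(float)
        best, best_xy = None, (0, 0)
        for dy in range(-n, n + 1):
            for dx in range(-m, m + 1):
                cand = curr[y0 + dy:y0 + dy + h,
                            x0 + dx:x0 + dx + w].astype(float)
                acc = np.abs(block - cand).sum()  # every point in the region
                if best is None or acc < best:
                    best, best_xy = acc, (dx, dy)
        return best_xy

    The inner sum of motion_by_block touches every pixel of the region at every shift, which is the source of the calculation-amount difference noted above.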
  • a low-resolution and narrow-angle camera which is economical has a disadvantage that, in an operation of imaging a document or the like, when an image of the whole of the document is taken, small characters are defaced and illegible, and, when an imaging operation is performed with a resolution at which characters are readable, it is impossible to grasp the whole of the document.
  • in the representative point method, a certain representative point must have a luminance gradient of a given degree.
  • when black and white data such as a document image are to be handled, for example, there arises a problem in that the luminance gradient is low at all representative points and a motion amount cannot be correctly detected.
  • the method is effective when the luminance gradient is uniformly distributed in an object image.
  • when the luminance gradient is not uniformly distributed, such as in the case of a document having a large background, a low luminance gradient is obtained in the background and hence it is difficult to detect a motion amount.
  • the block matching method has a problem in that the method requires a large amount of calculation and hence it is difficult to detect a motion amount in real time.
  • an image synthesis and communication apparatus comprising:
  • imaging means for inputting images in time series
  • image storing means for storing the images inputted from the imaging means
  • feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point
  • search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation
  • correlation calculating means for obtaining correlations between the feature point and pixels in the search range
  • image synthesizing means for synthesizing an image on the basis of a motion amount obtained from the correlation calculating means
  • image transmitting means for transmitting an image obtained from the image synthesizing means
  • image receiving means for receiving the image transmitted from the image transmitting means.
  • the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames.
  • the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame
  • the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame
  • the correlation calculating means performs a correlation calculation for each of the regions.
  • an image synthesis and communication apparatus comprising:
  • imaging means for inputting images in time series
  • image transmitting means for transmitting images obtained from the imaging means
  • image receiving means for receiving the images transmitted from the image transmitting means
  • image storing means for storing the images inputted from the image receiving means
  • feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point
  • search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation
  • correlation calculating means for obtaining correlations between the feature point and pixels in the search range
  • image synthesizing means for receiving a motion amount obtained from the correlation calculating means, and synthesizing an image.
  • the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames.
  • the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame
  • the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame
  • the correlation calculating means performs a correlation calculation for each of the regions.
  • an image synthesis and communication apparatus comprising:
  • imaging means for inputting images in time series
  • image storing means for storing the images inputted from the imaging means
  • feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point
  • search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation
  • correlation calculating means for obtaining correlations between the feature point and pixels in the search range
  • update image calculating means for obtaining an image of a newly-imaged portion from the motion amount obtained from the correlation calculating means
  • image transmitting means for transmitting the motion amount obtained from the correlation calculating means, and a partial image obtained from the update image calculating means
  • image receiving means for receiving the motion amount and the partial image transmitted from the image transmitting means
  • image synthesizing means for synthesizing an image from the motion amount and the partial image obtained from the image receiving means.
  • the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames.
  • the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame
  • the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame
  • the correlation calculating means performs a correlation calculation for each of the regions.
  • the place where the image synthesization is performed is not restricted to the transmission side where an image of an object is taken.
  • when images are transmitted as they are from the transmission side and the images are synthesized together on the reception side, a wide-angle and high-definition image can be obtained on the reception side by using a low-resolution and narrow-angle camera and a transmission facility, even in the case where the transmission side has only a conventional TV phone system, TV conference system, or the like.
  • the obtained images are not transmitted as they are, but only portions of the obtained images which are judged as not overlapping with the previous image by checking the overlapping state of the images with respect to the previous image are transmitted. Therefore, it is possible to transmit only images of a minimum necessary total size, and hence the information amount in the communication can be reduced.
  • the amount of calculation can be reduced by two orders of magnitude and hence it is possible to perform real-time processing at the frame rate.
  • FIG. 1 is a block diagram showing the configuration of an image synthesis and communication apparatus of a first embodiment of the invention
  • FIG. 2 is a block diagram showing the configuration of an image synthesis and communication apparatus of a second embodiment of the invention
  • FIG. 3 is a block diagram showing the configuration of an image synthesis and communication apparatus of a third embodiment of the invention.
  • FIG. 4 is a view showing images which are continuously taken in the image synthesization of the invention.
  • FIG. 5 is a block diagram of a feature point extracting section 3 of the image synthesis and communication apparatus of the invention.
  • FIG. 6 is a diagram illustrating an advantage of a technique of the invention in which a point of a high luminance gradient is selected
  • FIG. 7 is a diagram illustrating the configuration of a correlation calculating section 5 of the image synthesis and communication apparatus of the invention.
  • FIG. 8 is a block diagram of a search range determining section 4 of the image synthesis and communication apparatus of the invention.
  • FIG. 9 is a diagram illustrating a procedure of determining a search range for a correlation calculation in the invention.
  • FIG. 10 is a diagram illustrating a procedure of detecting an affine transformation in the invention.
  • FIG. 11 is a diagram illustrating a problem of the block matching method in the invention.
  • FIG. 12 is a flowchart of a synthesizing process in the invention.
  • FIG. 13 is a diagram illustrating a manner of a synthesized image in the invention.
  • FIG. 14 is a timing chart illustrating states of signal lines in the case where data are transmitted in the invention.
  • FIG. 15 is a timing chart illustrating an operation of transmitting a synthesized image in the invention.
  • FIG. 16 is a diagram illustrating an operation of transmitting an updated image in the invention.
  • FIG. 17 is a flowchart illustrating a procedure of determining transmission pixels in the invention.
  • FIG. 18 is a flowchart illustrating a procedure of storing reception pixels in the invention.
  • FIG. 19 is a view diagrammatically showing the representative point method of the prior art.
  • FIG. 20 is a view diagrammatically showing the block matching method of the prior art.
  • FIG. 1 is a block diagram showing an image synthesis and communication apparatus of a first embodiment of the invention.
  • the image synthesis and communication apparatus comprises: an imaging section 1 which is realized by an optical system for taking an image, a CCD, or the like; an image storing section 2 which stores image data transferred from the imaging section 1 ; a feature point extracting section 3 which extracts a point used for a pixel calculation; a search range determining section 4 which determines a search range for a correlation calculation; a correlation calculating section 5 which obtains correlations among pixels and outputs a motion amount; an image synthesizing section 36 which synthesizes an image; an image transmitting section 30 which transmits the synthesized image; a control section 31 which controls the components 1 to 5 , 30 , and 36 ; and an image receiving section 32 which receives the synthesized image.
  • the image receiving section 32 may be disposed integrally with the image transmitting section 30 in the image synthesis and communication apparatus which is used mainly as a transmission side. Alternatively, the image receiving section may be independently disposed so as to be separated from the apparatus, or may be disposed in another image synthesis and communication apparatus existing in a remote place.
  • FIG. 4 shows midway portions of images which are continuously taken at constant time intervals.
  • the reference numerals T 1 , T 2 , T 3 , . . . designate start times of the image capturing. Captured images are indicated by reference numerals F 1 , F 2 , F 3 , . . . . During a period between times T 1 and T 2 , the image F 1 is captured through the imaging section 1 and then stored into the image storing section 2 . At the same time, the image is sent also to the feature point extracting section 3 and feature points of the image F 1 are extracted.
  • the image F 2 is captured and then stored into the image storing section 2 .
  • the image is sent also to the feature point extracting section 3 and all feature points of the image F 2 are extracted.
  • FIG. 5 shows the configuration of the feature point extracting section 3 .
  • the feature point extracting section 3 comprises: a line memory 7 which concurrently reads adjacent pixels; an adder-subtractor 8 which obtains a luminance gradient; an absolute value calculator 9 which obtains the absolute value of the luminance gradient; a comparator 10 which judges whether the luminance gradient exceeds a threshold or not; a feature point information register 11 which stores the luminances and coordinates of obtained feature points; and a search range controlling section 12 which controls the search range.
  • a high luminance gradient means a large difference between adjacent pixels and corresponds to an edge portion of a character in a document image. It is assumed that the absolute value of a difference is used in a correlation calculation.
  • the feature point extraction is performed on the basis of a judgement whether the luminance gradient exceeds a certain threshold or not, i.e., whether the absolute difference of adjacent pixels exceeds the certain threshold or not. If the difference exceeds the certain threshold, the luminance and coordinates of the feature point are transferred to the search range determining section 4 .
  • data of each pixel are first read into the line memory 7 in synchronization with the data transfer from the imaging section 1 to the image storing section 2 .
  • the line memory 7 has a buffer for one line and is configured so that a certain pixel and its 4-neighboring pixels are simultaneously referred to by the adder-subtractor 8.
  • the adder-subtractor 8 obtains the difference between adjacent pixel values and the absolute value calculator 9 obtains the absolute value of the difference.
  • the absolute value is transferred to the comparator 10 .
  • the comparator 10 judges whether the absolute value is larger than the threshold or not.
  • the luminance and coordinates of an applicable pixel, and a feature point number indicative of the order of extraction of the feature point, are stored into the feature point information register 11.
  • the search range controlling section 12 is used for preventing a feature point which is further inside the search region than the coordinates stored in the feature point information register 11 , from being newly stored.
  • This can be realized by a control in which a feature point on the reference side is uniquely determined for an image of the search side. Specifically, such a control can be realized by obtaining the distances in the x and y directions between the coordinates of the feature point stored in the feature point information register 11 and those of the feature point to be obtained, and, when the distance is equal to or smaller than a fixed value, not setting the latter point as a feature point.
  • the feature point extraction can be performed in parallel with the operation of capturing an image.
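  • The extraction described above can be summarized by the following sketch, assuming an 8-bit grayscale numpy image; the threshold, spacing, and point-count values, and the use of a horizontal difference only (rather than all 4-neighboring pixels), are illustrative assumptions.

    import numpy as np

    def extract_feature_points(img, threshold=30, min_dist=16, max_points=64):
        """Collect (number, x, y, luminance) tuples, mimicking the feature
        point information register 11."""
        feats = []
        h, w = img.shape
        for y in range(h):
            for x in range(w - 1):
                # adder-subtractor 8 and absolute value calculator 9
                grad = abs(int(img[y, x + 1]) - int(img[y, x]))
                if grad <= threshold:  # comparator 10
                    continue
                # search range controlling section 12: skip points too close
                # to an already stored feature point
                if any(abs(x - fx) <= min_dist and abs(y - fy) <= min_dist
                       for _, fx, fy, _ in feats):
                    continue
                feats.append((len(feats), x, y, int(img[y, x])))
                if len(feats) >= max_points:
                    return feats
        return feats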
  • FIG. 7 shows a feature point obtained by the feature point extracting section 3 , and a concept of a correlation calculation.
  • the upper left image is the image F 1 which is taken at time T 1
  • the upper right image is the image F 2 which is taken at time T 2 .
  • FIG. 8 is a block diagram showing the search range determining section 4 .
  • the search range determining section 4 comprises: a coordinate register 13 which stores coordinates of feature points transferred from the feature point extracting section 3 ; a previous-frame motion amount storing section 14 which stores the motion amount obtained by the correlation calculating section 5 ; an address generator 15 which generates coordinates corrected by the motion amount of the previous frame; and an address converter 16 which performs a conversion process on the basis of the values of the address generator 15 and the coordinate register 13 , to obtain an address in which the upper right end point of the search range is set as the origin, and a feature point number.
  • the relative position is obtained in the search range, and the results are sent in synchronization with the luminance of the corresponding address to the correlation calculating section 5 .
  • the feature point extracting section 3 determines a middle point of the portion “ ” of “ (Chinese character)” which is displayed, as a feature point, and the address of the feature point is sent to the coordinate register 13 .
  • the address generator 15 converts the address into an address which is corrected by the motion amount. This results in the generation of an image F 2 ′ of FIG. 7.
  • the address converter 16 generates an image of M × N pixels which is centered at the same coordinates as those of the feature point. This corresponds to the lower right image in FIG. 7. Results of a correlation calculation on the luminance of the feature point of the image F1 and the luminance of the M × N pixels of the image F2′ captured from the imaging section are outputted. For each of the other feature points of the same frame, similarly, an image of M × N pixels is generated and a correlation calculation is performed.
  • the correlation calculating section 5 will be described.
  • the correlation calculating section 5 obtains the motion amount (x, y) which minimizes the following expression over the M × N search range:

    $$E(x, y) = \sum_{i=1}^{k} \left| S(x, y)(i) - R(i) \right|$$

  • where k is the number of the feature points;
  • M is the search width of a motion amount in the lateral direction;
  • N is the search width of a motion amount in the vertical direction;
  • S(x, y)(i) is the luminance of the pixel on the search side corresponding to the i-th feature point in the case where the motion amount is (x, y); and
  • R(i) is the luminance of feature point i.
  • FIG. 9 is a block diagram of the correlation calculating section 5 .
  • the correlation calculating section 5 comprises: a feature point register 17 which stores the luminance of the feature point obtained by the feature point extracting section 3 ; a pixel calculation section 18 which obtains a correlation of pixels; an accumulation memory 19 which stores an accumulation value; and a minimum value detecting section 20 which obtains the minimum value.
  • the corresponding feature point of the feature point register 17 is selected and sent to the pixel calculation section 18 .
  • a corresponding section of the accumulation memory 19 is selected according to the coordinates in the range.
  • when the luminance is given, the correlation value is obtained by a difference-sum operation, and the obtained value is returned to the accumulation memory 19.
  • the minimum value detecting section 20 detects the portion of the minimum correlation value from the M × N pixels on the search side of the accumulation memory 19, and outputs it as a motion amount. According to this configuration, a motion amount can be accurately detected in real time.
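  • A hedged software sketch of this motion detection, reusing the (number, x, y, luminance) tuples of the extraction sketch above and searching ±m by ±n pixels around each motion-corrected feature point position; the names and the previous-frame correction argument are illustrative.

    import numpy as np

    def detect_motion(feats, search_img, prev_motion=(0, 0), m=8, n=8):
        px, py = prev_motion                    # previous-frame correction
        acc = np.zeros((2 * n + 1, 2 * m + 1))  # accumulation memory 19
        sh, sw = search_img.shape
        for _, fx, fy, lum in feats:            # feature point register 17
            for dy in range(-n, n + 1):
                for dx in range(-m, m + 1):
                    sx, sy = fx + px + dx, fy + py + dy
                    if 0 <= sx < sw and 0 <= sy < sh:
                        # pixel calculation section 18: difference-sum
                        acc[dy + n, dx + m] += abs(int(search_img[sy, sx]) - lum)
        # minimum value detecting section 20
        dy, dx = np.unravel_index(np.argmin(acc), acc.shape)
        return px + dx - m, py + dy - n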
  • the search range can be arbitrarily set, and hence a modification may be done, for example, in the following manner.
  • the section where the coordinates in the range to be outputted from the search range determining section 4 are generated is slightly changed.
  • rectangular regions L 1 and L 2 on the reference side are set.
  • for each of the regions, extraction of a feature point and determination of the search range are performed.
  • a correlation calculation is performed on each region, and two motion amounts are detected.
  • by using the two motion amounts, it is possible to obtain parameters of an affine transformation including rotation, expansion, and reduction.
  • a feature amount is obtained from each of the rectangular regions L 1 and L 2 , a correlation calculation with respect to an image on the search side is performed, and rectangular regions R 1 and R 2 which respectively coincide with the rectangular regions L 1 and L 2 are obtained.
  • assuming that the center coordinates of the rectangular regions L1 and L2 are (X(F1, 1), Y(F1, 1)) and (X(F1, 2), Y(F1, 2)), and those of the rectangular regions R1 and R2 are (X(F2, 1), Y(F2, 1)) and (X(F2, 2), Y(F2, 2)), corresponding coordinates (X_L, Y_L) on the reference side and (X_R, Y_R) on the search side are related by the following expressions:
    $$\begin{bmatrix} X_R \\ Y_R \end{bmatrix} = \frac{1}{X_{F1,1} Y_{F1,2} - X_{F1,2} Y_{F1,1}} \begin{bmatrix} X_{F2,1} Y_{F1,2} - X_{F2,2} Y_{F1,1} & X_{F2,2} X_{F1,1} - X_{F2,1} X_{F1,2} \\ Y_{F2,1} Y_{F1,2} - Y_{F2,2} Y_{F1,1} & Y_{F2,2} X_{F1,1} - Y_{F2,1} X_{F1,2} \end{bmatrix} \begin{bmatrix} X_L \\ Y_L \end{bmatrix} \tag{3}$$

  • [0110] (where $X_{F1,1}$ has the same meaning as X(F1, 1) in the description)

    $$\begin{bmatrix} X_L \\ Y_L \end{bmatrix} = \frac{1}{X_{F2,1} Y_{F2,2} - X_{F2,2} Y_{F2,1}} \begin{bmatrix} X_{F1,1} Y_{F2,2} - X_{F1,2} Y_{F2,1} & X_{F1,2} X_{F2,1} - X_{F1,1} X_{F2,2} \\ Y_{F1,1} Y_{F2,2} - Y_{F1,2} Y_{F2,1} & Y_{F1,2} X_{F2,1} - Y_{F1,1} X_{F2,2} \end{bmatrix} \begin{bmatrix} X_R \\ Y_R \end{bmatrix} \tag{4}$$
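  • Expression (3) amounts to solving a 2 × 2 linear system from the two region correspondences; the following numpy sketch reflects that reading (the sample coordinates are made up).

    import numpy as np

    def affine_from_two_points(l1, l2, r1, r2):
        """Return the 2 x 2 matrix mapping reference coordinates (X_L, Y_L)
        to search coordinates (X_R, Y_R), given the matched region centers;
        the two reference centers must not be colinear with the origin."""
        L = np.array([[l1[0], l2[0]], [l1[1], l2[1]]], dtype=float)
        R = np.array([[r1[0], r2[0]], [r1[1], r2[1]]], dtype=float)
        return R @ np.linalg.inv(L)

    M = affine_from_two_points((40, 30), (200, 180), (42, 35), (205, 188))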
  • the time period required for detecting feature points is proportional to the area of an image to be referenced, and that required for a correlation calculation is proportional to the area of the search region and the number of the feature points.
  • when α and β denote the unit calculation amounts of the feature point method and the block matching method, respectively, which are substantially equal to each other, the time period required for a calculation in the feature point method is 88,000α.
  • the block matching method requires a calculation amount which is 300 or more times that required in the feature point method.
  • FIG. 12 is a flowchart showing the flow of a synthesizing process.
  • the image storing section 2 is used as a memory for storing a synthesized image. Another region which is different from that for storing the image sent from the imaging section 1 is previously allocated to a synthesized image.
  • the other region corresponds to a synthesized-image memory 37 shown in FIG. 13.
  • the upper left end point of the synthesized-image memory 37 is set as the origin.
  • a pixel at position (x, y) is indicated by f(x, y).
  • a memory can be accessed by designating an address.
  • when the address of the origin of the synthesized-image memory 37 is z and the width of the memory is w, the address of f(x, y) is represented by z + y × w + x.
  • the first image F 1 is overwritten onto the synthesized-image memory 37 so that the upper left point of F 1 is located at (x 0 , y 0 ) of the synthesized-image memory 37 (step S 1 ).
  • Initial position (x 0 , y 0 ) is previously determined. It is assumed that a pixel of F 1 can be accessed by g 1 (x, y). In the same manner as f(x, y), the address of each pixel can be calculated by using the width of F 1 and the address of the origin.
  • FIG. 13 shows a state in which the images F 1 , F 2 , and F 3 are overwritten.
  • since the motion amount obtained from the correlation calculating section 5 includes not only parallel movement but also rotation, expansion, and reduction, it is obtained in the form of the matrix of an affine transformation described above. It is assumed that, in the image F1 and the second image F2, the correspondence positions in the two regions of each of the images, i.e., the rectangular regions L1 and L2, and R1 and R2, are previously obtained.
  • the values of the matrix representing the motion between the first and second images F1 and F2 are obtained by the correlation calculating section 5 in accordance with the above-mentioned expression of an affine transformation (step S4). [0124]
  • the matrix is expressed by:

    $$\begin{bmatrix} A_{1,2} & B_{1,2} \\ C_{1,2} & D_{1,2} \end{bmatrix} \tag{6}$$
  • a pixel position in the second image F 2 is indicated by (X F2 , Y F2 ), and a corresponding pixel position in the synthesized image to be overwritten is indicated by (X′ F2 , Y′ F2 ).
  • in step S5, the pixel position (X_F2, Y_F2) in the second image F2 corresponding to position (X′_F2, Y′_F2) is obtained.
  • unless (X_F2, Y_F2) is an integer pair, the fractional portion of (X_F2, Y_F2) is rounded off so that (X_F2, Y_F2) becomes an integer, i.e., the most neighboring point is used instead.
  • using round( ), the most neighboring pixel position of (X_F2, Y_F2) is indicated by (round(X_F2), round(Y_F2)).
  • pixel f(X′_F2, Y′_F2) at position (X′_F2, Y′_F2) in the synthesized-image memory 37 may be overwritten by pixel g2(round(X_F2), round(Y_F2)) in the second image F2, and can be indicated as follows (step S6):

    $$f(X'_{F2}, Y'_{F2}) = g_2(\mathrm{round}(X_{F2}), \mathrm{round}(Y_{F2})), \qquad 0 \le \mathrm{round}(X_{F2}) < w_2,\ 0 \le \mathrm{round}(Y_{F2}) < h_2$$

  • where:
  • w 2 is the width of the second image F 2
  • h 2 is the height of the second image F 2 .
  • in step S7, it is checked whether the corresponding pixel value of the second image F2 can be obtained, at all positions of the synthesized image. Thereafter, it is checked whether, in the synthesized image, the pixel overlaps with the first image F1 which is the previous image or not. This can be done by checking whether the following expressions are satisfied or not (step S11):

    $$x_0 \le X'_{F2} < x_0 + w_1, \qquad y_0 \le Y'_{F2} < y_0 + h_1$$

    (where w1 and h1 are the width and height of the first image F1)
  • a pixel position in the third image F3 is indicated by (X_F3, Y_F3), a corresponding pixel position in the synthesized image to be overwritten is indicated by (X′_F3, Y′_F3), and the motion information between the second and third images F2 and F3 is indicated by:

    $$\begin{bmatrix} A_{2,3} & B_{2,3} \\ C_{2,3} & D_{2,3} \end{bmatrix} \tag{11}$$
  • the judgement on which pixel in the synthesized image is to be overwritten can be performed in the same manner as in the case of the second image F2. First, it is checked whether the point in the synthesized image is within the third image F3 or not, by using the following expressions (step S7):
  • if all the above expressions are satisfied, the position is within the second image F2, and the portion of the third image F3 overlaps with the second image F2 (steps S10 and S11).
  • in steps S13 and S14, each point of the synthesized image is checked as to whether it is a point corresponding to the third image F3 and not corresponding to the second image F2. A point which is true in the check is judged to be overwritable, and the point is overwritten by the pixel value of the third image F3 (step S12).
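  • A compact sketch of this overwrite logic for one new frame, assuming 2 × 2 matrices that map synthesized-image coordinates (taken relative to the initial position (x0, y0)) into the new and the previous frame, as in the expressions above; all names are illustrative.

    import numpy as np

    def overwrite(synth, new_img, to_new, to_prev=None, prev_shape=None,
                  origin=(0, 0)):
        x0, y0 = origin
        h2, w2 = new_img.shape[:2]
        for yp in range(synth.shape[0]):
            for xp in range(synth.shape[1]):
                p = np.array([xp - x0, yp - y0], dtype=float)
                xf, yf = to_new @ p
                xi, yi = int(round(xf)), int(round(yf))
                if not (0 <= xi < w2 and 0 <= yi < h2):
                    continue                     # outside the new frame (step S7)
                if to_prev is not None and prev_shape is not None:
                    xq, yq = to_prev @ p         # overlap check (steps S10, S11)
                    if (0 <= round(xq) < prev_shape[1]
                            and 0 <= round(yq) < prev_shape[0]):
                        continue                 # overlaps the previous image
                synth[yp, xp] = new_img[yi, xi]  # overwritable point (step S12)
        return synth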
  • for the fourth image F4, the pixel position (X_F4, Y_F4) corresponding to position (X′_F4, Y′_F4) in the synthesized image is obtained by chaining the frame-to-frame matrices:

    $$\begin{bmatrix} X_{F4} \\ Y_{F4} \end{bmatrix} = \begin{bmatrix} A_{3,4} & B_{3,4} \\ C_{3,4} & D_{3,4} \end{bmatrix} \begin{bmatrix} A_{2,3} & B_{2,3} \\ C_{2,3} & D_{2,3} \end{bmatrix} \begin{bmatrix} A_{1,2} & B_{1,2} \\ C_{1,2} & D_{1,2} \end{bmatrix} \begin{bmatrix} X'_{F4} - x_0 \\ Y'_{F4} - y_0 \end{bmatrix} \tag{17}$$
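  • In code, the chaining of expression (17) is a plain matrix product (the values below are illustrative only):

    import numpy as np

    m_1_2 = np.array([[1.00, 0.00], [0.00, 1.00]])
    m_2_3 = np.array([[0.99, -0.05], [0.05, 0.99]])
    m_3_4 = np.array([[1.01, 0.02], [-0.02, 1.01]])
    # maps (X'_F4 - x0, Y'_F4 - y0) in the synthesized image to (X_F4, Y_F4)
    to_f4 = m_3_4 @ m_2_3 @ m_1_2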
  • the image transmitting section 30 receives synthesized image information from the image synthesizing section 36 , and transmits the information.
  • the communication line 40 is an example of a simple serial connection consisting of only five lines for a clock signal, an enable signal, a data signal, a ground voltage, and a supply voltage (+5 V).
  • the ground voltage and the supply voltage are constant independent of time.
  • the polygonal lines respectively corresponding to the signals show the voltage levels of the signals.
  • the lower side is the ground level
  • the upper side is the supply voltage level.
  • an operation of setting the voltage of each signal to the supply voltage level is expressed by using a term “raise,” an operation of setting the voltage to the ground level is expressed by using a term “lower,” the ground level is expressed as “Low,” and the supply voltage level is expressed as “High.”
  • the Low and High states appear alternately at equal time intervals.
  • the synthesized image has a width of W and a height of H
  • the upper left end point of the synthesized image is set as the origin
  • the rightward direction from the origin is +X direction
  • the downward direction is +Y direction.
  • Each pixel consists of the three primary colors of RGB.
  • the R component is indicated by f_R(x, y)
  • the G component by f_G(x, y)
  • the B component by f_B(x, y).
  • the value of each component is expressed by using one byte.
  • the width W of the image is transmitted.
  • the value of W consists of 2 bytes, and the bits are transmitted in the sequence of the most significant bit (bit 15 ) to the least significant bit (bit 0 ).
  • the enable signal is raised as shown at time T 0 in the figure.
  • the enable signal is lowered.
  • the timing when the enable signal is lowered is set to be coincident with a timing when the clock signal is lowered, such as time T 1 .
  • a start bit is transmitted. In other words, the signal level is set to be Low at time T 2 , High at time T 3 , and Low at time T 4 .
  • bits indicative of W are set to be Low and High, respectively.
  • the bits from bit 15 to bit 0 are sequentially set at a falling edge of the clock signal. In the same manner as the start bit, a stop bit is then transmitted. Finally, the enable signal is raised, and the transmission of the data set is ended.
  • the height H of the image is transmitted.
  • the procedure of the transmission is strictly identical with that of W. Namely, data of the height H are transmitted in place of the data of W.
  • FIG. 15 is a view illustrating the transmission.
  • the timings of the clock signal and the enable signal are identical with those of FIG. 14, and hence these signals are not shown.
  • the time advances in the direction from the left side to the right side, and from the top to the bottom.
  • data are transmitted interposed between the start and stop bits. Since each of the R, G, and B components consists of 8-bit data, data of each pixel are transmitted in the form of three separate blocks, in the sequence of R, G, and B.
  • the transmission of all the pixels can be realized by performing the transmission of 8-bit data (H × W × 3) times.
  • the sequence of the transmission of the pixels is previously determined. For example, f R (0, 0) is first transmitted, and f G (0, 0) and f B (0, 0) are then transmitted in sequence. Thereafter, f R (1, 0), f G (1, 0), and f B (1, 0) are transmitted, and f R (2, 0), f G (2, 0), and f B (2, 0) are then transmitted.
  • after f_B(W − 1, 0) at the right end is transmitted, data of the next line are transmitted in the sequence starting from f_R(0, 1) at the left end, in the same manner.
  • the data which are finally transmitted are f_B(W − 1, H − 1).
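  • A software model of this framing, assuming the bit patterns described above; one list entry stands for one clock period of the data line, and enable-line handling is elided, so this is a sketch rather than a full line driver.

    LOW, HIGH = 0, 1

    def frame(value, nbits):
        """One data set: start pattern, payload MSB-first, stop pattern."""
        bits = [LOW, HIGH, LOW]                                   # start bit
        bits += [(value >> i) & 1 for i in range(nbits - 1, -1, -1)]
        bits += [LOW, HIGH, LOW]                                  # stop bit
        return bits

    def transmit_image(pixels, w, h):
        """pixels[y][x] = (r, g, b); returns the data-line level sequence."""
        stream = frame(w, 16) + frame(h, 16)  # total width, then height
        for y in range(h):
            for x in range(w):
                for comp in pixels[y][x]:     # R, G, B order
                    stream += frame(comp, 8)
        return stream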
  • on the reception side, the enable signal is monitored.
  • when the enable signal becomes Low, the value of the data signal is read at a rising edge of the clock signal.
  • when the reading of the signal is started, it is confirmed whether the signal changes in the sequence of Low, High, and Low, i.e., whether the start bit is transmitted or not.
  • data of bits of a predetermined number are read.
  • when the enable signal becomes High, the reading of the value of the signal is ended.
  • the predetermined bit number is 16 because the data indicative of the total width and height consist of 16 bits, respectively.
  • the predetermined bit number is 8 because these data are pixel data.
  • the total width W and height H are known.
  • the reception of 8-bit data is performed (W × H × 3) times.
  • the received pixel data are interpreted in a predetermined sequence.
  • f_R(0, 0) is first interpreted, and thereafter f_G(0, 0), f_B(0, 0), and f_R(1, 0) are interpreted in this sequence.
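  • The matching receiver-side sketch under the same assumptions; error handling and enable-signal monitoring are elided.

    def read_frame(stream, pos, nbits):
        assert stream[pos:pos + 3] == [0, 1, 0]    # confirm the start bit
        value = 0
        for b in stream[pos + 3:pos + 3 + nbits]:  # payload, MSB first
            value = (value << 1) | b
        assert stream[pos + 3 + nbits:pos + 6 + nbits] == [0, 1, 0]  # stop bit
        return value, pos + 6 + nbits

    def receive_image(stream):
        pos = 0
        w, pos = read_frame(stream, pos, 16)  # total width
        h, pos = read_frame(stream, pos, 16)  # total height
        img = [[None] * w for _ in range(h)]
        for y in range(h):
            for x in range(w):                # interpret in R, G, B order
                r, pos = read_frame(stream, pos, 8)
                g, pos = read_frame(stream, pos, 8)
                b, pos = read_frame(stream, pos, 8)
                img[y][x] = (r, g, b)
        return img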
  • FIG. 2 is a block diagram showing the configuration of an image synthesis and communication apparatus of a second embodiment of the invention.
  • the image synthesis and communication apparatus comprises: an imaging section 1 which is realized by an optical system for taking an image, a CCD, or the like; an image transmitting section 30 which transmits an image transferred from the imaging section 1 ; a control section 33 which controls the components of the transmission side; an image receiving section 32 which receives the image transmitted from the image transmitting section 30 ; an image storing section 2 which stores image data transferred from the image receiving section 32 ; a feature point extracting section 3 which extracts a point used for a pixel calculation; a search range determining section 4 which determines a search range for a correlation calculation; a correlation calculating section 5 which obtains correlations among pixels and outputs a motion amount; an image synthesizing section 36 which synthesizes an image; and a control section 31 which controls the components of the reception side.
  • the imaging section 1 , the image transmitting section 30 , and the control section 33 which are shown in FIG. 2 may be disposed integrally with or separately from the image synthesis and communication apparatus on the image information reception side. Alternatively, these sections may be disposed in another image synthesis and communication apparatus.
  • in the second embodiment, images taken by the imaging section 1 are transmitted and received as they are. The transmission and reception can be realized by a communication method which is strictly identical with that used in the above-described transmission and reception of a synthesized image.
  • the width W and the height H of an image taken by the imaging section 1 are first transmitted, and pixel information of (W × H × 3) bytes is then transmitted.
  • the image receiving section 32 also processes received data in strictly the same manner as the image receiving section 32 of the first embodiment.
  • the images received by the image receiving section 32 are stored in the image storing section 2 , and then synthesized together by the image synthesizing section 36 .
  • the synthesization may be performed in the same manner as that of the first embodiment.
  • FIG. 3 is a block diagram showing the configuration of an image synthesis and communication apparatus of a third embodiment of the invention.
  • the image synthesis and communication apparatus comprises: an imaging section 1 which is realized by an optical system for taking an image, a CCD, or the like; an image storing section 2 which stores image data transferred from the imaging section 1 ; a feature point extracting section 3 which extracts a point used for a pixel calculation; a search range determining section 4 which determines a search range for a correlation calculation; a correlation calculating section 5 which obtains correlations among pixels and outputs a motion amount; an update image calculating section 35 which calculates a portion (update image) that does not overlap with the previous image, on the basis of the motion amount obtained by the correlation calculating section 5 and the image data of the image storing section 2 ; an image transmitting section 30 which transmits the motion amount obtained from the correlation calculating section 5 , and the update image obtained by the update image calculating section 35 ; a control section 31 which controls the components of the transmission side; an image receiving section 32 which receives the motion amount and the update image transmitted from the image transmitting section 30 ; an image storing section 34 which stores the image data transferred from the image receiving section 32 ; and an image synthesizing section 36 which synthesizes an image from the received motion amount and partial image.
  • FIG. 16 is a diagram illustrating the format of transmission and reception information of the image synthesis and communication apparatus of the third embodiment of the invention.
  • Information to be transmitted and received includes the motion amount obtained from the correlation calculating section 5 , and the partial image obtained from the update image calculating section 35 .
  • in the actual transmission and reception, lines such as an enable signal, a clock signal, a supply voltage, and a ground level, as well as a start bit and a stop bit in a data signal serving as a transmission sequence, and the like are required. The procedure is identical with that of the first embodiment, and hence only the data portion is illustrated.
  • the motion amount may be the values of the 2 × 2 matrix which has been used in the description of the image synthesization.
  • when the synthesization is to be performed without rotation, expansion, and reduction, using only a parallel movement amount, only shift amounts in the X and Y directions are required.
  • values of a matrix for performing an affine transformation between the previous image and the current image are transmitted.
  • for the third image F3, for example, a transformation between the second image F2 and the third image F3 is to be performed.
  • A2,3, B2,3, C2,3, and D2,3 are sequentially transmitted.
  • the bit number depends on the required accuracy.
  • 32 bits are sufficient.
  • width and height of an image are transmitted.
  • these values are 2-byte data.
  • a partial image obtained from the update image calculating section 35 is transmitted.
  • the update image calculating section 35 calculates a portion which does not overlap with the previous image.
  • the manner of the calculation is substantially identical with the method described in conjunction with the image synthesizing section 36 .
  • transformation expressions from (x, y) to (x′, y′) are as follows.
  • (x1, y1) and (x2, y2) are points in the third image F3 and correspond to points (x′1, y′1) and (x′2, y′2) in the second image F2, respectively.
    $$\begin{bmatrix} x' \\ y' \end{bmatrix} = \frac{1}{x_1 y_2 - x_2 y_1} \begin{bmatrix} x'_1 y_2 - x'_2 y_1 & x'_2 x_1 - x'_1 x_2 \\ y'_1 y_2 - y'_2 y_1 & y'_2 x_1 - y'_1 x_2 \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix}$$

  • when $0 \le \mathrm{round}(x') < w_2$ and $0 \le \mathrm{round}(y') < h_2$ hold, the points overlap with each other. This process is performed on all points in the third image F3, and it is then possible to know which points in the third image F3 do not overlap with the second image F2.
  • a point which does not overlap with the previous image is called a transmission pixel, and that which overlaps with the previous image is called a nontransmission pixel.
  • the transmission must be performed so that the reception side can distinguish a transmission pixel from a nontransmission pixel.
  • a mask image may be separately transmitted. With that alternative, however, the amount of data to be transmitted is increased. Therefore, the transmission is performed by using the feature that regions which do not overlap with the previous image are usually continuous regions.
  • FIG. 17 is a flowchart illustrating a procedure of judging a transmission pixel and a nontransmission pixel.
  • the position “judgement point” where the judgement on a transmission pixel and a nontransmission pixel is started is set to be the upper left point of the third image F 3 (step S 21 ).
  • “number of pixels before the next transmission pixel” in the line is counted in the rightward direction as seen from the judgement point (step S 22 ).
  • if the judgement point is a transmission pixel, the number is 0.
  • if the judgement point is a nontransmission pixel, a p number of nontransmission pixels are continuous, and a transmission pixel exists after the nontransmission pixels, “number of pixels before the next transmission pixel” is p.
  • if no transmission pixel exists in the remainder of the line, the number of remaining pixels in the line is set to be “number of pixels before the next transmission pixel.”
  • when “number of pixels before the next transmission pixel” is obtained, the number is expressed as 2-byte data and then transmitted.
  • the judgement point is then advanced rightward by “number of pixels before the next transmission pixel” (step S23). If it is judged in step S24 that, at this timing, the judgement point reaches the right end of the line, the control proceeds to step S25. If the line is not the last line, the control proceeds to step S26. The left end point of the next line is set to be the judgement point, and the control then returns to the process of step S22 of counting “number of pixels before the next transmission pixel.” In the example of FIG. 16, after “value (t) before the next transmission pixel” is transmitted, the judgement point is jumped to the next line. If it is judged in step S24 that the judgement point has not yet reached the right end, the control proceeds to step S27.
  • the judgement point is set to be the next transmission pixel, and “number of pixels before the next nontransmission pixel” is counted in the rightward direction as seen from the judgement point. If a q number of transmission pixels are continuous, the number is q. Then, “number of pixels before the next nontransmission pixel” is expressed as a 2-byte data and then transmitted (step S 27 ).
  • next, data of the pixels, i.e., the R, G, and B values of as many pixels as the counted number, are sequentially transmitted.
  • data starting from R(p, 0) and ending at B(p+q−1, 0) are transmitted (step S28).
  • if it is judged in step S29 that, at this timing, the judgement point reaches the right end of the line, and the line is not the last line, the left end point of the next line is set to be the judgement point, and the control then returns to step S25. If the judgement point has not yet reached the right end, the judgement point is set to be the next nontransmission pixel, and the control then returns to the initial process, i.e., the process in step S22 of counting “number of pixels before the next transmission pixel.” If it is judged in step S25 that the judgement point reaches the right end of the last line, the process is totally ended. In this way, an image of a portion which does not overlap with the previous image can be transmitted.
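  • The whole judgement loop amounts to a run-length encoding of the transmission mask; the following is a sketch in which mask[y][x] is True for a transmission pixel and img[y][x] is an (r, g, b) triple. The record format is illustrative and is not the byte-level wire format described above.

    def encode_update(img, mask):
        out = []  # alternating skip counts and (count, pixels) records
        h, w = len(mask), len(mask[0])
        for y in range(h):
            x = 0
            while x < w:
                run = 0                    # nontransmission run (step S22)
                while x + run < w and not mask[y][x + run]:
                    run += 1
                out.append(('skip', run))  # pixels before next transmission pixel
                x += run
                if x >= w:
                    break                  # jump to the next line
                run = 0                    # transmission run (step S27)
                while x + run < w and mask[y][x + run]:
                    run += 1
                out.append(('send', run,
                            [img[y][x + i] for i in range(run)]))
                x += run                   # pixel data follow (step S28)
        return out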
  • FIG. 18 is a flowchart illustrating a procedure of storing reception pixels.
  • a region for storing received images is previously reserved in the image storing section 34 on the reception side in FIG. 3.
  • the size of the region is determined on the basis of the width and height of the images which have been already received.
  • pixel data can be written at a specific position in the storage region by determining a position “drawing point” relative to the origin, i.e., the upper left end of the reserved region. First, the drawing point is set to be the upper left end of the reserved storage region (step S31).
  • first, “number of pixels before the next transmission pixel” is received (step S32). Since pixels of this number are nontransmission pixels and are not transmitted, the drawing point is shifted rightward by this number (step S33). In step S34, it is judged whether the drawing point reaches the right end or not. If the drawing point reaches the right end, the control proceeds to step S35 to judge whether the line is the last line or not. If the line is not the last line, the drawing point is set in step S36 to be the left end of the next line. The control then returns to step S32, the process of receiving the value before the next transmission pixel.
  • if it is judged in step S34 that the drawing point has not yet reached the right end, the control proceeds to step S37 and “number of pixels before the next nontransmission pixel” is received.
  • in step S38, since data of pixels of this number are transmitted, the data are received and, concurrently with the data reception, written into the image storing section 34. Specifically, since data of each pixel are received in the sequence of R, G, and B, the pixel data are written at the position of the drawing point by using these values. Next, the drawing point is shifted rightward by one, and the R, G, and B data of the next pixel are received. The above process is repeatedly performed for each of the pixels.
  • next, it is judged in step S39 whether the drawing point reaches the right end or not. If the drawing point reaches the right end, the control proceeds to step S35 to judge whether the line is the last line or not. If the line is not the last line, the drawing point is set to be the left end of the next line. The control then returns to step S32, the process of receiving the value before the next transmission pixel. If it is judged in step S35 that the drawing point reaches the right end of the last line, the reception of the image data is ended.
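  • The receiver-side counterpart of the sketch above: skip counts advance the drawing point, and send records are written into the stored image.

    def decode_update(stored, records, w, h):
        it = iter(records)
        for y in range(h):
            x = 0
            while x < w:
                rec = next(it)
                if rec[0] == 'skip':   # steps S32 and S33
                    x += rec[1]
                else:                  # steps S37 and S38
                    _, run, pixels = rec
                    for i in range(run):
                        stored[y][x + i] = pixels[i]
                    x += run
        return stored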
  • in the synthesization on the reception side, the transmitted data may be used as they are. Although not all of the pixel data of each image are transmitted, the partial image data recorded in the image storing section are sufficient for the synthesization, because only a portion which does not overlap with the previous image is used in the synthesization. Therefore, an image can be synthesized from the received data.

Abstract

In the case where an image of a document or the like is taken by an economical camera in a TV conference system and the image is transmitted and received through a communication line, when an image of the whole of the document is taken, small characters are defaced and illegible, and, when an imaging operation is performed with a resolution at which characters are readable, it is impossible to take an image of the whole of the document. An image F1 at time T1, which is taken while moving an imaging section, is captured into an image storing section and a feature point extracting section. The feature point extracting section extracts feature points of the image F1. An image F2 at time T2, when the subsequent frame is started, is captured into the image storing section and the feature point extracting section. The image F2 in a region designated by a search range determining section and the feature points of the image F1 are subjected to a calculation in a correlation calculating section to obtain a motion amount, and then a high-resolution image is synthesized.

Description

    BACKGROUND OF THE INVENTION
  • 1. Field of the Invention [0001]
  • The present invention relates to an image synthesis and communication apparatus which continuously captures images by using an imaging apparatus such as a video camera, detects a motion amount of an object image on an imaging plane, synthesizes a wide-angle and high-definition image on the basis of the detected motion amount, and transmits and receives data required for the synthesization. [0002]
  • 2. Description of the Related Art [0003]
  • As a prior art, a technique is well known in which, in order to transmit a document image in a TV conference system, an imaging apparatus dedicated to a drawing, a facsimile apparatus, a scanner, or the like is used in addition to an imaging apparatus for taking an image of attendants. [0004]
  • As another prior art, a technique is well known in which, in order to obtain a wide-angle image in a TV conference system or a TV phone system, the field angle of a lens itself is changed as in the case of a video camera, an imaging apparatus is rotated so as to pan the field of view, or a plurality of still pictures are combined together to be synthesized into one total screen as disclosed in Japanese Unexamined Patent Publication JP-A 59-44184 (1984). [0005]
  • As a further prior art, a technique is well known in which, in order to obtain a high-definition and wide-angle image by an economical imaging apparatus, as disclosed in Japanese Unexamined Patent Publication JP-A 5-260264 (1993), an image of a part of an object is taken with a desired resolution, the object is scanned by rotating or moving the imaging apparatus, and the obtained images are synthesized together. [0006]
  • When images are to be synthesized together, the motion amounts of images must be calculated. This calculation can be performed by several methods. [0007]
  • The representative point method is used mainly in compensation of motion such as a shake of a video camera. [0008]
  • FIG. 19 diagrammatically shows the representative point method. In the method, for continuously captured images, a representative point is set at a fixed position in the image of the previous frame, and, for an image of a current frame, a correlation calculation and an accumulative addition operation are performed on corresponding pixels while conducting a two-dimensional shifting operation. An amount from which the highest calculation value is obtained is detected as a motion amount. [0009]
  • Japanese Unexamined Patent Publication JP-A 6-86149 (1994) discloses a technique in which a Laplacian filter or the like is applied at a preset representative point, a luminance gradient is obtained, and a calculation is performed by using the obtained value, thereby enhancing the accuracy. [0010]
  • A motion amount may be detected also by using the block matching method in which a correlation calculation is performed on duplicated portions of images to determine a position where synthesization is to be conducted. FIG. 20 diagrammatically shows the block matching method. In the method, a specific region to be referred to is set in the image of the previous frame, a correlation calculation is performed on the image of the current frame while conducting a two-dimensional shifting operation, and a motion amount is obtained in the same manner as the representative point method. In the representative point method, it is required only to obtain correlations from several points and then perform an accumulative addition. By contrast, in the block matching method, an accumulative addition must be performed on all points in the specific region, and hence processing of a higher speed is required. [0011]
  • Among cameras dedicated to a drawing and television cameras used in a facsimile apparatus in the prior art, a low-resolution and narrow-angle camera which is economical has a disadvantage that, in an operation of imaging a document or the like, when an image of the whole of the document is taken, small characters are defaced and illegible, and, when an imaging operation is performed with a resolution at which characters are readable, it is impossible to grasp the whole of the document. [0012]
  • The prior art in which the field angle of a lens itself is changed has a disadvantage that, when the field of view is widened, the resolution is impaired. [0013]
  • The prior art in which an imaging apparatus is rotated has a disadvantage that a wide-angle image cannot be obtained by a single operation. [0014]
  • The technique of Japanese Unexamined Patent Publication JP-A 59-44184 (1984) has a disadvantage that an image cannot be synthesized with a high accuracy and synthesization cannot be attained in an imaging operation in which a hand-held camera is swung. The technique has another disadvantage that a synthesizing apparatus must be disposed in a camera. [0015]
  • In the prior art technique disclosed in Japanese Unexamined Patent Publication JP-A 5-260264 (1993) in which partial images are synthesized together to obtain a high-definition and wide-angle image, there is a disadvantage that, when this technique is combined with a communication system, the transmission side must be provided with a synthesizing apparatus. [0016]
  • Even in the case where synthesization is performed in the receiver side, when all partial images are simply transmitted, data of an amount which is greater than that required in synthesization are transmitted. This produces a disadvantage that the transmission amount is wastefully increased. [0017]
• Among the methods of detecting a motion amount, the representative point method requires that each representative point have a luminance gradient of a certain degree. When black and white data such as a document image are to be handled, for example, there arises a problem in that the luminance gradient is low at all representative points and a motion amount cannot be correctly detected. [0018]
  • In other words, the method is effective when the luminance gradient is uniformly distributed in an object image. In the case where the luminance gradient is not uniformly distributed, such as the case of a document having a large background, a low luminance gradient is obtained in the background and hence it is difficult to detect a motion amount. [0019]
  • By contrast, the block matching method has a problem in that the method requires a large amount of calculation and hence it is difficult to detect a motion amount in real time. [0020]
  • SUMMARY OF THE INVENTION
  • It is an object of the invention to provide an image synthesis and communication apparatus which, by using a low-resolution and narrow-angle camera that is economical, can obtain a motion amount between images accurately and rapidly from an object image wherein the luminance gradient is not uniform, such as a document image, synthesize a high-definition and wide-angle image, and handle such an image by means of communication. [0021]
  • In order to attain the object, in a first aspect of the invention there is provided an image synthesis and communication apparatus comprising: [0022]
  • imaging means for inputting images in time series; [0023]
  • image storing means for storing the images inputted from the imaging means; [0024]
  • feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point; [0025]
  • search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation; [0026]
  • correlation calculating means for obtaining correlations between the feature point and pixels in the search range; [0027]
  • image synthesizing means for synthesizing an image on the basis of a motion amount obtained from the correlation calculating means; [0028]
  • image transmitting means for transmitting an image obtained from the image synthesizing means; and [0029]
  • image receiving means for receiving the image transmitted from the image transmitting means. [0030]
  • Preferably, the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames. [0031]
  • Preferably, the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame, the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame, and the correlation calculating means performs a correlation calculation for each of the regions. [0032]
  • In another aspect of the invention, in order to attain the object, there is provided an image synthesis and communication apparatus comprising: [0033]
  • imaging means for inputting images in time series; [0034]
  • image transmitting means for transmitting images obtained from the imaging means; [0035]
  • image receiving means for receiving the images transmitted from the image transmitting means; [0036]
  • image storing means for storing the images inputted from the image receiving means; [0037]
  • feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point; [0038]
  • search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation; [0039]
  • correlation calculating means for obtaining correlations between the feature point and pixels in the search range; and [0040]
  • image synthesizing means for receiving a motion amount obtained from the correlation calculating means, and synthesizing an image. [0041]
  • Preferably, the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames. [0042]
  • Preferably, the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame, the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame, and the correlation calculating means performs a correlation calculation for each of the regions. [0043]
• In still another aspect of the invention, in order to attain the object, there is provided an image synthesis and communication apparatus comprising: [0044]
  • imaging means for inputting images in time series; [0045]
  • image storing means for storing the images inputted from the imaging means; [0046]
  • feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point; [0047]
  • search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation; [0048]
  • correlation calculating means for obtaining correlations between the feature point and pixels in the search range; [0049]
  • update image calculating means for obtaining an image of a newly-imaged portion from the motion amount obtained from the correlation calculating means; [0050]
  • image transmitting means for transmitting the motion amount obtained from the correlation calculating means, and a partial image obtained from the update image calculating means; [0051]
  • image receiving means for receiving the motion amount and the partial image transmitted from the image transmitting means; and [0052]
  • image synthesizing means for synthesizing an image from the motion amount and the partial image obtained from the image receiving means. [0053]
  • Preferably, the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames. [0054]
  • Preferably, the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame, the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame, and the correlation calculating means performs a correlation calculation for each of the regions. [0055]
  • As described above, according to the invention, plural images of a part of an object are taken, the images are synthesized together, and a resulting image is transmitted. Even when a low-resolution and narrow-angle camera is used as the imaging means, therefore, a wide-angle and high-definition image can be transmitted to a remote place. [0056]
  • The place where the image synthesization is performed is not restricted to the transmission side where an image of an object is taken. When obtained images are transmitted as they are from the transmission side and the images are synthesized together in the reception side, a wide-angle and high-definition image can be obtained in the reception side by using a low-resolution and narrow-angle camera and a transmission facility even in the case where the transmission side has only a conventional TV phone system, TV conference system, or the like. [0057]
  • When obtained images are to be transmitted from the transmission side, the obtained images are not transmitted as they are, but only portions of the obtained images which are judged as not overlapping with the previous image by checking the overlapping state of the images with respect to the previous image are transmitted. Therefore, it is possible to transmit only images of a minimum necessary total size, and hence the information amount in the communication can be reduced. [0058]
• Even in the case where the luminance gradient is not uniformly distributed, a motion amount among images in time series, which is basic to the image synthesization, can be obtained by a correlation calculation using an arbitrary point as a feature point. Therefore, the motion amount can be accurately detected also from an object having a white background, such as a document. The invention is superior in accuracy to the representative point method. [0059]
• As compared with the block matching method, the amount of calculation can be reduced by two orders of magnitude, and hence it is possible to perform real-time processing at the frame rate. [0060]
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Other and further objects, features, and advantages of the invention will be more explicit from the following detailed description taken with reference to the drawings wherein: [0061]
  • FIG. 1 is a block diagram showing the configuration of an image synthesis and communication apparatus of a first embodiment of the invention; [0062]
  • FIG. 2 is a block diagram showing the configuration of an image synthesis and communication apparatus of a second embodiment of the invention; [0063]
  • FIG. 3 is a block diagram showing the configuration of an image synthesis and communication apparatus of a third embodiment of the invention; [0064]
  • FIG. 4 is a view showing images which are continuously taken in the image synthesization of the invention; [0065]
• FIG. 5 is a block diagram of a feature point extracting section 3 of the image synthesis and communication apparatus of the invention; [0066]
  • FIG. 6 is a diagram illustrating an advantage of a technique of the invention in which a point of a high luminance gradient is selected; [0067]
• FIG. 7 is a diagram illustrating the configuration of a correlation calculating section 5 of the image synthesis and communication apparatus of the invention; [0068]

• FIG. 8 is a block diagram of a search range determining section 4 of the image synthesis and communication apparatus of the invention; [0069]
  • FIG. 9 is a diagram illustrating a procedure of determining a search range for a correlation calculation in the invention; [0070]
  • FIG. 10 is a diagram illustrating a procedure of detecting an affine transformation in the invention; [0071]
  • FIG. 11 is a diagram illustrating a problem of the block matching method in the invention; [0072]
  • FIG. 12 is a flowchart of a synthesizing process in the invention; [0073]
  • FIG. 13 is a diagram illustrating a manner of a synthesized image in the invention; [0074]
  • FIG. 14 is a timing chart illustrating states of signal lines in the case where data are transmitted in the invention; [0075]
  • FIG. 15 is a timing chart illustrating an operation of transmitting a synthesized image in the invention; [0076]
  • FIG. 16 is a diagram illustrating an operation of transmitting an updated image in the invention; [0077]
  • FIG. 17 is a flowchart illustrating a procedure of determining transmission pixels in the invention; [0078]
  • FIG. 18 is a flowchart illustrating a procedure of storing reception pixels in the invention; [0079]
  • FIG. 19 is a view diagrammatically showing the representative point method of the prior art; and [0080]
  • FIG. 20 is a view diagrammatically showing the block matching method of the prior art.[0081]
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • Now referring to the drawings, preferred embodiments of the invention are described below. [0082]
• FIG. 1 is a block diagram showing an image synthesis and communication apparatus of a first embodiment of the invention. The image synthesis and communication apparatus comprises: an imaging section 1 which is realized by an optical system for taking an image, a CCD, or the like; an image storing section 2 which stores image data transferred from the imaging section 1; a feature point extracting section 3 which extracts a point used for a pixel calculation; a search range determining section 4 which determines a search range for a correlation calculation; a correlation calculating section 5 which obtains correlations among pixels and outputs a motion amount; an image synthesizing section 36 which synthesizes an image; an image transmitting section 30 which transmits the synthesized image; a control section 31 which controls the components 1 to 5, 30, and 36; and an image receiving section 32 which receives the synthesized image. [0083]

• The image receiving section 32 may be disposed integrally with the image transmitting section 30 in the image synthesis and communication apparatus which is used mainly as a transmission side. Alternatively, the image receiving section may be independently disposed so as to be separated from the apparatus, or may be disposed in another image synthesis and communication apparatus existing in a remote place. [0084]

• Next, an operation of taking an image of a document in which black characters are written on a white background, while manually moving the imaging section 1, will be described with reference to FIG. 4 showing midway states of the operation. [0085]
  • FIG. 4 shows midway portions of images which are continuously taken at constant time intervals. [0086]
• The reference numerals T1, T2, T3, . . . designate start times of the image capturing. Captured images are indicated by reference numerals F1, F2, F3, . . . . During a period between times T1 and T2, the image F1 is captured through the imaging section 1 and then stored into the image storing section 2. At the same time, the image is sent also to the feature point extracting section 3 and feature points of the image F1 are extracted. [0087]

• Next, during a period between times T2 and T3, the image F2 is captured and then stored into the image storing section 2. At the same time, the image is sent also to the feature point extracting section 3 and all feature points of the image F2 are extracted. [0088]

• During the period between times T2 and T3, a correlation calculation is also performed in the correlation calculating section 5 on all the feature points of the image F1 extracted during the period between times T1 and T2 in the previous frame, and all pixels in the neighborhood of specific positions in the image F2 respectively corresponding to the feature points. The specific positions in the image F2 are corrected by a motion amount which is obtained in the previous frame between times T1 and T2. [0089]

• During the period between times T2 and T3, after the correlation calculation is performed, the motion amount obtained in the correlation calculating section 5 is sent to the image synthesizing section 36 and an image synthesization is conducted. [0090]

• In the same manner, during the periods subsequent to time T3, a process similar to that performed during the period between times T2 and T3 is repeated, while feature points of the current frame are used as reference images, and regions in the neighborhood of specific positions of the subsequent frame are used as search images. [0091]
• Next, an operation of extracting a feature point will be described. FIG. 5 shows the configuration of the feature point extracting section 3. The feature point extracting section 3 comprises: a line memory 7 which concurrently reads adjacent pixels; an adder-subtractor 8 which obtains a luminance gradient; an absolute value calculator 9 which obtains the absolute value of the luminance gradient; a comparator 10 which judges whether the luminance gradient exceeds a threshold or not; a feature point information register 11 which stores the luminances and coordinates of obtained feature points; and a search range controlling section 12 which controls the search range. [0092]
  • An advantage of a technique in which a feature point of a high luminance gradient is selected will be described with reference to FIG. 6. A high luminance gradient means a large difference between adjacent pixels and corresponds to an edge portion of a character in a document image. It is assumed that the absolute value of a difference is used in a correlation calculation. [0093]
• If a point of a low luminance gradient, such as a portion of a background, is set as a feature point, the luminances of the adjacent pixels are substantially equal to one another. Therefore, a difference is not produced in the subtraction results over a certain range. [0094]
• By contrast, if a point of a high luminance gradient is set as a feature point, the values of adjacent pixels are different from each other, and hence a difference is produced in the subtraction results over a certain range. Specifically, when a feature point P1 of FIG. 6, at which the luminance changes only slightly, is employed and subtraction is performed with respect to a search range S1, results are obtained as shown in the graph on the lower left side. The minimum value is obtained at most points, and hence it is difficult to determine a motion amount. By contrast, when a feature point P2 of FIG. 6 is employed and subtraction is performed with respect to a search range S2, results are obtained as shown in the graph on the lower right side. As a result, the candidates for the motion amount are substantially restricted to two points. This means that the candidates for the motion amount can be further restricted by employing other feature points and performing an accumulative addition of the subtraction results with respect to the search range, for each pixel of the search range. [0095]
• Referring again to FIG. 5, the operation of extracting a feature point will be described. In the invention, the feature point extraction is performed on the basis of a judgement whether the luminance gradient exceeds a certain threshold or not, i.e., whether the absolute difference of adjacent pixels exceeds the certain threshold or not. If the difference exceeds the certain threshold, the luminance and coordinates of the feature point are transferred to the search range determining section 4. When a feature point is to be detected, data of each pixel are first read into the line memory 7 in synchronization with the data transfer from the imaging section 1 to the image storing section 2. [0096]
• The line memory 7 has a buffer for one line and is configured so that a certain pixel and its 4-neighboring pixels are simultaneously referred to by the adder-subtractor 8. The adder-subtractor 8 obtains the difference between adjacent pixel values and the absolute value calculator 9 obtains the absolute value of the difference. The absolute value is transferred to the comparator 10. The comparator 10 judges whether the absolute value is larger than the threshold or not. The luminance and coordinates of an applicable pixel, and a feature point number indicative of the order of extraction of the feature point, are stored into the feature point information register 11. [0097]
• The search range controlling section 12 is used for preventing a feature point which is closer to coordinates already stored in the feature point information register 11 than the extent of the search region from being newly stored. This can be realized by a control in which a feature point on the reference side is uniquely determined for an image on the search side. Specifically, such a control can be realized by obtaining the distances in the x and y directions between the coordinates of the feature point stored in the feature point information register 11 and those of the feature point to be obtained, and, when the distance is equal to or smaller than a fixed value, not setting the point as a feature point. According to this configuration, the feature point extraction can be performed in parallel with the operation of capturing an image. [0098]
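• A software sketch of this extraction logic is given below; it is a hedged approximation (the threshold value, the choice of neighbours, and the fixed minimum distance are assumptions), using the absolute difference to the left and upper neighbours as the luminance gradient and skipping points too close to an already stored feature point, as the search range controlling section does:

```python
def extract_feature_points(img, threshold=40, min_dist=8, max_points=25):
    """Store (number, x, y, luminance) for points whose absolute difference
    to an adjacent pixel exceeds the threshold -- a software analogue of
    the line memory 7, adder-subtractor 8, absolute value calculator 9,
    comparator 10, and feature point information register 11."""
    feats = []
    h, w = len(img), len(img[0])
    for y in range(1, h):
        for x in range(1, w):
            gx = abs(int(img[y][x]) - int(img[y][x - 1]))   # horizontal gradient
            gy = abs(int(img[y][x]) - int(img[y - 1][x]))   # vertical gradient
            if max(gx, gy) <= threshold:
                continue
            if any(abs(x - fx) <= min_dist and abs(y - fy) <= min_dist
                   for _, fx, fy, _ in feats):
                continue                # too close to a stored feature point
            feats.append((len(feats), x, y, int(img[y][x])))
            if len(feats) >= max_points:
                return feats
    return feats
```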
• Next, operations of determining a search range for a correlation calculation and performing a correlation calculation will be described with reference to FIGS. 7 and 8. FIG. 7 shows a feature point obtained by the feature point extracting section 3, and a concept of a correlation calculation. The upper left image is the image F1 which is taken at time T1, and the upper right image is the image F2 which is taken at time T2. [0099]

• FIG. 8 is a block diagram showing the search range determining section 4. The search range determining section 4 comprises: a coordinate register 13 which stores coordinates of feature points transferred from the feature point extracting section 3; a previous-frame motion amount storing section 14 which stores the motion amount obtained by the correlation calculating section 5; an address generator 15 which generates coordinates corrected by the motion amount of the previous frame; and an address converter 16 which performs a conversion process on the basis of the values of the address generator 15 and the coordinate register 13, to obtain an address in which the upper right end point of the search range is set as the origin, and a feature point number. [0100]
• First, during a period between times T1 and T2, all feature points of the image F1 are obtained, and, at time T2, stored into the coordinate register 13. Next, during a period between times T2 and T3, the image F2 is sent from the imaging section 1 to the image storing section 2 and feature points of the image F2 are obtained. At the same time, the value which is corrected by the motion amount of the previous frame in the address generator 15 is sent to the address converter 16. At the timing of starting the imaging operation, the initial value of a motion amount is set to be 0. For example, the address converter 16 is configured so as to have difference circuits and comparison circuits of a number which is equal to that of the feature points. According to this configuration, one of the feature points is designated, the relative position in the search range is obtained, and the results are sent, in synchronization with the luminance of the corresponding address, to the correlation calculating section 5. Referring again to FIG. 7, the feature point extracting section 3 determines the middle point of a stroke portion of the displayed Chinese character (shown in FIG. 7) as a feature point, and the address of the feature point is sent to the coordinate register 13. The address generator 15 converts the address into an address which is corrected by the motion amount. This results in the generation of an image F2′ of FIG. 7. [0101]
• As a result, when the motion amount of the current frame is equal to that of the previous frame, a point at the same address as that of the feature point should be the corresponding point. It is preferable to perform the correlation calculation while searching the neighborhood of the region which is corrected by the motion amount of the previous frame. Therefore, the address converter 16 generates an image of M×N pixels which is centered at the same coordinates as those of the feature point. This corresponds to the lower right image in FIG. 7. Results of a correlation calculation on the luminance of the feature point of the image F1 and the luminances of the M×N pixels of the image F2′ captured from the imaging section are outputted. For each of the other feature points of the same frame, similarly, an image of M×N pixels is generated and a correlation calculation is performed. [0102]
• The correlation calculating section 5 will be described. For example, the correlation calculating section 5 obtains (x, y) satisfying the following expressions: [0103]

$$\{\min(F(x, y)) \mid 0 \le x < M,\ 0 \le y < N\} \quad (1)$$

$$F(x, y) = \sum_{i=0}^{k} \left| S_{(x,y)}(i) - R(i) \right| \quad (2)$$
  • where k is the number of the feature points, M is the search width of a motion amount in the lateral direction, N is the search width of a motion amount in the vertical direction, S(x, y)(i) is the luminance of a pixel on the search side and corresponding to an i-th feature point in the case where the motion amount is (x, y), and R(i) is the luminance of feature point i. [0104]
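• A direct, unoptimized reading of expressions (1) and (2) can be sketched as follows (the function and argument names are assumptions; `feats` holds the (number, x, y, luminance) tuples from the extraction step, and the search range is centred on the feature point coordinates corrected by the previous frame's motion amount, assuming NumPy arrays):

```python
import numpy as np

def detect_motion(feats, search_img, M=20, N=20, prev_motion=(0, 0)):
    """Accumulate F(x, y) of expression (2) for every candidate shift in
    the M x N search range and return the shift minimising it, i.e.
    expression (1)."""
    h, w = search_img.shape
    px, py = prev_motion
    F = np.zeros((N, M))
    for y in range(N):
        for x in range(M):
            dx, dy = x - M // 2 + px, y - N // 2 + py   # candidate motion
            for _, fx, fy, lum in feats:
                xs, ys = fx + dx, fy + dy
                if 0 <= xs < w and 0 <= ys < h:
                    F[y, x] += abs(int(search_img[ys, xs]) - lum)
    y, x = np.unravel_index(np.argmin(F), F.shape)      # minimum of expression (1)
    return (x - M // 2 + px, y - N // 2 + py)
```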
• In the case where the motion amount is (x, y), the feature point coordinates (xR, yR) of the image F1 and the coordinates (xT, yT) of a pixel corresponding to the feature point in the image F2′ have the mutual relationships xT = xR + x and yT = yR + y. [0105]
• Next, the operation of detecting a motion amount will be described. FIG. 9 is a block diagram of the correlation calculating section 5. The correlation calculating section 5 comprises: a feature point register 17 which stores the luminance of the feature point obtained by the feature point extracting section 3; a pixel calculation section 18 which obtains a correlation of pixels; an accumulation memory 19 which stores an accumulation value; and a minimum value detecting section 20 which obtains the minimum value. [0106]
• Based on the feature point number generated in the search range determining section 4, the corresponding feature point of the feature point register 17 is selected and sent to the pixel calculation section 18. At the same time, a corresponding section of the accumulation memory 19 is selected according to the coordinates in the range. Furthermore, the luminance is given, the correlation value is obtained by a difference-sum operation, and the obtained value is returned to the accumulation memory 19. When the process for one frame is ended, the minimum value detecting section 20 detects the portion of the minimum correlation value from the M×N pixels on the search side of the accumulation memory 19, and outputs it as the motion amount. According to this configuration, a motion amount can be accurately detected in real time. [0107]
• In the invention, the search range can be arbitrarily set, and hence a modification may be made, for example, in the following manner. The section where the coordinates in the range to be outputted from the search range determining section 4 are generated is slightly changed. As shown in FIG. 10, rectangular regions L1 and L2 are set on the reference side. For each of the rectangular regions L1 and L2, extraction of a feature point is performed and the search range is determined. A correlation calculation is performed on each region, and two motion amounts are detected. As a result, it is possible to obtain the parameters of an affine transformation including rotation, expansion, and reduction. Specifically, as shown in FIG. 10, a feature amount is obtained from each of the rectangular regions L1 and L2, a correlation calculation with respect to the image on the search side is performed, and rectangular regions R1 and R2 which respectively coincide with the rectangular regions L1 and L2 are obtained. [0108]
• The center coordinates of the rectangular regions L1 and L2 are (X(F1, 1), Y(F1, 1)) and (X(F1, 2), Y(F1, 2)), and those of the rectangular regions R1 and R2 are (X(F2, 1), Y(F2, 1)) and (X(F2, 2), Y(F2, 2)). When arbitrary coordinates on the reference side are indicated by (XL, YL) and those on the search side by (XR, YR), the coordinates are mutually affine-transformed by the following expressions: [0109]

$$\begin{bmatrix} X_R \\ Y_R \end{bmatrix} = \frac{1}{X_{F1,1} Y_{F1,2} - X_{F1,2} Y_{F1,1}} \begin{bmatrix} X_{F2,1} Y_{F1,2} - X_{F2,2} Y_{F1,1} & -X_{F2,1} X_{F1,2} + X_{F2,2} X_{F1,1} \\ Y_{F2,1} Y_{F1,2} - Y_{F2,2} Y_{F1,1} & -Y_{F2,1} X_{F1,2} + Y_{F2,2} X_{F1,1} \end{bmatrix} \begin{bmatrix} X_L \\ Y_L \end{bmatrix} \quad (3)$$

• (where X_{F1,1} has the same meaning as X(F1, 1) in the description) [0110]

$$\begin{bmatrix} X_L \\ Y_L \end{bmatrix} = \frac{1}{X_{F2,1} Y_{F2,2} - X_{F2,2} Y_{F2,1}} \begin{bmatrix} X_{F1,1} Y_{F2,2} - X_{F1,2} Y_{F2,1} & -X_{F1,1} X_{F2,2} + X_{F1,2} X_{F2,1} \\ Y_{F1,1} Y_{F2,2} - Y_{F1,2} Y_{F2,1} & -Y_{F1,1} X_{F2,2} + Y_{F1,2} X_{F2,1} \end{bmatrix} \begin{bmatrix} X_R \\ Y_R \end{bmatrix} \quad (4)$$
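• These expressions amount to solving for the 2×2 matrix that maps the two reference-side region centres onto the two search-side centres. A minimal sketch, assuming NumPy and hypothetical names, computes the matrix of expression (3) as M = [r1 r2][l1 l2]⁻¹ with the centres as column vectors:

```python
import numpy as np

def affine_from_two_points(l1, l2, r1, r2):
    """Matrix of expression (3): maps the reference-side centres l1, l2 of
    L1, L2 onto the search-side centres r1, r2 of R1, R2. The determinant
    of the left-hand matrix is X_F1,1 * Y_F1,2 - X_F1,2 * Y_F1,1."""
    L = np.array([[l1[0], l2[0]], [l1[1], l2[1]]], dtype=float)
    R = np.array([[r1[0], r2[0]], [r1[1], r2[1]]], dtype=float)
    return R @ np.linalg.inv(L)

# Expression (4) is simply the inverse mapping, i.e. the result of
# affine_from_two_points(r1, r2, l1, l2).
```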
  • Next, the calculation time period in the case where the invention is employed will be described in comparison with the block matching method of the prior art. [0111]
  • In the feature point method, the time period required for detecting feature points is proportional to the area of an image to be referenced, and that required for a correlation calculation is proportional to the area of the search region and the number of the feature points. [0112]
  • When the area of an image to be referenced is 320 pixels×240 pixels, the number of feature points to be detected is 25, and the search range is 20 pixels×20 pixels, the time period required for a calculation is 78,000×α+10,000×β where α is the calculation amount per pixel in the case where a luminance gradient is obtained and β is the calculation amount per pixel for obtaining correlation between pixels. [0113]
  • By contrast, in the case where the block matching method is used, the time period is 20×20×320×240 β=30,720,000 β. Assuming that α and β are calculation amounts which are substantially equal to each other, the time period required for a calculation in the feature point method is 88,000 β. As a result, the block matching method requires a calculation amount which is 300 or more times that required in the feature point method. [0114]
• In the block matching method also, the calculation amount can be reduced to a comparable degree by observing the luminance change and restricting the reference region to about 25 points. However, in a case where, as shown in FIG. 11, identical characters are substantially juxtaposed on the search side, it is difficult to perform the matching and the motion amount cannot be determined, so that the results are extremely impaired. [0115]
  • Next, the image synthesizing section will be described. [0116]
  • FIG. 12 is a flowchart showing the flow of a synthesizing process. [0117]
• As a memory for storing a synthesized image, the image storing section 2 is used. Another region, different from that for storing the image sent from the imaging section 1, is previously allocated to the synthesized image. [0118]

• This other region corresponds to the synthesized-image memory 37 shown in FIG. 13. [0119]

• The upper left end point of the synthesized-image memory 37 is set as the origin. Hereinafter, a pixel at position (x, y) is indicated by f(x, y). The memory can be accessed by designating an address. When the width of the synthesized-image memory 37 is w and the address of the origin is z, the address of f(x, y) is represented by z + y × w + x. [0120]
• The first image F1 is overwritten onto the synthesized-image memory 37 so that the upper left point of F1 is located at (x0, y0) of the synthesized-image memory 37 (step S1). The initial position (x0, y0) is previously determined. It is assumed that a pixel of F1 can be accessed by g1(x, y). In the same manner as f(x, y), the address of each pixel can be calculated by using the width of F1 and the address of the origin. The overwriting operation can be expressed as f(x + x0, y + y0) = g1(x, y) (where 0 ≦ x < w1, 0 ≦ y < h1, w1 is the width of F1, and h1 is the height of F1). [0121]
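• As a hedged illustration of this addressing and of the overwriting of step S1, the following sketch (assumed names; NumPy-style arrays) mirrors the expressions above:

```python
def address(z, w, x, y):
    """Address of pixel f(x, y) in the synthesized-image memory:
    origin address z, memory width w, so address = z + y * w + x."""
    return z + y * w + x

def overwrite_first_image(synth, g1, x0, y0):
    """Step S1: f(x + x0, y + y0) = g1(x, y) for 0 <= x < w1, 0 <= y < h1,
    i.e. copy the first image F1 so that its upper left point lands at
    (x0, y0) of the synthesized-image memory."""
    h1, w1 = g1.shape[:2]
    synth[y0:y0 + h1, x0:x0 + w1] = g1
```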
• Next, the overwriting operation is performed only on the portion of the image F2 which does not overlap with the image F1. FIG. 13 shows a state in which the images F1, F2, and F3 have been overwritten. [0122]
• In the case where the motion amount obtained from the correlation calculating section 5 includes not only parallel movement but also rotation, expansion, and reduction, it is obtained in the form of a matrix of an affine transformation described above. It is assumed that, in the first image F1 and the second image F2, the correspondence positions in the two regions of each of the images, i.e., the rectangular regions L1 and L2, and R1 and R2, are previously obtained. When the center coordinates of the rectangular regions L1 and L2 of the first image F1 are (X(F1, 1), Y(F1, 1)) and (X(F1, 2), Y(F1, 2)) and those of the rectangular regions R1 and R2 of the second image F2 are (X(F2, 1), Y(F2, 1)) and (X(F2, 2), Y(F2, 2)), the following parameters of an affine transformation from the first image F1 to the second image F2: [0123]

$$\frac{1}{X_{F1,1} Y_{F1,2} - X_{F1,2} Y_{F1,1}} \begin{bmatrix} X_{F2,1} Y_{F1,2} - X_{F2,2} Y_{F1,1} & -X_{F2,1} X_{F1,2} + X_{F2,2} X_{F1,1} \\ Y_{F2,1} Y_{F1,2} - Y_{F2,2} Y_{F1,1} & -Y_{F2,1} X_{F1,2} + Y_{F2,2} X_{F1,1} \end{bmatrix} \quad (5)$$

• are obtained by the correlation calculating section 5 in accordance with the above-mentioned expression of an affine transformation (step S4). In order to simplify the notation, hereinafter this matrix is expressed by: [0124]

$$\begin{bmatrix} A_{1,2} & B_{1,2} \\ C_{1,2} & D_{1,2} \end{bmatrix} \quad (6)$$
• When an affine transformation is not performed and only a parallel movement is to be performed, B_{1,2} and C_{1,2} are 0. [0125]
• A pixel position in the second image F2 is indicated by (XF2, YF2), and a corresponding pixel position in the synthesized image to be overwritten is indicated by (X′F2, Y′F2). When the upper left point of the first image F1 is located at (x0, y0) of the synthesized-image memory 37, the transformation expression is defined by the following expression: [0126]

$$\begin{bmatrix} X_{F2} \\ Y_{F2} \end{bmatrix} = \begin{bmatrix} A_{1,2} & B_{1,2} \\ C_{1,2} & D_{1,2} \end{bmatrix} \begin{bmatrix} X'_{F2} - x_0 \\ Y'_{F2} - y_0 \end{bmatrix} \quad (7)$$
• As a result, pixel position (XF2, YF2) in the second image F2 corresponding to position (X′F2, Y′F2) is obtained (step S5). Generally, even when (X′F2, Y′F2) is an integer, (XF2, YF2) is not always an integer. Therefore, pixel g2(XF2, YF2) at position (XF2, YF2) in the second image F2 is usually determined by interpolation of neighboring pixels. In order to obtain the pixel in the simplest manner, the fractional portion of (XF2, YF2) is rounded off so that (XF2, YF2) is an integer, i.e., the most neighboring point is used instead. When the round-off operation is indicated by round( ), the most neighboring pixel position of (XF2, YF2) is indicated by (round(XF2), round(YF2)). [0127]
• Therefore, pixel f(X′F2, Y′F2) at position (X′F2, Y′F2) in the synthesized-image memory 37 may be overwritten by pixel g2(round(XF2), round(YF2)) in the second image F2, which can be indicated as follows (step S6): [0128]

$$f(X'_{F2}, Y'_{F2}) = g_2(\mathrm{round}(X_{F2}), \mathrm{round}(Y_{F2})) = g_2\bigl(\mathrm{round}(A_{1,2}(X'_{F2} - x_0) + B_{1,2}(Y'_{F2} - y_0)),\ \mathrm{round}(C_{1,2}(X'_{F2} - x_0) + D_{1,2}(Y'_{F2} - y_0))\bigr) \quad (8)$$
• However, it must be checked whether position (XF2, YF2) in the second image F2 corresponding to (X′F2, Y′F2) exists in F2 or not. In order to perform the checking operation in the simplest manner, the coordinates (round(XF2), round(YF2)) are calculated and it is checked whether they are within the size of the display region of the second image F2 or not, by using the expressions below. [0129]

• If all the following expressions are satisfied, (X′F2, Y′F2) can obtain the corresponding pixel value of the second image F2: [0130]

$$0 \le \mathrm{round}(X_{F2}) < w_2$$
$$0 \le \mathrm{round}(Y_{F2}) < h_2 \quad (9)$$

• where w2 is the width of the second image F2, and h2 is the height of the second image F2. [0131]
• By using the expressions, it is checked whether the corresponding pixel value of the second image F2 can be obtained at all positions of the synthesized image or not (step S7). Thereafter, it is checked whether, in the synthesized image, the pixel overlaps with the first image F1 which is the previous image or not. This can be done by checking whether the following expressions are satisfied or not (step S11): [0132]

$$x_0 \le X'_{F2} < x_0 + w_1$$
$$y_0 \le Y'_{F2} < y_0 + h_1 \quad (10)$$
• If all the above expressions are satisfied, the pixel overlaps with the first image F1. Therefore, the overwriting operation is not performed. As a result, only a portion of the second image F2 which does not overlap with the first image F1 is overwritten onto the synthesized image (step S12). Next, the overwriting operation of the third image F3 will be described. A pixel position in the third image F3 is indicated by (XF3, YF3), a corresponding pixel position in the synthesized image to be overwritten is indicated by (X′F3, Y′F3), and the motion information between the second and third images F2 and F3 is indicated by: [0133]

$$\begin{bmatrix} A_{2,3} & B_{2,3} \\ C_{2,3} & D_{2,3} \end{bmatrix} \quad (11)$$
• The values of A_{2,3}, B_{2,3}, C_{2,3}, and D_{2,3} may be obtained in the same manner as the motion information between the first and second images F1 and F2. By using these values, the relationship between (XF3, YF3) and (X′F3, Y′F3) is defined by the following expression (steps S4 and S5): [0134]

$$\begin{bmatrix} X_{F3} \\ Y_{F3} \end{bmatrix} = \begin{bmatrix} A_{2,3} & B_{2,3} \\ C_{2,3} & D_{2,3} \end{bmatrix} \begin{bmatrix} A_{1,2} & B_{1,2} \\ C_{1,2} & D_{1,2} \end{bmatrix} \begin{bmatrix} X'_{F3} - x_0 \\ Y'_{F3} - y_0 \end{bmatrix} \quad (12)$$
• The overwriting operation of the third image F3 is indicated by the above expression and the following expression (steps S6 and S12): [0135]

$$f(X'_{F3}, Y'_{F3}) = g_3(\mathrm{round}(X_{F3}), \mathrm{round}(Y_{F3})) \quad (13)$$
• The judgement on which pixel in the synthesized image is to be overwritten can be performed in the same manner as in the case of the second image F2. First, it is checked whether the point in the synthesized image is within the third image F3 or not, by using the following expressions (step S7): [0136]

$$0 \le \mathrm{round}(X_{F3}) < w_3$$
$$0 \le \mathrm{round}(Y_{F3}) < h_3 \quad (14)$$

• where w3 is the width of the third image F3, and h3 is the height of the third image F3. [0137]
• Next, it is checked whether the pixel overlaps with the second image F2 which is the previous image or not, by using the following expressions. The position in F2 corresponding to the position (X′F3, Y′F3) in the synthesized image is indicated by (X″F3, Y″F3). The relationship between (X′F3, Y′F3) and (X″F3, Y″F3) is strictly identical with that between the synthesized image and the second image F2, and is indicated by the following expression (steps S8 and S9): [0138]

$$\begin{bmatrix} X''_{F3} \\ Y''_{F3} \end{bmatrix} = \begin{bmatrix} A_{1,2} & B_{1,2} \\ C_{1,2} & D_{1,2} \end{bmatrix} \begin{bmatrix} X'_{F3} - x_0 \\ Y'_{F3} - y_0 \end{bmatrix} \quad (15)$$
• Therefore, corresponding position (X″F3, Y″F3) in the second image F2 can be obtained from (X′F3, Y′F3). It is checked whether (X″F3, Y″F3) is within the second image F2 or not, by using the following expressions: [0139]

$$0 \le \mathrm{round}(X''_{F3}) < w_2$$
$$0 \le \mathrm{round}(Y''_{F3}) < h_2 \quad (16)$$
• If all the above expressions are satisfied, the position is within the second image F2, and the portion of the third image F3 overlaps with the second image F2 (steps S10 and S11). Finally, each of the points of the synthesized image (steps S13 and S14) is checked whether it corresponds to the third image F3 and does not correspond to the second image F2. A point which passes the check is judged to be overwritable, and the point is overwritten by the pixel value of the third image F3 (step S12). The overwriting operations of the fourth image F4 and the subsequent images are repeated in the same manner, except that the number of transformation matrices is increased as shown in the following example: [0140]

$$\begin{bmatrix} X_{F4} \\ Y_{F4} \end{bmatrix} = \begin{bmatrix} A_{3,4} & B_{3,4} \\ C_{3,4} & D_{3,4} \end{bmatrix} \begin{bmatrix} A_{2,3} & B_{2,3} \\ C_{2,3} & D_{2,3} \end{bmatrix} \begin{bmatrix} A_{1,2} & B_{1,2} \\ C_{1,2} & D_{1,2} \end{bmatrix} \begin{bmatrix} X'_{F4} - x_0 \\ Y'_{F4} - y_0 \end{bmatrix} \quad (17)$$
• The above-described procedure is performed on all the images in time series (steps S15, S16, and S17). [0141]

• In this way, images are overwritten in accordance with the motion amount obtained from the correlation calculating section 5, thereby obtaining a synthesized image such as that shown in FIG. 13. [0142]
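• Pulling the steps above together, a rough sketch of the overwriting loop is shown below (assumed names; NumPy arrays; nearest-neighbor rounding as in the text). `M_total` is the accumulated matrix product as in expression (17), mapping synthesized coordinates (less the origin offset) into the current frame; `M_prev` is the same product without the newest matrix, mapping into the previous frame (the identity matrix when the current frame is F2):

```python
import numpy as np

def overwrite_frame(synth, g_curr, M_total, M_prev, prev_shape, x0, y0):
    """Steps S5-S14: overwrite into the synthesized image only the pixels
    of the current frame that fall inside it and do not overlap the
    previous frame."""
    hc, wc = g_curr.shape[:2]
    hp, wp = prev_shape
    H, W = synth.shape[:2]
    for Yp in range(H):
        for Xp in range(W):
            v = np.array([Xp - x0, Yp - y0], dtype=float)
            xc, yc = np.rint(M_total @ v).astype(int)   # position in current frame
            if not (0 <= xc < wc and 0 <= yc < hc):
                continue                                # outside current frame
            xq, yq = np.rint(M_prev @ v).astype(int)    # position in previous frame
            if 0 <= xq < wp and 0 <= yq < hp:
                continue                                # overlaps previous frame
            synth[Yp, Xp] = g_curr[yc, xc]              # step S12: overwrite
```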
• Next, the image transmitting section 30 and the image receiving section 32 will be described. [0143]

• The image transmitting section 30 receives synthesized image information from the image synthesizing section 36, and transmits the information. [0144]

• Various kinds of communication lines are available as a communication line 40. In the embodiment, the communication line is an example of a simple serial connection consisting of only five lines for a clock signal, an enable signal, a data signal, a ground voltage, and a supply voltage (+5 V). The ground voltage and the supply voltage are constant independent of time. In the timing chart of FIG. 14, therefore, only the clock signal, the enable signal, and the data signal are shown. The polygonal lines respectively corresponding to the signals show the voltage levels of the signals. The lower side is the ground level, and the upper side is the supply voltage level. Hereinafter, an operation of setting the voltage of a signal to the supply voltage level is expressed by the term “raise,” an operation of setting the voltage to the ground level is expressed by the term “lower,” the ground level is expressed as “Low,” and the supply voltage level is expressed as “High.” [0145]
  • In the clock signal, the Low and High states alternatingly appear at equal time intervals. [0146]
• There are many kinds of image communication formats. Hereinafter, the simplest format will be described. It is assumed that the synthesized image has a width of W and a height of H, the upper left end point of the synthesized image is set as the origin, the rightward direction from the origin is the +X direction, and the downward direction is the +Y direction. Each pixel consists of the three primary colors of RGB. For a pixel at position (x, y) in the synthesized image, the R component is indicated by fR(x, y), the G component by fG(x, y), and the B component by fB(x, y). The value of each component is expressed by one byte. [0147]
• First, as shown in FIG. 14, the width W of the image is transmitted. The value of W consists of 2 bytes, and the bits are transmitted in the sequence of the most significant bit (bit15) to the least significant bit (bit0). When the transmission side is not in the transmission enabled state, the enable signal is raised as shown at time T0 in the figure. When data are to be transmitted, the enable signal is lowered. The timing when the enable signal is lowered is set to be coincident with a timing when the clock signal is lowered, such as time T1. Before data are transmitted, a start bit is transmitted. In other words, the signal level is set to be Low at time T2, High at time T3, and Low at time T4. [0148]

• The values of 0 and 1 of the bits indicative of W are set to be Low and High, respectively. The bits from bit15 to bit0 are sequentially set at a falling edge of the clock signal. In the same manner as the start bit, a stop bit is then transmitted. Finally, the enable signal is raised, and the transmission of the data set is ended. [0149]
  • Next, the height H of the image is transmitted. The procedure of the transmission is strictly identical with that of W. Namely, data of the height H are transmitted in place of the data of W. [0150]
• Next, data of the pixels are transmitted. FIG. 15 is a view illustrating the transmission. The timings of the clock signal and the enable signal are identical with those of FIG. 14, and hence these signals are not shown. The time advances in the direction from the left side to the right side, and from the top to the bottom. In the same manner as W and the like, data are transmitted interposed between the start and stop bits. Since each of the R, G, and B components consists of 8-bit data, data of each pixel are transmitted in the form of three separate blocks, i.e., in the sequence of R, G, and B. [0151]
• Therefore, the transmission of all the pixels can be realized by performing the transmission of 8-bit data (H×W×3) times. The sequence of the transmission of the pixels is previously determined. For example, fR(0, 0) is first transmitted, and fG(0, 0) and fB(0, 0) are then transmitted in sequence. Thereafter, fR(1, 0), fG(1, 0), and fB(1, 0) are transmitted, and fR(2, 0), fG(2, 0), and fB(2, 0) are then transmitted. When fB(W−1, 0) at the right end is transmitted, data of the next line are then transmitted in the sequence starting from fR(0, 1) at the left end, in the same manner. The data which is finally transmitted is fB(W−1, H−1). [0152]
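• The framing described above can be sketched as follows (the names are assumptions; the enable-signal handshake is omitted and only the data-signal levels at successive falling clock edges are produced):

```python
def frame_bits(value, nbits):
    """One data set: start bit sequence (Low, High, Low), then the value's
    bits from the most significant to the least significant, then the
    identical stop bit sequence."""
    start_stop = [0, 1, 0]
    data = [(value >> i) & 1 for i in range(nbits - 1, -1, -1)]
    return start_stop + data + start_stop

def frame_image(pixels, W, H):
    """W and H as 16-bit values, then (H x W x 3) transmissions of 8-bit
    data: the R, G, B bytes of each pixel in raster order."""
    bits = frame_bits(W, 16) + frame_bits(H, 16)
    for y in range(H):
        for x in range(W):
            r, g, b = pixels[y][x]
            for comp in (r, g, b):
                bits += frame_bits(comp, 8)
    return bits
```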
  • In the reception side, operations opposite to those described above are performed. [0153]
  • First, the enable signal is monitored. When the enable signal becomes Low, the value of the signal is read at a rising edge of the clock signal. After the reading of the signal is started, it is confirmed whether the signal is changed in the sequence of Low, High, and Low, i.e., the start bit is transmitted or not. Next, data of bits of a predetermined number are read. Then, it is confirmed whether the signal is changed in the sequence of Low, High, and Low, i.e., the stop bit is transmitted or not. When the enable signal becomes High, the reading of the value of the signal is ended. [0154]
• Specifically, with respect to the first and second data, the predetermined bit number is 16 because the data indicative of the total width and height consist of 16 bits each. With respect to the other data, the predetermined bit number is 8 because these data are pixel data. As a result of the two data receptions, the total width W and height H are known. Then, the reception of 8-bit data is performed (W×H×3) times. The received pixel data are interpreted in a predetermined sequence. In the example described above, fR(0, 0) is first interpreted, and thereafter fG(0, 0), fB(0, 0), and fR(1, 0) are interpreted in this sequence. When fB(W−1, 0) at the right end is interpreted, data of the next line are then interpreted in the sequence starting from fR(0, 1) at the left end, in the same manner. The data which is finally interpreted is fB(W−1, H−1). [0155]
  • As a result, the transmission and reception of the synthesized image are enabled. [0156]
  • [Embodiment 2][0157]
• FIG. 2 is a block diagram showing the configuration of an image synthesis and communication apparatus of a second embodiment of the invention. The image synthesis and communication apparatus comprises: an imaging section 1 which is realized by an optical system for taking an image, a CCD, or the like; an image transmitting section 30 which transmits an image transferred from the imaging section 1; a control section 33 which controls the components of the transmission side; an image receiving section 32 which receives the image transmitted from the image transmitting section 30; an image storing section 2 which stores image data transferred from the image receiving section 32; a feature point extracting section 3 which extracts a point used for a pixel calculation; a search range determining section 4 which determines a search range for a correlation calculation; a correlation calculating section 5 which obtains correlations among pixels and outputs a motion amount; an image synthesizing section 36 which synthesizes an image; and a control section 31 which controls the components of the reception side. [0158]

• The imaging section 1, the image transmitting section 30, and the control section 33 which are shown in FIG. 2 may be disposed integrally with or separately from the image synthesis and communication apparatus on the image information reception side. Alternatively, these sections may be disposed in another image synthesis and communication apparatus. [0159]

• In the embodiment, images taken by the imaging section 1 are transmitted and received as they are. The transmission and reception can also be realized by a communication method which is strictly identical with that used in the above-described transmission and reception of a synthesized image. [0160]

• Namely, the width W and the height H of an image taken by the imaging section 1 are first transmitted, and pixel information of (W×H×3) bytes is then transmitted. [0161]

• The image receiving section 32 also processes received data in strictly the same manner as the image receiving section 32 of the first embodiment. [0162]

• The images received by the image receiving section 32 are stored in the image storing section 2, and then synthesized together by the image synthesizing section 36. The synthesization may be performed in the same manner as that of the first embodiment. [0163]
  • [Embodiment 3][0164]
• FIG. 3 is a block diagram showing the configuration of an image synthesis and communication apparatus of a third embodiment of the invention. The image synthesis and communication apparatus comprises: an imaging section 1 which is realized by an optical system for taking an image, a CCD, or the like; an image storing section 2 which stores image data transferred from the imaging section 1; a feature point extracting section 3 which extracts a point used for a pixel calculation; a search range determining section 4 which determines a search range for a correlation calculation; a correlation calculating section 5 which obtains correlations among pixels and outputs a motion amount; an update image calculating section 35 which calculates a portion (update image) that does not overlap with the previous image, on the basis of the motion amount obtained by the correlation calculating section 5 and the image data of the image storing section 2; an image transmitting section 30 which transmits the motion amount obtained from the correlation calculating section 5, and the update image obtained by the update image calculating section 35; a control section 31 which controls the components of the transmission side; an image receiving section 32 which receives the motion amount and the update image transmitted from the image transmitting section 30; an image storing section 34 which stores image data transferred from the image receiving section 32; an image synthesizing section 36 which synthesizes an image on the basis of the images of the image storing section 34 and the motion amount obtained from the image receiving section 32; and a control section 33 which controls the components of the reception side. [0165]
• FIG. 16 is a diagram illustrating the format of the transmission and reception information of the image synthesis and communication apparatus of the third embodiment of the invention. The information to be transmitted and received includes the motion amount obtained from the correlation calculating section 5, and the partial image obtained from the update image calculating section 35. In an actual transmission, in the same manner as the first embodiment, lines such as an enable signal, a clock signal, a supply voltage, and a ground level, as well as a start bit and a stop bit in the data signal serving as the transmission sequence, are required. The procedure is identical with that of the first embodiment, and hence only the data portion is illustrated. [0166]

• The motion amount may be the values of the 2×2 matrix which has been used in the description of the image synthesization. When the synthesization is to be performed without rotation, expansion, and reduction, using only a parallel movement amount, only the shift amounts in the X and Y directions are required. [0167]

• First, as shown in FIG. 16, the values of a matrix for performing an affine transformation between the previous image and the current image are transmitted. In the case of the third image F3, for example, a transformation between the second image F2 and the third image F3 is to be performed. When the expression form described in the above example is used, A_{2,3}, B_{2,3}, C_{2,3}, and D_{2,3} are sequentially transmitted. The bit number depends on the required accuracy. When the data are expressed in floating-point form, 32 bits are sufficient. [0168]
  • After parameters are transmitted, the width and height of an image are transmitted. In the embodiment, these values are 2-byte data. [0169]
• Next, a partial image obtained from the update image calculating section 35 is transmitted. The update image calculating section 35 calculates a portion which does not overlap with the previous image. The manner of the calculation is substantially identical with the method described in conjunction with the image synthesizing section 36. In the example of the third image F3 and the second image F2, when a point in the third image F3 is indicated by (x, y) and the point of the second image F2 corresponding to that point is indicated by (x′, y′), the transformation expression from (x, y) to (x′, y′) is as follows. In the expression, (x1, y1) and (x2, y2) are points in the third image F3 and correspond to points (x′1, y′1) and (x′2, y′2) in the second image F2, respectively. [0170]

$$\begin{bmatrix} x' \\ y' \end{bmatrix} = \frac{1}{x_1 y_2 - x_2 y_1} \begin{bmatrix} x'_1 y_2 - x'_2 y_1 & -x'_1 x_2 + x'_2 x_1 \\ y'_1 y_2 - y'_2 y_1 & -y'_1 x_2 + y'_2 x_1 \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix} \quad (18)$$
• From the above, the position in the second image F2 to which a point in the third image F3 is to be transferred is known. Therefore, when both the following expressions: [0171]

$$0 \le \mathrm{round}(x') < w_2$$
$$0 \le \mathrm{round}(y') < h_2 \quad (19)$$

• are satisfied, the points overlap with each other. This process is performed on all points in the third image F3, and it is then possible to know which points in the third image F3 do not overlap with the second image F2. Hereinafter, a point which does not overlap with the previous image is called a transmission pixel, and one which overlaps with the previous image is called a nontransmission pixel. When all transmission pixels are to be transmitted, the transmission must be performed so that the reception side can distinguish a transmission pixel from a nontransmission pixel. Alternatively, a mask image may be separately transmitted. In that alternative, however, the amount of data to be transmitted is increased. Therefore, the transmission is performed by using the feature that the regions which do not overlap with the previous image are usually continuous regions. [0172]
  • FIG. 17 is a flowchart illustrating a procedure of judging a transmission pixel and a nontransmission pixel. [0173]
  • First, the position “judgement point” where the judgement on a transmission pixel and a nontransmission pixel is started is set to be the upper left point of the third image F[0174] 3 (step S21). Next, “number of pixels before the next transmission pixel” in the line is counted in the rightward direction as seen from the judgement point (step S22). When the judgement point is a transmission pixel, the number is 0. When the judgement point is a nontransmission pixel, a p number of nontransmission pixels are continuous, and a transmission pixel exists after the nontransmission pixels, “number of pixels before the next transmission pixel” is p. If there is no transmission pixel after a judgement point in the line, the number of remaining pixels in the line is set to be “number of pixels before the next transmission pixel.” When “number of pixels before the next transmission pixel” is obtained, the number is expressed as a 2-byte data and then transmitted.
  • Next, the judgement point is rightward advanced by “number of next transmission pixels” (step S[0175] 23). If it is judged in step S24 that, at this timing, the judgement point reaches the right end of the line, the control proceeds to step S25. If the line is not the last line, the control proceeds to step S26. The left end point of the next line is set to be the judgement point, and the control then returns to the process of step s22 of counting “number of pixels before the next transmission pixel.” In the example of FIG. 16, after “value (t) before the next transmission pixel” is transmitted, the judgement point is jumped to the next line. If it is judged in step S24 that the judgement point has not yet reached the right end, the control proceeds to step S27. The judgement point is set to be the next transmission pixel, and “number of pixels before the next nontransmission pixel” is counted in the rightward direction as seen from the judgement point. If a q number of transmission pixels are continuous, the number is q. Then, “number of pixels before the next nontransmission pixel” is expressed as a 2-byte data and then transmitted (step S27).
• Next, the data of those pixels, i.e., the R, G, and B values of each of the counted pixels, are sequentially transmitted. In the example of FIG. 16, data starting from R(p, 0) and ending at B(p+q−1, 0) are transmitted (step S28). [0176]
• If it is judged in step S29 that, at this timing, the judgement point reaches the right end of the line, and the line is not the last line, the left end point of the next line is set to be the judgement point, and the control then returns to step S25. If the judgement point has not yet reached the right end, the judgement point is set to be the next nontransmission pixel, and the control then returns to the initial process, i.e., the process in step S22 of counting “number of pixels before the next transmission pixel.” If it is judged in step S25 that the judgement point reaches the right end of the last line, the whole process is ended. In this way, an image of the portion which does not overlap with the previous image can be transmitted. [0177]
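The serialization of FIG. 17 can be sketched as follows. Two assumptions are made beyond the patent text: run lengths are written as big-endian values (the patent specifies only 2-byte data, so the byte order is an assumption), and `send` is any callable that consumes bytes, such as a socket's sendall. The function name `encode_update_image` is hypothetical.

```python
import struct

def encode_update_image(mask, rgb, send):
    """Serialize the transmission pixels of one frame in the order of
    FIG. 17.  mask[y][x] is True for transmission pixels, rgb[y][x] is
    an (R, G, B) triple, and send consumes bytes.  Run lengths are
    sent as 2-byte big-endian values."""
    h, w = len(mask), len(mask[0])
    for y in range(h):
        x = 0
        while x < w:
            # "number of pixels before the next transmission pixel"
            skip = 0
            while x + skip < w and not mask[y][x + skip]:
                skip += 1
            send(struct.pack(">H", skip))
            x += skip
            if x >= w:
                break  # no transmission pixel left: jump to next line
            # "number of pixels before the next nontransmission pixel"
            run = 0
            while x + run < w and mask[y][x + run]:
                run += 1
            send(struct.pack(">H", run))
            for i in range(x, x + run):  # pixel data in R, G, B order
                send(bytes(rgb[y][i]))
            x += run
```

Each iteration of the inner loop mirrors one pass through steps S22 to S28: a nontransmission run length, then a transmission run length, then the pixel data of that run.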
• The reception side performs operations which are the reverse of those described above. FIG. 18 is a flowchart illustrating a procedure of storing received pixels. [0178]
• It is assumed that the values of the transformation matrix, and image data including the width and the height, have already been received. [0179]
• A region for storing received images is previously reserved in the image storing section 34 on the reception side in FIG. 3. The size of the region is determined on the basis of the width and height of the images which have already been received. Pixel data can be written at a specific position in the storage region by determining a position, the “drawing point,” relative to the origin, i.e., the upper left end of the reserved region. First, the drawing point is set to be the upper left end of the reserved storage region (step S31). [0180]
• Next, “number of pixels before the next transmission pixel” is received (step S32). Since the pixels of this number are nontransmission pixels and were not transmitted, the drawing point is shifted rightward by this number (step S33). In step S34, it is judged whether the drawing point reaches the right end or not. If the drawing point reaches the right end, the control proceeds to step S35 to judge whether the line is the last line or not. If the line is not the last line, the drawing point is set in step S36 to be the left end of the next line. The control then returns to step S32, i.e., the process of receiving the value before the next transmission pixel. [0181]
• If it is judged in step S34 that the drawing point has not yet reached the right end, the control proceeds to step S37, and “number of pixels before the next nontransmission pixel” is received. In step S38, since the data of pixels of this number were transmitted, the data are received and, concurrently with the reception, written into the image storing section 34. Specifically, since the data of each pixel are received in the sequence of R, G, and B, the pixel data are written at the position of the drawing point by using these values. Next, the drawing point is shifted rightward by one, and the R, G, and B data of the next pixel are received. This process is repeated for each of the pixels. [0182]
• When the process is ended, it is judged in step S39 whether the drawing point reaches the right end or not. If the drawing point reaches the right end, the control proceeds to step S35 to judge whether the line is the last line or not. If the line is not the last line, the drawing point is set to be the left end of the next line. The control then returns to step S32, i.e., the process of receiving the value before the next transmission pixel. If it is judged in step S35 that the drawing point reaches the right end of the last line, the reception of the image data is ended. [0183]
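A matching sketch of the reception procedure of FIG. 18 follows, under the same assumptions as the encoder above (2-byte big-endian run lengths, and a `recv(n)` callable that returns exactly n bytes); the name `decode_update_image` is hypothetical.

```python
import struct

def decode_update_image(recv, w, h):
    """Rebuild the partial image per FIG. 18.  recv(n) returns exactly
    n bytes from the reception side; w and h are the width and height
    already received.  Nontransmission pixels remain None."""
    image = [[None] * w for _ in range(h)]  # reserved storage region
    for y in range(h):
        x = 0
        while x < w:
            # "number of pixels before the next transmission pixel"
            (skip,) = struct.unpack(">H", recv(2))
            x += skip  # these pixels were not transmitted
            if x >= w:
                break  # right end reached: move to the next line
            # "number of pixels before the next nontransmission pixel"
            (run,) = struct.unpack(">H", recv(2))
            for _ in range(run):
                r, g, b = recv(3)  # data arrive in R, G, B order
                image[y][x] = (r, g, b)
                x += 1
    return image
```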
• As a result, images of the minimum necessary total size have been received. [0184]
• By repeating these processes, plural image data can be recorded in the image storing section. Next, an image is synthesized from these data. The method of the synthesization is strictly identical with the above-described method. The data required for the synthesization are the transformation matrix between two continuous images, and the pixel data of each image. [0185]
• As the elements of the transformation matrix, the transmitted data may be used. Although not all of the pixel data of each image are transmitted, the partial image data recorded in the image storing section are sufficient for the synthesization, because only the portion which does not overlap with the previous image is used in the synthesization. Therefore, an image can be synthesized from the received data. [0186]
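As a rough illustration that the stored partial images and the per-frame matrices suffice, the following sketch chains the matrices back to the first frame's coordinate system and pastes the transmitted pixels onto one canvas. The forward mapping with rounding, the `None` markers for nontransmission pixels, and the name `synthesize` are simplifying assumptions of this sketch, not the patent's literal procedure; a practical implementation would also offset the canvas and interpolate.

```python
import numpy as np

def synthesize(partials, matrices, canvas_w, canvas_h):
    """Paste each partial image into one mosaic.  partials[k] is the
    stored partial image of frame k (None where a pixel was a
    nontransmission pixel); matrices[k] is the 2x2 matrix mapping
    frame-k coordinates to frame-(k-1) coordinates, with matrices[0]
    the identity.  All points are mapped back to the first frame's
    coordinates and rounded onto the canvas."""
    canvas = [[None] * canvas_w for _ in range(canvas_h)]
    acc = np.eye(2)
    for part, M in zip(partials, matrices):
        acc = acc @ M  # frame-k coords -> first-frame coords
        for y, row in enumerate(part):
            for x, pixel in enumerate(row):
                if pixel is None:
                    continue  # this point overlapped the previous image
                u, v = acc @ np.array([x, y], dtype=float)
                ui, vi = round(u), round(v)
                if 0 <= ui < canvas_w and 0 <= vi < canvas_h:
                    canvas[vi][ui] = pixel
    return canvas
```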
  • The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description and all changes which come within the meaning and the range of equivalency of the claims are therefore intended to be embraced therein. [0187]

Claims (9)

What is claimed is:
1. An image synthesis and communication apparatus comprising:
imaging means for inputting images in time series;
image storing means for storing the images inputted from the imaging means;
feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point;
search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation;
correlation calculating means for obtaining correlations between the feature point and pixels in the search range;
image synthesizing means for synthesizing an image on the basis of a motion amount obtained from the correlation calculating means;
image transmitting means for transmitting an image obtained from the image synthesizing means; and
image receiving means for receiving the image transmitted from the image transmitting means.
2. The image synthesis and communication apparatus of claim 1, wherein the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames.
3. The image synthesis and communication apparatus of claim 1, wherein the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame, the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame, and the correlation calculating means performs a correlation calculation for each of the regions.
4. An image synthesis and communication apparatus comprising:
imaging means for inputting images in time series;
image transmitting means for transmitting images obtained from the imaging means;
image receiving means for receiving the images transmitted from the image transmitting means;
image storing means for storing the images inputted from the image receiving means;
feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point;
search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation;
correlation calculating means for obtaining correlations between the feature point and pixels in the search range; and
image synthesizing means for receiving a motion amount obtained from the correlation calculating means, and synthesizing an image.
5. The image synthesis and communication apparatus of claim 4, wherein the search range determining means determines the search range on the basis of a motion amount obtained from images of current and previous frames.
6. The image synthesis and communication apparatus of claim 4, wherein the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame, the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame, and the correlation calculating means performs a correlation calculation for each of the regions.
7. An image synthesis and communication apparatus comprising:
imaging means for inputting images in time series;
image storing means for storing the images inputted from the imaging means;
feature point extracting means for extracting a point of a large luminance change from an image of a current frame, as a feature point;
search range determining means for determining a predetermined region of an image of a subsequent frame, as a search range for a correlation calculation;
correlation calculating means for obtaining correlations between the feature point and pixels in the search range;
update image calculating means for obtaining an image of a newly-imaged portion from a motion amount obtained from the correlation calculating means;
image transmitting means for transmitting the motion amount obtained from the correlation calculating means, and a partial image obtained from the update image calculating means;
image receiving means for receiving the motion amount and the partial image transmitted from the image transmitting means; and
image synthesizing means for synthesizing an image from the motion amount and the partial image obtained from the image receiving means.
8. The image synthesis and communication apparatus of claim 7, wherein the search range determining means determines the search range on the basis of the motion amount obtained from images of current and previous frames.
9. The image synthesis and communication apparatus of claim 7, wherein the feature point extracting means extracts, as a feature point, a point of a large luminance change from plural regions where coordinates in one direction coincide with one another in an image of a current frame, the search range determining means determines a search range for a correlation calculation, for each of the regions of a subsequent frame, and the correlation calculating means performs a correlation calculation for each of the regions.
US09/162,024 1997-09-30 1998-09-28 Image synthesis and communication apparatus Expired - Fee Related US6434276B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP26613497A JP3966392B2 (en) 1997-09-30 1997-09-30 Image composition communication device
JPP9-266134 1997-09-30
JP9-266134 1997-09-30

Publications (2)

Publication Number Publication Date
US20010055429A1 true US20010055429A1 (en) 2001-12-27
US6434276B2 US6434276B2 (en) 2002-08-13

Family

ID=17426803

Family Applications (1)

Application Number Title Priority Date Filing Date
US09/162,024 Expired - Fee Related US6434276B2 (en) 1997-09-30 1998-09-28 Image synthesis and communication apparatus

Country Status (4)

Country Link
US (1) US6434276B2 (en)
EP (1) EP0905977B1 (en)
JP (1) JP3966392B2 (en)
DE (1) DE69841993D1 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6798923B1 (en) * 2000-02-04 2004-09-28 Industrial Technology Research Institute Apparatus and method for providing panoramic images
US7031016B1 (en) * 2000-05-26 2006-04-18 Kabushiki Kaisha Toshiba System for controlling images according to applications
US6806886B1 (en) * 2000-05-31 2004-10-19 Nvidia Corporation System, method and article of manufacture for converting color data into floating point numbers in a computer graphics pipeline
US20020126877A1 (en) * 2001-03-08 2002-09-12 Yukihiro Sugiyama Light transmission type image recognition device and image recognition sensor
US7646891B2 (en) * 2002-12-26 2010-01-12 Mitsubishi Denki Kabushiki Kaisha Image processor
TWI260918B (en) * 2005-05-30 2006-08-21 Pixart Imaging Inc Pixel compensation method for an image sensor
TWI297464B (en) * 2005-07-21 2008-06-01 Lightuning Tech Inc Wireless peripheral device having a sweep-type fingerprint sensing chip
JP4270281B2 (en) 2007-01-16 2009-05-27 セイコーエプソン株式会社 Apparatus, method, and program for image processing
JP4775277B2 (en) * 2007-02-07 2011-09-21 株式会社デンソー Image processing apparatus and image processing method
US8340477B2 (en) * 2008-03-31 2012-12-25 Intel Corporation Device with automatic image capture
JP5017431B2 (en) * 2009-07-24 2012-09-05 株式会社沖データ Image processing apparatus and image forming apparatus having the same
MX2018005709A (en) 2015-11-06 2018-11-09 Oklahoma Safety Equipment Company Inc Rupture disc device and method of assembly thereof.
CN110197167B (en) * 2019-06-05 2021-03-26 清华大学深圳研究生院 Video motion migration method

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5944184A (en) 1982-09-07 1984-03-12 Oki Electric Ind Co Ltd Still picture television conference device
JPS60203063A (en) 1984-03-27 1985-10-14 Nukada Fumiaki Communication equipment
JP2989364B2 (en) 1992-03-12 1999-12-13 シャープ株式会社 Image processing apparatus and image processing method
JPH0686149A (en) 1992-08-31 1994-03-25 Sony Corp Motion vector detector and video camera
EP0592136B1 (en) 1992-10-09 1999-12-08 Sony Corporation Producing and recording images
US5768404A (en) * 1994-04-13 1998-06-16 Matsushita Electric Industrial Co., Ltd. Motion and disparity estimation method, image synthesis method, and apparatus for implementing same methods
US5619281A (en) 1994-12-30 1997-04-08 Daewoo Electronics Co., Ltd Method and apparatus for detecting motion vectors in a frame decimating video encoder
EP0720382B1 (en) * 1994-12-30 2000-04-12 Daewoo Electronics Co., Ltd Variable size block matching motion estimation apparatus
EP0721287A1 (en) 1995-01-09 1996-07-10 Daewoo Electronics Co., Ltd Method and apparatus for encoding a video signal
TW257924B (en) * 1995-03-18 1995-09-21 Daewoo Electronics Co Ltd Method and apparatus for encoding a video signal using feature point based motion estimation
KR0171143B1 (en) * 1995-03-20 1999-03-20 배순훈 Apparatus for composing triangle in the hexagonal grid
KR0171118B1 (en) * 1995-03-20 1999-03-20 배순훈 Apparatus for encoding video signal
KR0181031B1 (en) * 1995-03-20 1999-05-01 배순훈 Apparatus for compensating edge in the motion compensated interpolation
KR0178229B1 (en) * 1995-08-08 1999-05-01 배순훈 Image processing system using a pixel-by-pixel motion estimation based on feature points
US5959673A (en) * 1995-10-05 1999-09-28 Microsoft Corporation Transform coding of dense motion vector fields for frame and object based video coding applications
KR100209793B1 (en) * 1995-10-28 1999-07-15 전주범 Apparatus for encoding/decoding a video signals by using feature point based motion estimation

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6714680B1 (en) * 1999-06-02 2004-03-30 Fuji Photo Film Co., Ltd. Method and apparatus for image positioning processing
US20050025339A1 (en) * 2000-06-27 2005-02-03 Kabushiki Kaisha Toshiba Electronic watermark detection apparatus and method
US6985602B2 (en) * 2000-06-27 2006-01-10 Kabushiki Kaisha Toshiba Electronic watermark detection apparatus and method
US8451335B2 (en) * 2008-12-12 2013-05-28 Keyence Corporation Imaging device
US20100149364A1 (en) * 2008-12-12 2010-06-17 Keyence Corporation Imaging Device
US20100149362A1 (en) * 2008-12-12 2010-06-17 Keyence Corporation Imaging Device
US8508587B2 (en) 2008-12-12 2013-08-13 Keyence Corporation Imaging device
US8611695B1 (en) 2009-04-27 2013-12-17 Google Inc. Large scale patch search
US8571349B1 (en) * 2009-04-27 2013-10-29 Google Inc Image enhancement through discrete patch optimization
US8396325B1 (en) * 2009-04-27 2013-03-12 Google Inc. Image enhancement through discrete patch optimization
US8391634B1 (en) 2009-04-28 2013-03-05 Google Inc. Illumination estimation for images
US8385662B1 (en) 2009-04-30 2013-02-26 Google Inc. Principal component analysis based seed generation for clustering analysis
US8385689B2 (en) * 2009-10-21 2013-02-26 MindTree Limited Image alignment using translation invariant feature matching
US20110091065A1 (en) * 2009-10-21 2011-04-21 MindTree Limited Image Alignment Using Translation Invariant Feature Matching
US8798393B2 (en) 2010-12-01 2014-08-05 Google Inc. Removing illumination variation from images
US20130215228A1 (en) * 2012-02-22 2013-08-22 David Stoker Method and apparatus for robustly collecting facial, ocular, and iris images using a single sensor
US9373023B2 (en) * 2012-02-22 2016-06-21 Sri International Method and apparatus for robustly collecting facial, ocular, and iris images using a single sensor
US8938119B1 (en) 2012-05-01 2015-01-20 Google Inc. Facade illumination removal
EP2662832A3 (en) * 2012-05-09 2017-05-03 Nokia Technologies Oy Method, apparatus and computer program product for alignment of frames

Also Published As

Publication number Publication date
DE69841993D1 (en) 2010-12-23
EP0905977A2 (en) 1999-03-31
JPH11112956A (en) 1999-04-23
US6434276B2 (en) 2002-08-13
EP0905977A3 (en) 2001-02-07
JP3966392B2 (en) 2007-08-29
EP0905977B1 (en) 2010-11-10

Similar Documents

Publication Publication Date Title
US6434276B2 (en) Image synthesis and communication apparatus
CN101843092B (en) Image apparatus and method
US5475428A (en) Method for processing color image records subject to misregistration
TWI459324B (en) Modifying color and panchromatic channel cfa image
US5060074A (en) Video imaging apparatus
US7956897B2 (en) Image combining device and imaging apparatus
CN100498893C (en) Projection image position adjustment method
US20060158545A1 (en) Image display method, imaging method, and image synthesis method
EP1653743A1 (en) Monitoring device and monitoring method using panorama image
EP2063646A2 (en) Method and apparatus for predictive coding
EP1184809A2 (en) Object detecting method and object detecting apparatus and intruding object monitoring apparatus employing the object detecting method
JP3448868B2 (en) Image coincidence detecting device and image coincidence detecting method
EP2088787A1 (en) Image picking-up processing device, image picking-up device, image processing method and computer program
JP2004208336A (en) Full color image adaptive interpolation arrangement using luminance gradients
JP2004056789A (en) Resolution and picture quality improving method for small image sensor
US7420598B1 (en) Apparatus and method for recording image data and reproducing zoomed images from the image data
US20080131027A1 (en) System and method for merging differently focused images
US7339617B1 (en) Image providing device and its providing method, image processing device and processing method, and storage medium
EP1933549B1 (en) Image correction device and image correction method
KR100323759B1 (en) Method and Apparatus for Data Acquisition and Controlling Equipment using Mobile Telecommunication Terminal
EP1109411A1 (en) Interpolation processor and recording medium recording interpolation processing program
JP3296675B2 (en) Subject tracking apparatus and subject tracking method
US6650362B1 (en) Movement detecting apparatus with feature point extractor based on luminance gradient in current frame
US7542074B2 (en) Photographed image projection device which sets a frame frequency of photographing by a camera based on movement in a photographed image
JP3786300B2 (en) Motion vector detection apparatus and motion vector detection method

Legal Events

Date Code Title Description
AS Assignment

Owner name: SHARP KABUSHIKI KAISHA, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:HIROSAWA, MASASHI;NAKAMURA, YASUHISA;AKAGI, HIROSHI;AND OTHERS;REEL/FRAME:009483/0041

Effective date: 19980910

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

REMI Maintenance fee reminder mailed
LAPS Lapse for failure to pay maintenance fees
STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20140813