WO2011105044A1 - Information processing method and information processing apparatus - Google Patents

Information processing method and information processing apparatus Download PDF

Info

Publication number
WO2011105044A1
WO2011105044A1 PCT/JP2011/000965 JP2011000965W WO2011105044A1 WO 2011105044 A1 WO2011105044 A1 WO 2011105044A1 JP 2011000965 W JP2011000965 W JP 2011000965W WO 2011105044 A1 WO2011105044 A1 WO 2011105044A1
Authority
WO
WIPO (PCT)
Prior art keywords
orientation
imaging
image
missing
information processing
Prior art date
Application number
PCT/JP2011/000965
Other languages
French (fr)
Inventor
Yusuke Mitarai
Masakazu Matsugu
Hiroto Yoshii
Original Assignee
Canon Kabushiki Kaisha
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Kabushiki Kaisha filed Critical Canon Kabushiki Kaisha
Priority to US13/580,560 priority Critical patent/US9429418B2/en
Publication of WO2011105044A1 publication Critical patent/WO2011105044A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01BMEASURING LENGTH, THICKNESS OR SIMILAR LINEAR DIMENSIONS; MEASURING ANGLES; MEASURING AREAS; MEASURING IRREGULARITIES OF SURFACES OR CONTOURS
    • G01B11/00Measuring arrangements characterised by the use of optical techniques
    • G01B11/02Measuring arrangements characterised by the use of optical techniques for measuring length, width or thickness
    • G01B11/03Measuring arrangements characterised by the use of optical techniques for measuring length, width or thickness by measuring coordinates of points
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01BMEASURING LENGTH, THICKNESS OR SIMILAR LINEAR DIMENSIONS; MEASURING ANGLES; MEASURING AREAS; MEASURING IRREGULARITIES OF SURFACES OR CONTOURS
    • G01B11/00Measuring arrangements characterised by the use of optical techniques
    • G01B11/26Measuring arrangements characterised by the use of optical techniques for measuring angles or tapers; for testing the alignment of axes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • G06T15/10Geometric effects
    • G06T15/20Perspective computation
    • G06T15/205Image-based rendering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30204Marker
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30244Camera pose

Definitions

  • the present invention relates to a method for generating a multi view-point image composed of a great number of images of a three-dimensional object captured from a plurality of different view points.
  • a great number of images captured from various view points are necessary. Further, with respect to recognition of a three-dimensional object by using an image which is obtained by imaging the three-dimensional object from an arbitrary view point or with respect to learning of a classifier that recognizes the three-dimensional object, it is useful to use a great number of images captured from various view points.
  • the learning of the classifier is to set or adjust a parameter of the classifier by using a great amount of data that continually increases according to learning. According to learning, optimum control is realized while adjustment is being made.
  • Various methods have been proposed so far as a method for acquiring a great number of images obtained by imaging an object from various view points.
  • Japanese Patent Laid-Open No. 2007-72537 discusses an imaging apparatus including a plurality of imaging apparatuses and a rotation base where an object is mounted.
  • the imaging apparatus references a feature point of the object. According to such an apparatus, the orientation of the object can be changed and a complete periphery image of the object can be captured.
  • Japanese Patent Laid-Open No. 2004-139294 discusses a multi view-point image processing program using a plurality of markers each of which serves as a reference of a view point position. According to such a method, as is with patent literature 1, without using a large-scale imaging apparatus, images of an object captured according to imaging from various view points can be acquired.
  • an improved three-dimensional model or improved learning of a classifier that performs arbitrary view-point object recognition it is useful if the image is acquired according to a shape of the object. For example, it is useful if an image of a portion whose structure is complex is acquired in detail from a number of view points. However, the portion of the view points which should be densely arranged in capturing a portion of the object is not clear. Further, even if a portion where view points should be densely arranged is given, a method for simply acquiring such an image is not provided.
  • Japanese Patent Laid-Open No. 2007-72537 Japanese Patent Laid-Open No. 2004-139294 Japanese Patent Laid-Open No. 2007-156528 Japanese Patent Laid-Open No. 2000-194859
  • an image which is missing in object recognition is presented by using a great number of images obtained by imaging an object from various view points.
  • an information processing apparatus includes a relative position and orientation calculation means configured to calculate, based on a plurality of images captured by imaging an object from a plurality of view points by an imaging means, a relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points, a missing position and orientation calculation means configured to calculate a missing position and orientation of the imaging means from which imaging by the imaging means is missing based on the calculated plurality of relative positions and orientations, and a missing view-point image generation means configured to generate an image used for displaying the calculated missing position and orientation on a display means.
  • an information processing method performed by an information processing apparatus includes a relative position and orientation calculation means of the information processing apparatus calculating, based on a plurality of images captured by imaging an object from a plurality of view points by an imaging means, a relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points, a missing position and orientation calculation means of the information processing apparatus calculating a missing position and orientation of the imaging means from which imaging by the imaging means is missing based on the calculated plurality of relative positions and orientations, and a missing view-point image generation means of the information processing apparatus generating an image used for display of the calculated missing position and orientation on a display means.
  • a program causes a computer to execute each step of one of the video information processing methods described above.
  • a recording medium stores a program causing a computer to execute each step of one of the video information processing methods described above.
  • Fig. 1 is a configuration diagram according to a first exemplary embodiment.
  • Fig. 2 illustrates a definition of a measurement coordinate system according to the first exemplary embodiment.
  • Fig. 3 illustrates definition of a camera coordinate system and an image coordinate system according to the first exemplary embodiment.
  • Fig. 4 is a flowchart illustrating processing according to the first exemplary embodiment.
  • Fig. 5 is a flowchart illustrating processing performed by a missing position and orientation calculation unit according to the first exemplary embodiment.
  • Fig. 6 illustrates an example of an image displayed on a display unit according to the first exemplary embodiment.
  • Fig. 7 is a configuration diagram according to a second exemplary embodiment.
  • Fig. 8 is a flowchart illustrating processing according to the second exemplary embodiment.
  • Fig. 1 is a configuration diagram according to a first exemplary embodiment.
  • Fig. 2 illustrates a definition of a measurement coordinate system according to the first exemplary embodiment.
  • Fig. 3 illustrates definition of a
  • FIG. 9A illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment.
  • Fig. 9B illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment.
  • Fig. 9C illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment.
  • Fig. 9D illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment.
  • Fig. 10 is a flowchart illustrating processing performed by the missing position and orientation calculation unit according to the second exemplary embodiment.
  • Fig. 11 is a configuration diagram according to a third exemplary embodiment.
  • Fig. 12 is a flowchart illustrating processing according to the third exemplary embodiment.
  • Fig. 13 illustrates an example of missing view point determination based on comparison of two images.
  • Fig. 14 illustrates an example of missing view point determination based on an orientation estimation result.
  • Fig. 15 is a flowchart illustrating processing performed by the missing position and orientation calculation unit according to the third exemplary embodiment.
  • a multi view-point image is generated from a great number of images obtained by evenly imaging an object being a measurement object from various view points.
  • Fig. 1 is a configuration diagram according to the present embodiment.
  • the present embodiment includes an imaging unit 101, an image input unit 102, a relative position and orientation calculation unit 103, a multi view-point image storage unit 104, a missing position and orientation calculation unit 105, a missing view-point image generation unit 106, and a display unit 107.
  • an object 112 being a measurement object is set on a support 113.
  • a plurality of markers 121, 122, . . . , 128 are arranged in the periphery of the object 112. Further, the eight markers of the markers 121 to 128 correspond to color 1, color 2, . . . , color 8 respectively.
  • the imaging unit 101 performs imaging of the object 112. For example, imaging of the object 112 mounted on the support 113 is performed by a camera operated by a user.
  • the imaging unit 101 can externally output the image which has been captured.
  • the imaging unit is a digital still camera or a camcorder including an image sensor, such as a charge-coupled device (CCD), and a lens.
  • the captured image is transmitted to the image input unit 102 when, for example, the user presses a shutter button of the digital still camera.
  • the image input unit 102 inputs the image transmitted from the imaging unit 101.
  • the image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103.
  • the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object, and the imaging unit 101.
  • the relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104.
  • the multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed. The storing operation is repeated until the number of the imaging orientation data pieces reaches a predetermined number. The recorded imaging orientation data is transmitted to the missing position and orientation calculation unit.
  • the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage based on a great amount of imaging orientation data obtained by imaging up to the current stage and stored in the multi view-point image storage unit 104.
  • the missing view-point image generation unit 106 generates an image showing the view point which is missing from the imaging orientation data of the missing view points calculated by the missing position and orientation calculation unit 105.
  • the display unit 107 displays the display image generated by the missing view-point image generation unit 106.
  • Fig. 2 illustrates a definition of a measurement coordinate system according to the present embodiment.
  • the support 113 is where a measurement object is mounted.
  • the support 113 is provided on the measurement base 111.
  • the point of origin of the measurement coordinate system is a contact position 200 where the support 113 contacts the measurement base 111, and the axes are an x-axis 201, a y-axis 202, and a z-axis 203.
  • the markers are uniquely identified. For example, if the markers have different colors and their three-dimensional positions are known, a color unique to each marker is extracted from the image captured by the imaging unit 101. According to this processing, a marker in the image can be identified and the position of the marker in the image can be detected. A three-dimensional position of each marker is fixed and a center position of the markers in the measurement coordinate system is measured in advance.
  • Fig. 3 illustrates a definition of a camera coordinate system and an image coordinate system.
  • a three-dimensional camera coordinate system is defined with respect to the imaging unit 101.
  • an image plane 302 which is a virtual plane corresponding to the captured image is defined, and a two-dimensional image coordinate system of the image plane is defined.
  • a principal point of the lens of the imaging unit 101 in the camera coordinate system is a point of origin 310.
  • the direction corresponding to the right direction with respect to the captured image is defined as an x'-axis 311, the direction corresponding to the upper direction is defined as a y'-axis 312, and the axis parallel to the optical axis of the lens and in a direction opposite to an imaging direction 301 of the imaging unit 101 is defined as a z'-axis 313.
  • the image coordinate system is defined, as is illustrated in Fig. 3, as a two-dimensional coordinate system.
  • the center of an image plane 302 is defined as a point of origin 320
  • the direction corresponding to the right direction of the image is defined as an x"-axis 321
  • the direction corresponding to the upper direction is defined as a y"-axis 322.
  • Fig. 4 is a flowchart illustrating processing according to the present embodiment. An example of a multi view-point image generation method will be described.
  • step S401 the image input unit 102 inputs an image transmitted from the imaging unit 101 to the relative position and orientation calculation unit 103.
  • the image obtained by imaging the object 112 needs to include at least four markers in the image.
  • the image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103 and the processing proceeds to step S402.
  • step S402 the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101. In other words, by processing the image transmitted from the image input unit 102, the relative position and orientation calculation unit 103 measures a position and orientation of the imaging unit when the imaging unit captured the image.
  • the relative position and orientation calculation unit extracts the colors 1 to 8, each of which is unique to each marker, and determines whether each marker is included in the image. To be more precise, a hue is obtained from RGB values of each pixel and if the obtained hue and the hue of the color to be extracted are within a predetermined range, that pixel is extracted. If a pixel of a corresponding color is extracted from the image, it is determined that a marker corresponding to that color is included in the image. Then, a barycentric position (position (x", y”) in the above-described image coordinate system) with respect to the image of the pixel corresponding to the color of the marker is obtained.
  • positions of four or more markers, whose three-dimensional positions in the measurement coordinate system are known, in the image coordinate system can be obtained.
  • a focal length of the lens is known as f [mm]
  • f [mm] a focal length of the lens
  • a relative position and orientation of two three-dimensional coordinate systems is expressed by two components of the coordinate system, which are a translational component and a rotational component.
  • the translational component is a translational mobile component between two points of origin of the two three-dimensional coordinates and is expressed by a translational mobile component (Tx, Ty, Tz) from a point of origin 40 of the measurement coordinate system to the point of origin 310 of the camera coordinate system.
  • the rotational component is expressed by an axis of rotation and an amount of rotation with respect to the axis of rotation.
  • the relative position and orientation of the two three-dimensional coordinate systems is expressed by the amount of rotation when the x-axis 201, the y-axis 202, and the z-axis 203 of the measurement coordinate system are in the same directions as the x'-axis 311, the y'-axis 312, and the z'-axis 313 of the camera coordinate system respectively.
  • the measurement coordinate system is expressed by a direction of the axis of rotation (Rx, Ry, Rz) and a rotation angle "theta" when the measurement coordinate system is rotated on the axis of rotation which passes through the origin of the measurement coordinate system.
  • the measured relative position and orientation is expressed by the translational component (Tx, Ty, Tz) and the rotational component between the coordinate systems.
  • the rotational component is expressed by the direction of the axis of rotation (Rx, Ry, Rz) and the rotation angle "theta".
  • the relative position and orientation is used as the imaging orientation data.
  • step S403 The relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104 and then the processing proceeds to step S403.
  • the multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed.
  • the image of the object 112 which has been captured by the user and the position and orientation of the imaging unit 101 when the image has been captured are stored in the multi view-point image storage unit 104.
  • the imaging is repeated until, for example, about 60 pieces of data are obtained.
  • step S404 the missing position and orientation calculation unit 105 calculates the imaging-missing view point at the current stage based on a great amount of imaging orientation data captured by that time and stored in the multi view-point image storage unit 104.
  • Fig. 5 is a flowchart illustrating processing performed by the missing position and orientation calculation unit. Details of the actual processing performed by the missing position and orientation calculation unit 105 in step S404 will be described.
  • Step S501 is an approximately uniform direction calculation step.
  • virtual view points are approximately uniformly arranged on a sphere in a virtual three-dimensional space having the object 112 at the center.
  • a virtual coordinate system is set in a virtual three-dimensional space, and it is assumed that a sphere with radius 1 having the point of origin of the virtual coordinate system at the center is provided. Points of a predetermined number M are approximately uniformly arranged on the surface of the sphere. Then, a direction in which the imaging is possible is calculated
  • the points can be approximately uniformly arranged on the surface of the sphere.
  • Step S502 is a similar direction determination step. According to this step, it is determined whether a point not yet used for imaging in a similar orientation exists. To be more precise, it is determined whether imaging in the orientation similar to the imaging orientation in the virtual three-dimensional space is already performed when it is assumed that the imaging is performed with an angle toward the center of the sphere from each point. First, it is simply assumed that the imaging unit 101 is arranged such that the imaging is performed with an angle toward the center of the sphere from each point. Then, the direction of an z'-axis 313 of the camera coordinate system illustrated in Fig. 3 in the virtual coordinate system at that time is obtained.
  • the imaging unit 101 Since the imaging is performed with an angle toward the center of the sphere from a point on the surface of the sphere, in other words, since the negative direction of the z'-axis 313 of the camera coordinate system passes through the center of the sphere, this direction will be the position of each point in the virtual coordinate system.
  • the imaging unit 101 is arranged such that the image of the center of the sphere is captured from a certain point (Pxi, Pyi, Pzi), the direction of the z'-axis 313 of the camera coordinate system in the virtual coordinate system will be simply (Pxi, Pyi, Pzi).
  • a direction with respect to the measurement coordinate system having highest similarity with the direction of the z'-axis 313 of the camera coordinate system is searched and the similarity is obtained.
  • An inner product of each of the two directions can be used in determining the similarity.
  • a direction (Qx, Qy, Qz) of the z'-axis 313 in the camera coordinate system of the measurement coordinate system is obtained. If the rotational components of the imaging orientation data are the direction of the axis of rotation (Rx, Ry, Rz) and the rotation angle "theta", the direction (Qx, Qy, Qz) can be obtained from the following equation (1).
  • inner product of the direction (Pxi, Pyi, Pzi) corresponding to the certain point arranged approximately uniformly on the surface of the sphere and the direction (Qx, Qy, Qz) acquired according to the above-described equation (1) by using a plurality pieces of imaging orientation data stored in the multi view-point image storage unit 104 is obtained. From the obtained inner products, an inner product with the largest value is obtained.
  • the predetermined reference value used for the determination can be an arbitrary value smaller than 1. For example, a value, 0.8, is used.
  • Step S503 is a dissimilar direction calculation step.
  • an imaging-missing view point imaging orientation when the imaging is performed with an angle toward the center of the sphere from a point on the surface of the sphere is obtained.
  • a direction (Pxi, Pyi, Pzi) corresponding to the point is determined as the imaging orientation corresponding to the imaging-missing view point. According to such processing, an imaging-missing view point is calculated.
  • step S405 it is determined whether a missing position and orientation exists. If it is determined that imaging from view points corresponding to all the points arranged approximately uniformly on the surface of the sphere is sufficient (NO in step S405), the missing position and orientation calculation unit 105 determines that imaging of the measurement object body is evenly performed from various view points, and the processing ends.
  • step S405 if it is determined that imaging from a view point corresponding to one of the points arranged approximately uniformly on the surface of the sphere is missing (YES in step S405), the data (Pxi, Pyi, Pzi) corresponding to the imaging orientation is transmitted to the missing view-point image generation unit 106, and the processing proceeds to step S406.
  • step S406 the missing view-point image generation unit 106 generates an image presenting the missing view point based on the data of the imaging orientation transmitted from the missing position and orientation calculation unit 105.
  • Fig. 6 illustrates an example of an image displayed on the display unit 107.
  • An arrow 601 corresponds to an imaging orientation from an imaging-missing view point transmitted from the missing position and orientation calculation unit 105.
  • the arrow 601 is on a dotted line 602.
  • the dotted line 602 connects the point of origin 40 of the measurement coordinate system and the data (Pxi, Pyi, Pzi) regarding imaging orientation from the imaging-missing view point.
  • the missing view point can be confirmed.
  • the missing view point can be expressed by a form other than an arrow so long as the user can understand the missing view point.
  • the user that performs the imaging of the measurement object body performs imaging from the missing view point displayed on the display unit 107.
  • An image captured from the view point corresponding to the arrow 601 is transmitted from the imaging unit 101 to the image input unit 102, and the processing returns to step S401.
  • a multi view-point image composed of a great number of images obtained by evenly performing imaging from various view points is stored and a multi view-point image composed of a great number of images obtained by evenly performing imaging from various view points can be generated.
  • step S407 the display unit 107 displays the image generated by the missing view-point image generation unit 106.
  • the display unit 107 displays a position to which the imaging unit 101 is to be set and imaging of the object 112 is to be performed.
  • the images of the object 112 captured evenly from various view points are stored in the multi view-point image storage unit 104, and the processing is continued until there are no missing view points.
  • a multi view-point image composed of a great number of images of the measurement object body 112 captured evenly from various view points is generated.
  • a view point used in imaging a complex portion of the three-dimensional structure of the measurement object body it is desirable to minutely perform the imaging compared to imaging of a portion having a simple structure. For example, by inputting data of a three-dimensional model of the measurement object body in advance and analyzing the data, a complex portion of the three-dimensional structure of the measurement object body is obtained.
  • the measurement object body is a polygon
  • the complexity of the structure can be evaluated according to the number of sides of the polygon.
  • a view point used for imaging the complex portion of the three-dimensional structure of the measurement object body is obtained.
  • a point in the vicinity of a point corresponding to a view point used for imaging the object body where the structure is complex is additionally generated.
  • the density of the points on the surface of the sphere in the vicinity of the view point that is used for imaging the complex portion of the structure will be higher.
  • step S402 the relative position and orientation of the measurement object body and the imaging unit 101 is directly obtained.
  • the method for obtaining the position and orientation of the imaging unit 101 is not limited to using a marker.
  • the position and orientation can be obtained in a way different from a captured image.
  • a relative position and orientation can be obtained.
  • the relative position and orientation of the imaging unit 101 and the object 112 can be obtained by estimating the relative position and orientation of the imaging unit and the object 112 from a three-dimensional model by using a technique discussed in nonpatent literature 2.
  • an information processing method used for generating a multi view-point image used by a classifier will be described.
  • the classifier recognizes the orientation of the object.
  • the information processing method is used for generating the multi view-point image which is used for the learning of the classifier.
  • Fig. 7 is a configuration diagram according to the present embodiment.
  • the present embodiment includes the imaging unit 101, the image input unit 102, the relative position and orientation calculation unit 103, an image normalization unit 701, the multi view-point image storage unit 104, a classifier learning unit 702, the missing position and orientation calculation unit 105, the missing view-point image generation unit 106, and the display unit 107.
  • the object 112 which is a measurement object is set on the support 113.
  • a plurality of markers 121, 122, . . . , 128 are arranged. Further, the eight markers of the markers 121 to 128 correspond to the color 1, color 2, . . .
  • the object 112 which is a measurement object does not include the colors 1 to 9.
  • the imaging unit 101 performs imaging of the object 112.
  • the captured image is transmitted to the image input unit 102.
  • the image input unit 102 inputs the image transmitted from the imaging unit 101 to the relative position and orientation calculation unit 103.
  • the image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103.
  • the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101.
  • the relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104.
  • the image normalization unit 701 normalizes an image of the object 112 which is the measurement object into a unified format. To be more precise, an image of the measurement object is clipped from the input image. Then a gray-scale image of the clipped image is generated and the generated image is normalized into a predetermined size.
  • the multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed. The storing operation is repeated until the number of the imaging orientation data pieces reaches a predetermined number. The recorded imaging orientation data is transmitted to the classifier learning unit 702.
  • the classifier learning unit 702 Based on a great number of images and imaging orientation data stored in the multi view-point image storage unit 104, the classifier learning unit 702 performs learning of the classifier that estimates the orientation of the object 112 which is included in the image.
  • the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage based on a great amount of imaging orientation data obtained by imaging by the current stage according to a result of estimation performed by the classifier which has performed learning according to the classifier learning unit 702.
  • the missing view-point image generation unit 106 generates an image showing the view point which is missing from the imaging orientation data of the missing view points calculated by the missing position and orientation calculation unit 105.
  • the display unit 107 displays the display image generated by the missing view-point image generation unit 106.
  • Fig. 8 is a flowchart illustrating processing flow of a multi view-point image generation method according to the present embodiment.
  • step S801 the image input unit 102 inputs the image transmitted from the imaging unit 101.
  • the image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103 and then the processing proceeds to step S802.
  • step S802 the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101.
  • step S803 the image normalization unit 701 normalizes an image of the object 112 which is the measurement object into a unified format. To be more precise, an image of the object 112 which is the measurement object is clipped from the input image. Then a gray-scale image of the clipped image is generated and the generated image is normalized into a predetermined size.
  • Fig. 9 illustrates an example of an image processed by the image normalization unit 701. Details of the actual processing performed by the image normalization unit 701 in step S803 will be described.
  • Fig. 9A illustrates an input image. Pixels of colors other than colors 1 to 8 being colors of the markers and the color 9 being the color assigned to the regions other than the regions of the markers in the input image, are extracted. As a result of the extraction, an image that extracts only the region of the object 112 which is the measurement object in Fig. 9B is generated. From this image, a circumscribing square region 9 is extracted and clipped as illustrated in Fig. 9C. Then, the color image of the clipped square region is changed into a gray scale image. As illustrated in Fig.
  • the size of the image is normalized by expanding or reducing the image so that the width and the height of the clipped region is a predetermined length w pixels.
  • An arbitrary value can be used for the predetermined length w pixels.
  • step S804 the multi view-point image storage unit 104 stores the image which has undergone the normalization processing in association with the imaging orientation data calculated by the relative position and orientation calculation unit 103.
  • the classifier learning unit 702 performs learning of the classifier that estimates the orientation of the object 112 included in the image based on a great number of images and imaging orientation data stored in the multi view-point image storage unit 104.
  • the learning of the classifier a plurality of images which have undergone processing such as object clipping performed by a clipping processing unit 702 and imaging orientation data corresponding to each of the images, which are stored in the multi view-point image storage unit 104, are used as learning data.
  • this classifier can be used for the estimation of orientation of a three-dimensional object of the support vector regression method discussed in nonpatent literature 3.
  • the values predicted by the classifier use three values, that is, roll, pitch, and yaw as estimated values of the orientation of the object 112. To be more precise, the direction of the axis of rotation (Rx, Ry, Rz) and the value of the rotation angle "theta" that represent the rotational component of the imaging orientation data are estimated.
  • an eigenspace of the image group is obtained.
  • the eigenspace is of an arbitrary dimension, it is desirable to determine the dimension using a cumulative contribution ratio. For example, a number of dimensions where the cumulative contribution ratio is 90% or greater is used as a number of dimensions d ( ⁇ 100 x 100) in the eigenspace. Then, data generated by projecting each of the great number of images which are stored, to an eigenspace of d-dimension is used as input data.
  • the direction of the axis of rotation (Rx, Ry, Rz) that indicates a rotational component of the imaging orientation data corresponding to each of the great number of images and the rotation angle "theta" are used as target values when the learning of the support vector regression method is performed. If a clipped image of the object 112 by the image normalization unit is input, in the classifier based on the support vector regression method, an orientation of the measurement object body 112 is output.
  • the classifier learning unit 702 is not limited to a classifier using the support vector regression method. For example, a classifier that estimates the orientation of the object 112 from an image of the object 112 which has been captured such as a parametric eigenspace method discussed in nonpatent literature 4 can also be used.
  • the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage based on estimation capability of the classifier at the time the classifier has performed learning according to a classifier learning unit 702. For example, a predetermined number of points are approximately uniformly arranged on the surface of the sphere in the virtual three-dimensional space. A plurality of view points, from which imaging is performed with an angle toward the center of the sphere, is prepared. The plurality of view points are hereinafter referred to as virtual view points. Then, regarding each of the virtual view points, out of the plurality pieces of imaging orientation data stored in the multi view-point image storage unit 104, two view points closest to the virtual view point are selected. These view points are hereinafter referred to as imaging-completed adjacent view points. An inner product value in the z'-axis direction in the camera coordinate system indicates the proximity of the view point. In this manner, the imaging-missing view point is calculated.
  • Fig. 10 is a flowchart illustrating processing performed by the missing position and orientation calculation unit. Details of actual processing performed by a missing position and orientation calculation unit 123 in step S806 will be described.
  • step S1001 a predetermined number of points are approximately uniformly arranged on the surface of the sphere in the virtual three-dimensional space.
  • a plurality of view points, from which imaging is performed with an angle toward the center of the sphere, is prepared.
  • the plurality of view points are hereinafter referred to as virtual view points.
  • two view points closest to the virtual view point are selected. These view points are hereinafter referred to as imaging-completed adjacent view points.
  • the proximity of the view point it may be determined that the higher the value of the inner product in the z'-axis direction in the camera coordinate system, the closer the view point.
  • an estimated image of the measurement object body 112 when imaging is performed from a position corresponding to the virtual view point is generated.
  • the estimated image is a virtual image of the measurement object body 112 when the measurement object body is observed from the above-described virtual view point. Any method can be used as the generation method of the estimated image so long as it can generate the above-described virtual image. Further, as is with the processing of the clip processing unit 702, a gray-scale image having the width and height of w pixels can be generated.
  • step S1002 the image of the measurement object body 112 which is captured from the generated virtual view point is input to the classifier of the classifier learning unit 702, and the result of the orientation estimation performed by the classifier is obtained.
  • a composite image of a measurement object body corresponding to imaging from a certain virtual view point is prepared, and with respect to the composite image, the classifier that has undergone the learning is applied, and a result of the orientation estimation of the object in the composite image is calculated.
  • step S1003 if the result of the orientation estimation is not similar to the imaging orientation of an imaging system 71 when the imaging of the object is performed from a corresponding virtual view point, then it is determined that imaging is missing. Whether the orientation and the imaging orientation are similar or not, can also be determined according to whether a difference between the imaging orientation corresponding to the virtual view point and the result of the orientation estimation is equal to or smaller than a predetermined value. The difference between the two orientations can be considered as a rotation angle of one orientation when it is rotated so that it matches the other orientation.
  • Fig. 14 is a schematic diagram of an example of missing view point determination based on the result of the orientation estimation.
  • an image 1403 captured from a virtual view point which is a point corresponding to the middle of them is generated by using, for example, morphing. Then, the image is input to a discriminant function (e.g., multivalued nonlinear kernel regression function) which outputs the orientation of the object (direction of the axis of rotation and rotation angle) learned by using the acquired image. Whether the imaging data taken from the middle of the view points is missing is determined based on whether the orientation of the object which is output (estimated value) is within a predetermined range.
  • a discriminant function e.g., multivalued nonlinear kernel regression function
  • an estimated orientation output 1404 of the middle of the view points (output of the discriminant function) is correct as a middle of the view points, and the missing view points are calculated.
  • a curve in Fig. 14 schematically shows values (multivalues) of the correct orientation. If the orientation estimated output 1404 is an output close to the curve as illustrated in Fig. 14, it is determined that an image captured from a view point corresponding to the virtual view point is not missing. If the orientation estimated output 1404 is greatly different from the curve, it is determined that an image captured from a view point corresponding to the virtual view point is missing for the learning of the classifier.
  • step S1004 an imaging orientation corresponding to the virtual view point which has been determined that imaging from that point is missing is obtained.
  • the position (Pxi, Pyi, Pzi) on the surface of the virtual sphere is obtained as the imaging orientation corresponding to the virtual view point.
  • step S807 it is determined whether a missing position and orientation exists. If it is determined that imaging from view points corresponding to all the points arranged approximately uniformly on the surface of the sphere is sufficient (NO in step S807), the missing position and orientation calculation unit 105 determines that a multi view-point image of the measurement object body 112 is sufficiently captured for the learning of the classifier, and the processing proceeds to step S810.
  • step S405 if it is determined that imaging from a view point corresponding to one of the points arranged approximately uniformly on the surface of the sphere is missing (YES in step S405), the data (Pxi, Pyi, Pzi) of the imaging orientation corresponding to the virtual view point determined as missing is transmitted to the missing view-point image generation unit 106, and the processing proceeds to step S808.
  • step S808 the missing view-point image generation unit 106 generates an image presenting the missing view point based on the data of the imaging orientation transmitted from the missing position and orientation calculation unit 105.
  • step S809 the display unit 107 displays the image generated by the missing view-point image generation unit 106.
  • step S810 data of the classifier in a state where there are no missing view points is stored in the multi view-point image storage unit 104.
  • a number of dimensions of the eigenspace related to the eigenspace on which imaging is performed and a basis vector corresponding to the number of dimensions, a plurality of support vectors, and a plurality of linear combination coefficients corresponding to the plurality of support vectors are stored in the multi view-point image storage unit 104.
  • a multi view-point image composed of images necessary in the learning of the classifier that estimates the orientation can be generated. Further, since the classifier that performs arbitrary view point object recognition performs the learning by using the multi view-point image, the orientation of the object can be estimated by using the classifier.
  • an example of a method for generating a multi view-point image used for learning of a classifier that recognizes orientation of an object whose image is captured is described.
  • the user performs imaging from the missing view point.
  • the imaging unit 101 can be mounted on a robot arm or the like. Then, the robot arm or the like can be moved to the missing view point and the imaging can be performed from the missing view point.
  • data of the imaging orientation corresponding to the virtual view point which is determined as missing by the missing position and orientation calculation unit 105 is transmitted to an operation control unit of the robot arm or the like. Then, the operation control unit moves the robot arm based on the transmitted data of the imaging orientation.
  • imaging by the imaging unit 101 is performed. In this way, the imaging from the missing view point can be performed and a multi view-point image sufficient for the learning of the classifier can be automatically obtained.
  • an example of an information processing method for generating a multi view-point image used for generating a three-dimensional model of a measurement object body will be described.
  • Fig. 11 is a configuration diagram according to the present embodiment.
  • the present embodiment includes the imaging unit 101, the image input unit 102, the relative position and orientation calculation unit 103, an image normalization unit 701, the multi view-point image storage unit 104, a three-dimensional model generation unit 1101, the missing position and orientation calculation unit 105, the missing view-point image generation unit 106, and the display unit 107.
  • the object 112 which is a measurement object is set on the support 113.
  • a plurality of markers 121, 122, . . . , 128 are arranged. Further, the eight markers of the markers 121 to 128 correspond to the color 1, color 2, . .
  • the object 112 which is a measurement object does not include the colors 1 to 9. In the following description, regarding the above-described configuration, only components different from those of the first exemplary embodiment are described and description of similar portions are not repeated.
  • the imaging unit 101 performs imaging of the object 112.
  • the captured image is transmitted to the processing unit 12.
  • the image input unit 102 inputs the image transmitted from the imaging unit 101 to the processing unit 12.
  • the image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103.
  • the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101.
  • the relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104.
  • the image normalization unit 701 clips only the object 112 which is a measurement object from the input image.
  • the clipped image is transformed into a gray scale image and normalized into a predetermined size.
  • the multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed.
  • the three-dimensional model generation unit 1101 generates a three-dimensional model of the measurement object body by using the images of the measurement object body 112 captured from a plurality of view points and stored in the multi view-point image storage unit 104.
  • the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage according to a great amount of imaging orientation data obtained by imaging performed by the current stage.
  • the missing view-point image generation unit 106 generates an image to be displayed from the imaging orientation data of the missing view points calculated by the missing position and orientation calculation unit 105.
  • the display unit 107 displays the display image generated by the missing view-point image generation unit 106.
  • Fig. 12 is a flowchart illustrating a multi view-point image generation method according to the present embodiment.
  • step S1201 the image input unit 102 inputs the image transmitted from the imaging unit 101.
  • the image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103, and then the processing proceeds to step S1202.
  • step S1202 the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101.
  • step S1203 the image normalization unit 701 clips only the object 112 which is the measurement object from the input image.
  • the clipped image is transformed into a gray scale image and normalized into a predetermined size.
  • step S1204 the multi view-point image storage unit 104 stores the image whose object is clipped and has undergone the normalization processing in association with the imaging orientation data obtained by the relative position and orientation calculation unit 103.
  • step S1205 the three-dimensional model generation unit 1101 generates a three-dimensional model of the measurement object body by using the images of the measurement object body 112 captured from a plurality of view points and stored in the multi view-point image storage unit 104. For example, by using a technique discussed in patent literature 4, a three-dimensional model of the object 112 captured from a plurality of view points is generated.
  • step S1206 based on the three-dimensional model generated by the three-dimensional model generation unit 1101, the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage.
  • Fig. 15 is a flowchart illustrating processing performed by the missing position and orientation calculation unit. Details of the actual processing performed by the missing position and orientation calculation unit 105 in step S1206 will be described.
  • step S1501 first, a predetermined number of points are approximately uniformly arranged on the surface of a sphere in the virtual three-dimensional space and a virtual view point corresponding to each of such point is provided. Subsequently, two imaging-completed adjacent view points are selected for each virtual view point. Then, by using an image of the measurement object body 112 corresponding to the selected two imaging-completed adjacent view points, an estimated image of the measurement object body 112 is generated in a case where the imaging of the measurement object body is performed from a position corresponding to the virtual view point.
  • the estimated image can also be generated, for example, by using view morphing method discussed in nonpatent literature 5.
  • step S1502 a rendering image of the three-dimensional model which is generated by using the multi view-point image obtained by the current stage is generated.
  • the rendering image of the three-dimensional model is generated by rendering the three-dimensional model generated by the three-dimensional model generation unit 1101 under a condition that the three-dimensional model is observed from each virtual view point.
  • the rendering is a 3D-CG technique.
  • step 1503 the estimated image and the rendering image corresponding to each virtual point are compared. If similarity of the two images is low, it is determined that imaging from that virtual view point is missing.
  • the comparison of the two images is based on, for example, comparison of normalization correlation. If the value of the normalization correlation is equal to or lower than a predetermined value, it is determined that the similarity of the two images is low.
  • Fig. 13 is a schematic diagram of an example of the missing view point determination based on the comparison of two images.
  • an image 1303 captured from a virtual view point which is a point corresponding to the middle of them is generated by using, for example, a view-point morphing method.
  • a rendering image 1304 is generated by rendering the three-dimensional model generated by using the multi view-point images obtained by the current stage according to projection under the condition that the observation is made from the middle of the view points. Then, by comparing the two images, based on whether their similarity (e.g., normalization correlation value of the result of the edge extraction) is equal to or greater than a predetermined value, whether imaging data of that middle of the view points is missing or not is determined. In other words, if a difference 1305 between the estimated image 1303 which is generated according to the view point morphing method or the like and the rendering image 1304 is great, it is determined that a corresponding view point is missing.
  • step S1504 an imaging orientation corresponding to the virtual view point about which it has been determined that imaging from that point is missing is obtained.
  • the position (Pxi, Pyi, Pzi) on the surface of the virtual sphere is obtained as the imaging orientation corresponding to the virtual view point.
  • step S1207 it is determined whether a missing position and orientation exists. If it is determined that imaging from view points corresponding to all the points arranged approximately uniformly on the surface of the sphere is sufficient (NO in step S807), the missing position and orientation calculation unit 105 determines that imaging of a multi view-point image of the measurement object body 112 is sufficiently captured for the generation of the three-dimensional model, and the processing proceeds to step S1210.
  • step S405 if it is determined that the imaging from a view point corresponding to one of the points arranged approximately uniformly on the surface of the sphere is missing (YES in step S405), the data (Pxi, Pyi, Pzi) of the imaging orientation is transmitted to the missing view-point image generation unit 106, and the processing proceeds to step S1208.
  • step S1208 the missing view-point image generation unit 106 generates an image indicating the missing view point based on the data of the imaging orientation sent from the missing position and orientation calculation unit 105.
  • step S1209 the display unit 107 displays the image generated by the missing view-point image generation unit 106.
  • step S1210 the three-dimensional model generation unit 1101 generates a three-dimensional model of the measurement object body 112 from a plurality of images with no missing view points and stores the generated three-dimensional model in the multi view-point image storage unit 104.
  • a multi view-point image composed of a great number of images necessary for generating a three-dimensional model for generating a three-dimensional model of an object can be generated.
  • a multi view-point image sufficient for the generation of the three-dimensional model can be automatically obtained.
  • an example of a multi view-point image generation method used for generating a three-dimensional model of a measurement object body is described.
  • the present invention can be applied to an apparatus comprising a single device or to system constituted by a plurality of devices.
  • the invention can be implemented by supplying a software program, which implements the functions of the foregoing embodiments, directly or indirectly to a system or apparatus, reading the supplied program code with a computer of the system or apparatus, and then executing the program code.
  • a software program which implements the functions of the foregoing embodiments
  • reading the supplied program code with a computer of the system or apparatus, and then executing the program code.
  • the mode of implementation need not rely upon a program.
  • the program code installed in the computer also implements the present invention.
  • the claims of the present invention also cover a computer program for the purpose of implementing the functions of the present invention.
  • the program may be executed in any form, such as an object code, a program executed by an interpreter, or scrip data supplied to an operating system.
  • Example of storage media that can be used for supplying the program are a floppy disk, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a CD-RW, a magnetic tape, a non-volatile type memory card, a ROM, and a DVD (DVD-ROM and a DVD-R).
  • a client computer can be connected to a website on the Internet using a browser of the client computer, and the computer program of the present invention or an automatically-installable compressed file of the program can be downloaded to a recording medium such as a hard disk.
  • the program of the present invention can be supplied by dividing the program code constituting the program into a plurality of files and downloading the files from different websites.
  • a WWW World Wide Web
  • a storage medium such as a CD-ROM
  • an operating system or the like running on the computer may perform all or a part of the actual processing so that the functions of the foregoing embodiments can be implemented by this processing.
  • a CPU or the like mounted on the function expansion board or function expansion unit performs all or a part of the actual processing so that the functions of the foregoing embodiments can be implemented by this processing.

Abstract

A multi view-point image composed of a great number of images according to a shape of an object is generated or an information processing method used for generating a three-dimensional model or performing image processing of arbitrary view-point object recognition is provided, and based on a plurality of captured images obtained by imaging of the object from a plurality of view points by an imaging means, a relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points is calculated, and based on the calculated plurality of relative positions and orientations, a missing position and orientation of the imaging means in a direction in which imaging by the imaging means is missing is calculated, and an image used for displaying the calculated missing position and orientation on a display means is generated.

Description

INFORMATION PROCESSING METHOD AND INFORMATION PROCESSING APPARATUS
The present invention relates to a method for generating a multi view-point image composed of a great number of images of a three-dimensional object captured from a plurality of different view points.
In generating a three-dimensional model by using an image which is obtained by imaging a three-dimensional object, a great number of images captured from various view points are necessary. Further, with respect to recognition of a three-dimensional object by using an image which is obtained by imaging the three-dimensional object from an arbitrary view point or with respect to learning of a classifier that recognizes the three-dimensional object, it is useful to use a great number of images captured from various view points. The learning of the classifier is to set or adjust a parameter of the classifier by using a great amount of data that continually increases according to learning. According to learning, optimum control is realized while adjustment is being made. Various methods have been proposed so far as a method for acquiring a great number of images obtained by imaging an object from various view points.
Japanese Patent Laid-Open No. 2007-72537 discusses an imaging apparatus including a plurality of imaging apparatuses and a rotation base where an object is mounted. The imaging apparatus references a feature point of the object. According to such an apparatus, the orientation of the object can be changed and a complete periphery image of the object can be captured.
Japanese Patent Laid-Open No. 2004-139294 discusses a multi view-point image processing program using a plurality of markers each of which serves as a reference of a view point position. According to such a method, as is with patent literature 1, without using a large-scale imaging apparatus, images of an object captured according to imaging from various view points can be acquired.
Regarding generation of a highly-accurate three-dimensional model or learning of a classifier that performs arbitrary view-point object recognition, it is desirable if a great number of images can be acquired which is performed by imaging the object evenly from various view points. However, with respect to the above-described method, an optimum method for evenly imaging the object from various view points is not provided or a simple method for realizing such imaging is not provided.
Further, with respect to generation of an improved three-dimensional model or improved learning of a classifier that performs arbitrary view-point object recognition, it is useful if the image is acquired according to a shape of the object. For example, it is useful if an image of a portion whose structure is complex is acquired in detail from a number of view points. However, the portion of the view points which should be densely arranged in capturing a portion of the object is not clear. Further, even if a portion where view points should be densely arranged is given, a method for simply acquiring such an image is not provided.
Japanese Patent Laid-Open No. 2007-72537 Japanese Patent Laid-Open No. 2004-139294 Japanese Patent Laid-Open No. 2007-156528 Japanese Patent Laid-Open No. 2000-194859
V.Lepetit, F.Moreno-Noguer, P.Fua "EPnP: An Accurate O(n) Solution to the PnP Problem", International Journal of Computer Vision, Vol.81, No.2, pp.155-166, 2008 G.Reitmayr, T.W.Drummond, "Going out: Robust Model-based Tracking for Outdoor Augmented Reality", IEEE/ACM International Symposium on Mixed and Augmented Reality, pp.109-118, 2006 Shingo Ando, Yoshinori Kusachi, Akira Suzuki, Kenichi Arakawa, "Pose Estimation of 3D Object Using Support Vector Regression", Journal of The Institute of Electronics, Information and Communication Engineers D, Vol.J89-D, pp.1840-1847, 2006 Hiroshi Murase, S.K.Nayar, "3D Object Recognition from Appearance: Parametric Eigenspace Method", The Institute of Electronics, Information and Communication Engineers D-II, Vol.J77-D-2(11), pp.2179-2187, 1994 S.M.Seitz, C.R.Dyer, "View Morphing", Proceedings of SIGGRAPH 96, pp.21-30, 1996
In the present invention, an image which is missing in object recognition is presented by using a great number of images obtained by imaging an object from various view points.
In accordance with a first aspect of the present invention, an information processing apparatus includes a relative position and orientation calculation means configured to calculate, based on a plurality of images captured by imaging an object from a plurality of view points by an imaging means, a relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points, a missing position and orientation calculation means configured to calculate a missing position and orientation of the imaging means from which imaging by the imaging means is missing based on the calculated plurality of relative positions and orientations, and a missing view-point image generation means configured to generate an image used for displaying the calculated missing position and orientation on a display means.
In accordance with another aspect of the present invention, an information processing method performed by an information processing apparatus includes a relative position and orientation calculation means of the information processing apparatus calculating, based on a plurality of images captured by imaging an object from a plurality of view points by an imaging means, a relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points, a missing position and orientation calculation means of the information processing apparatus calculating a missing position and orientation of the imaging means from which imaging by the imaging means is missing based on the calculated plurality of relative positions and orientations, and a missing view-point image generation means of the information processing apparatus generating an image used for display of the calculated missing position and orientation on a display means.
In accordance with a still further aspect of the present invention, a program causes a computer to execute each step of one of the video information processing methods described above.
In accordance with another aspect of the present invention, a recording medium stores a program causing a computer to execute each step of one of the video information processing methods described above.
Further features of the present invention will be apparent from the following description of exemplary embodiments with reference to the attached drawings.
Fig. 1 is a configuration diagram according to a first exemplary embodiment. Fig. 2 illustrates a definition of a measurement coordinate system according to the first exemplary embodiment. Fig. 3 illustrates definition of a camera coordinate system and an image coordinate system according to the first exemplary embodiment. Fig. 4 is a flowchart illustrating processing according to the first exemplary embodiment. Fig. 5 is a flowchart illustrating processing performed by a missing position and orientation calculation unit according to the first exemplary embodiment. Fig. 6 illustrates an example of an image displayed on a display unit according to the first exemplary embodiment. Fig. 7 is a configuration diagram according to a second exemplary embodiment. Fig. 8 is a flowchart illustrating processing according to the second exemplary embodiment. Fig. 9A illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment. Fig. 9B illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment. Fig. 9C illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment. Fig. 9D illustrates an example of an image processed by an image normalization unit according to the second exemplary embodiment. Fig. 10 is a flowchart illustrating processing performed by the missing position and orientation calculation unit according to the second exemplary embodiment. Fig. 11 is a configuration diagram according to a third exemplary embodiment. Fig. 12 is a flowchart illustrating processing according to the third exemplary embodiment. Fig. 13 illustrates an example of missing view point determination based on comparison of two images. Fig. 14 illustrates an example of missing view point determination based on an orientation estimation result. Fig. 15 is a flowchart illustrating processing performed by the missing position and orientation calculation unit according to the third exemplary embodiment.
It should be noted that the relative arrangement of the components, the numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present invention unless it is specifically stated otherwise.
Exemplary Embodiments of the present invention will now be described in detail below with reference to the accompanying drawings.
According to a first exemplary embodiment, a multi view-point image is generated from a great number of images obtained by evenly imaging an object being a measurement object from various view points.
Fig. 1 is a configuration diagram according to the present embodiment. The present embodiment includes an imaging unit 101, an image input unit 102, a relative position and orientation calculation unit 103, a multi view-point image storage unit 104, a missing position and orientation calculation unit 105, a missing view-point image generation unit 106, and a display unit 107. Further, on a measurement base 111 set in the real space, an object 112 being a measurement object is set on a support 113. A plurality of markers 121, 122, . . . , 128 are arranged in the periphery of the object 112. Further, the eight markers of the markers 121 to 128 correspond to color 1, color 2, . . . , color 8 respectively.
The imaging unit 101 performs imaging of the object 112. For example, imaging of the object 112 mounted on the support 113 is performed by a camera operated by a user. The imaging unit 101 can externally output the image which has been captured. For example, the imaging unit is a digital still camera or a camcorder including an image sensor, such as a charge-coupled device (CCD), and a lens. The captured image is transmitted to the image input unit 102 when, for example, the user presses a shutter button of the digital still camera.
The image input unit 102 inputs the image transmitted from the imaging unit 101. The image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103.
The relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object, and the imaging unit 101. The relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104.
The multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed. The storing operation is repeated until the number of the imaging orientation data pieces reaches a predetermined number. The recorded imaging orientation data is transmitted to the missing position and orientation calculation unit.
The missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage based on a great amount of imaging orientation data obtained by imaging up to the current stage and stored in the multi view-point image storage unit 104.
The missing view-point image generation unit 106 generates an image showing the view point which is missing from the imaging orientation data of the missing view points calculated by the missing position and orientation calculation unit 105.
The display unit 107 displays the display image generated by the missing view-point image generation unit 106.
Fig. 2 illustrates a definition of a measurement coordinate system according to the present embodiment. The support 113 is where a measurement object is mounted. The support 113 is provided on the measurement base 111. The point of origin of the measurement coordinate system is a contact position 200 where the support 113 contacts the measurement base 111, and the axes are an x-axis 201, a y-axis 202, and a z-axis 203.
When the image captured by the imaging unit 101 is processed, the markers are uniquely identified. For example, if the markers have different colors and their three-dimensional positions are known, a color unique to each marker is extracted from the image captured by the imaging unit 101. According to this processing, a marker in the image can be identified and the position of the marker in the image can be detected. A three-dimensional position of each marker is fixed and a center position of the markers in the measurement coordinate system is measured in advance.
Fig. 3 illustrates a definition of a camera coordinate system and an image coordinate system. As is with the measurement coordinate system, a three-dimensional camera coordinate system is defined with respect to the imaging unit 101. Further, an image plane 302 which is a virtual plane corresponding to the captured image is defined, and a two-dimensional image coordinate system of the image plane is defined. A principal point of the lens of the imaging unit 101 in the camera coordinate system is a point of origin 310. The direction corresponding to the right direction with respect to the captured image is defined as an x'-axis 311, the direction corresponding to the upper direction is defined as a y'-axis 312, and the axis parallel to the optical axis of the lens and in a direction opposite to an imaging direction 301 of the imaging unit 101 is defined as a z'-axis 313. Further, the image coordinate system is defined, as is illustrated in Fig. 3, as a two-dimensional coordinate system. In this image coordinate system, the center of an image plane 302 is defined as a point of origin 320, the direction corresponding to the right direction of the image is defined as an x"-axis 321, and the direction corresponding to the upper direction is defined as a y"-axis 322.
Fig. 4 is a flowchart illustrating processing according to the present embodiment. An example of a multi view-point image generation method will be described.
In step S401, the image input unit 102 inputs an image transmitted from the imaging unit 101 to the relative position and orientation calculation unit 103. The image obtained by imaging the object 112 needs to include at least four markers in the image. The image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103 and the processing proceeds to step S402.
In step S402, the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101. In other words, by processing the image transmitted from the image input unit 102, the relative position and orientation calculation unit 103 measures a position and orientation of the imaging unit when the imaging unit captured the image.
When the image is transmitted, the relative position and orientation calculation unit extracts the colors 1 to 8, each of which is unique to each marker, and determines whether each marker is included in the image. To be more precise, a hue is obtained from RGB values of each pixel and if the obtained hue and the hue of the color to be extracted are within a predetermined range, that pixel is extracted. If a pixel of a corresponding color is extracted from the image, it is determined that a marker corresponding to that color is included in the image. Then, a barycentric position (position (x", y") in the above-described image coordinate system) with respect to the image of the pixel corresponding to the color of the marker is obtained. According to this processing, positions of four or more markers, whose three-dimensional positions in the measurement coordinate system are known, in the image coordinate system can be obtained. Further, if a focal length of the lens is known as f [mm], according to a technique discussed in nonpatent literature 1, a relative position and orientation of the camera coordinate system with respect to the measurement coordinate system can be obtained.
A relative position and orientation of two three-dimensional coordinate systems is expressed by two components of the coordinate system, which are a translational component and a rotational component. The translational component is a translational mobile component between two points of origin of the two three-dimensional coordinates and is expressed by a translational mobile component (Tx, Ty, Tz) from a point of origin 40 of the measurement coordinate system to the point of origin 310 of the camera coordinate system. The rotational component is expressed by an axis of rotation and an amount of rotation with respect to the axis of rotation. To be more precise, the relative position and orientation of the two three-dimensional coordinate systems is expressed by the amount of rotation when the x-axis 201, the y-axis 202, and the z-axis 203 of the measurement coordinate system are in the same directions as the x'-axis 311, the y'-axis 312, and the z'-axis 313 of the camera coordinate system respectively. In other words, the measurement coordinate system is expressed by a direction of the axis of rotation (Rx, Ry, Rz) and a rotation angle "theta" when the measurement coordinate system is rotated on the axis of rotation which passes through the origin of the measurement coordinate system. Since the rotational component consists of four parameters and the parameters are normalized so that (Rx, Ry, Rz) that express the direction of the axis of rotation satisfy Rx2 + Ry2 + Rz2 = 1, the actual degree of freedom will be the same as the degree of freedom of the rotation. The measured relative position and orientation is expressed by the translational component (Tx, Ty, Tz) and the rotational component between the coordinate systems. The rotational component is expressed by the direction of the axis of rotation (Rx, Ry, Rz) and the rotation angle "theta". The relative position and orientation is used as the imaging orientation data.
The relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104 and then the processing proceeds to step S403.
In step S403, the multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed. In other words, the image of the object 112 which has been captured by the user and the position and orientation of the imaging unit 101 when the image has been captured are stored in the multi view-point image storage unit 104. When they are stored, they are associated to each other. This process is repeated until the number of the data pieces of the imaging orientation data stored in association reaches a predetermined number, and then the processing proceeds to step S404. The imaging is repeated until, for example, about 60 pieces of data are obtained.
In step S404, the missing position and orientation calculation unit 105 calculates the imaging-missing view point at the current stage based on a great amount of imaging orientation data captured by that time and stored in the multi view-point image storage unit 104.
Fig. 5 is a flowchart illustrating processing performed by the missing position and orientation calculation unit. Details of the actual processing performed by the missing position and orientation calculation unit 105 in step S404 will be described.
Step S501 is an approximately uniform direction calculation step. In this step, virtual view points are approximately uniformly arranged on a sphere in a virtual three-dimensional space having the object 112 at the center. To be more precise, a virtual coordinate system is set in a virtual three-dimensional space, and it is assumed that a sphere with radius 1 having the point of origin of the virtual coordinate system at the center is provided. Points of a predetermined number M are approximately uniformly arranged on the surface of the sphere. Then, a direction in which the imaging is possible is calculated
For example, according to a technique discussed in Japanese Patent Laid-Open No. 2007-156528, the points can be approximately uniformly arranged on the surface of the sphere. The points of the predetermined number M approximately uniformly arranged on the surface of the sphere may be an arbitrary number from, for example, 20 to about 100, and M = 60 is employed in the present embodiment. Thus, 60 points are arranged on the surface of the sphere. Positions of these points in the virtual coordinate system in the virtual three-dimensional space are set as (Pxi, Pyi, Pzi)(i = 1, 2, . . . , 60).
Step S502 is a similar direction determination step. According to this step, it is determined whether a point not yet used for imaging in a similar orientation exists. To be more precise, it is determined whether imaging in the orientation similar to the imaging orientation in the virtual three-dimensional space is already performed when it is assumed that the imaging is performed with an angle toward the center of the sphere from each point. First, it is simply assumed that the imaging unit 101 is arranged such that the imaging is performed with an angle toward the center of the sphere from each point. Then, the direction of an z'-axis 313 of the camera coordinate system illustrated in Fig. 3 in the virtual coordinate system at that time is obtained. Since the imaging is performed with an angle toward the center of the sphere from a point on the surface of the sphere, in other words, since the negative direction of the z'-axis 313 of the camera coordinate system passes through the center of the sphere, this direction will be the position of each point in the virtual coordinate system. For example, if the imaging unit 101 is arranged such that the image of the center of the sphere is captured from a certain point (Pxi, Pyi, Pzi), the direction of the z'-axis 313 of the camera coordinate system in the virtual coordinate system will be simply (Pxi, Pyi, Pzi). Then, from this direction and a plurality of imaging orientations whose imaging is already finished, a direction with respect to the measurement coordinate system having highest similarity with the direction of the z'-axis 313 of the camera coordinate system is searched and the similarity is obtained. An inner product of each of the two directions can be used in determining the similarity. Further, with respect to each imaging orientation whose imaging is already finished, a direction (Qx, Qy, Qz) of the z'-axis 313 in the camera coordinate system of the measurement coordinate system is obtained. If the rotational components of the imaging orientation data are the direction of the axis of rotation (Rx, Ry, Rz) and the rotation angle "theta", the direction (Qx, Qy, Qz) can be obtained from the following equation (1).
Figure JPOXMLDOC01-appb-I000001
In other words, inner product of the direction (Pxi, Pyi, Pzi) corresponding to the certain point arranged approximately uniformly on the surface of the sphere and the direction (Qx, Qy, Qz) acquired according to the above-described equation (1) by using a plurality pieces of imaging orientation data stored in the multi view-point image storage unit 104 is obtained. From the obtained inner products, an inner product with the largest value is obtained.
Then, it is determined whether the obtained greatest inner product value is equal to or greater than a predetermined reference value. If the greatest inner product value is equal to or greater than the predetermined reference value, it is determined that the imaging from the view point corresponding to the point arranged on the surface of the sphere is sufficient. If the greatest inner product value is equal to or smaller than the predetermined reference value, it is determined that the imaging from the view point corresponding to the point is missing. The predetermined reference value used for the determination can be an arbitrary value smaller than 1. For example, a value, 0.8, is used.
Step S503 is a dissimilar direction calculation step. In this step, as an imaging-missing view point, imaging orientation when the imaging is performed with an angle toward the center of the sphere from a point on the surface of the sphere is obtained. To be more precise, if it is determined that imaging is missing with respect to a certain point on the surface of the sphere, a direction (Pxi, Pyi, Pzi) corresponding to the point is determined as the imaging orientation corresponding to the imaging-missing view point. According to such processing, an imaging-missing view point is calculated.
In step S405, it is determined whether a missing position and orientation exists. If it is determined that imaging from view points corresponding to all the points arranged approximately uniformly on the surface of the sphere is sufficient (NO in step S405), the missing position and orientation calculation unit 105 determines that imaging of the measurement object body is evenly performed from various view points, and the processing ends.
On the other hand, if it is determined that imaging from a view point corresponding to one of the points arranged approximately uniformly on the surface of the sphere is missing (YES in step S405), the data (Pxi, Pyi, Pzi) corresponding to the imaging orientation is transmitted to the missing view-point image generation unit 106, and the processing proceeds to step S406.
In step S406, the missing view-point image generation unit 106 generates an image presenting the missing view point based on the data of the imaging orientation transmitted from the missing position and orientation calculation unit 105.
Fig. 6 illustrates an example of an image displayed on the display unit 107. An arrow 601 corresponds to an imaging orientation from an imaging-missing view point transmitted from the missing position and orientation calculation unit 105. The arrow 601 is on a dotted line 602. The dotted line 602 connects the point of origin 40 of the measurement coordinate system and the data (Pxi, Pyi, Pzi) regarding imaging orientation from the imaging-missing view point. According to the arrow 601, the missing view point can be confirmed. The missing view point can be expressed by a form other than an arrow so long as the user can understand the missing view point.
The user that performs the imaging of the measurement object body performs imaging from the missing view point displayed on the display unit 107. An image captured from the view point corresponding to the arrow 601 is transmitted from the imaging unit 101 to the image input unit 102, and the processing returns to step S401. In this manner, a multi view-point image composed of a great number of images obtained by evenly performing imaging from various view points is stored and a multi view-point image composed of a great number of images obtained by evenly performing imaging from various view points can be generated.
In step S407, the display unit 107 displays the image generated by the missing view-point image generation unit 106. In other words, the display unit 107 displays a position to which the imaging unit 101 is to be set and imaging of the object 112 is to be performed. The images of the object 112 captured evenly from various view points are stored in the multi view-point image storage unit 104, and the processing is continued until there are no missing view points.
According to the first exemplary embodiment, an example of a method for generating a multi view-point image composed of a great number of images of the measurement object body 112 captured evenly from various view points has been described.
A multi view-point image composed of a great number of images of the measurement object body 112 captured evenly from various view points is generated. However, it may be more useful to minutely perform imaging from an area in the vicinity of a predetermined view point depending on the measurement object. For example, with respect to a view point used in imaging a complex portion of the three-dimensional structure of the measurement object body, it is desirable to minutely perform the imaging compared to imaging of a portion having a simple structure. For example, by inputting data of a three-dimensional model of the measurement object body in advance and analyzing the data, a complex portion of the three-dimensional structure of the measurement object body is obtained. For example, if the measurement object body is a polygon, the complexity of the structure can be evaluated according to the number of sides of the polygon. Then, a view point used for imaging the complex portion of the three-dimensional structure of the measurement object body is obtained. Subsequently, out of a great number of points arranged approximately uniformly on the surface of a virtual sphere, a point in the vicinity of a point corresponding to a view point used for imaging the object body where the structure is complex is additionally generated. In this manner, the density of the points on the surface of the sphere in the vicinity of the view point that is used for imaging the complex portion of the structure will be higher. By using the plurality of points arranged on the surface of the virtual sphere, detailed images from view points used for capturing a complex portion of the structure can be captured.
In step S402, the relative position and orientation of the measurement object body and the imaging unit 101 is directly obtained. The method for obtaining the position and orientation of the imaging unit 101 is not limited to using a marker. For example, the position and orientation can be obtained in a way different from a captured image. By inputting a three-dimensional model of the measurement object body in advance and by comparing the characteristics when the three-dimensional model is projected on a two-dimensional image and the characteristics of the captured image, a relative position and orientation can be obtained. Further, the relative position and orientation of the imaging unit 101 and the object 112 can be obtained by estimating the relative position and orientation of the imaging unit and the object 112 from a three-dimensional model by using a technique discussed in nonpatent literature 2.
According to a second exemplary embodiment, an information processing method used for generating a multi view-point image used by a classifier will be described. When an image of an object is input, the classifier recognizes the orientation of the object. The information processing method is used for generating the multi view-point image which is used for the learning of the classifier.
Fig. 7 is a configuration diagram according to the present embodiment. The present embodiment includes the imaging unit 101, the image input unit 102, the relative position and orientation calculation unit 103, an image normalization unit 701, the multi view-point image storage unit 104, a classifier learning unit 702, the missing position and orientation calculation unit 105, the missing view-point image generation unit 106, and the display unit 107. Further, on the a measurement base 111 set in the real space, the object 112 which is a measurement object is set on the support 113. In the periphery of the object 112, a plurality of markers 121, 122, . . . , 128 are arranged. Further, the eight markers of the markers 121 to 128 correspond to the color 1, color 2, . . . , color 8 respectively. Further, the color 9 is set for regions other than the regions of the markers. The object 112 which is a measurement object does not include the colors 1 to 9. In the following description, regarding the above-described configuration, components different from those of the first exemplary embodiment are described and description of similar portions are not repeated.
The imaging unit 101 performs imaging of the object 112. The captured image is transmitted to the image input unit 102.
The image input unit 102 inputs the image transmitted from the imaging unit 101 to the relative position and orientation calculation unit 103. The image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103.
The relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101. The relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104.
The image normalization unit 701 normalizes an image of the object 112 which is the measurement object into a unified format. To be more precise, an image of the measurement object is clipped from the input image. Then a gray-scale image of the clipped image is generated and the generated image is normalized into a predetermined size.
The multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed. The storing operation is repeated until the number of the imaging orientation data pieces reaches a predetermined number. The recorded imaging orientation data is transmitted to the classifier learning unit 702.
Based on a great number of images and imaging orientation data stored in the multi view-point image storage unit 104, the classifier learning unit 702 performs learning of the classifier that estimates the orientation of the object 112 which is included in the image.
The missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage based on a great amount of imaging orientation data obtained by imaging by the current stage according to a result of estimation performed by the classifier which has performed learning according to the classifier learning unit 702.
The missing view-point image generation unit 106 generates an image showing the view point which is missing from the imaging orientation data of the missing view points calculated by the missing position and orientation calculation unit 105.
The display unit 107 displays the display image generated by the missing view-point image generation unit 106.
Fig. 8 is a flowchart illustrating processing flow of a multi view-point image generation method according to the present embodiment.
In step S801, the image input unit 102 inputs the image transmitted from the imaging unit 101. The image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103 and then the processing proceeds to step S802.
In step S802, the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101.
In step S803, the image normalization unit 701 normalizes an image of the object 112 which is the measurement object into a unified format. To be more precise, an image of the object 112 which is the measurement object is clipped from the input image. Then a gray-scale image of the clipped image is generated and the generated image is normalized into a predetermined size.
Fig. 9 illustrates an example of an image processed by the image normalization unit 701. Details of the actual processing performed by the image normalization unit 701 in step S803 will be described. Fig. 9A illustrates an input image. Pixels of colors other than colors 1 to 8 being colors of the markers and the color 9 being the color assigned to the regions other than the regions of the markers in the input image, are extracted. As a result of the extraction, an image that extracts only the region of the object 112 which is the measurement object in Fig. 9B is generated. From this image, a circumscribing square region 9 is extracted and clipped as illustrated in Fig. 9C. Then, the color image of the clipped square region is changed into a gray scale image. As illustrated in Fig. 9D, the size of the image is normalized by expanding or reducing the image so that the width and the height of the clipped region is a predetermined length w pixels. An arbitrary value can be used for the predetermined length w pixels. By extracting a region whose color is different from the colors of the markers and the color other than the measurement system and is set to one color, the image normalization unit 701 clips the region of the object 112 which is the measurement object. However, the present invention is not limited to such a method and a different method can be used so long as a region of an object which is a measurement object can be clipped.
In step S804, the multi view-point image storage unit 104 stores the image which has undergone the normalization processing in association with the imaging orientation data calculated by the relative position and orientation calculation unit 103.
In step S805, the classifier learning unit 702 performs learning of the classifier that estimates the orientation of the object 112 included in the image based on a great number of images and imaging orientation data stored in the multi view-point image storage unit 104. With respect to the learning of the classifier, a plurality of images which have undergone processing such as object clipping performed by a clipping processing unit 702 and imaging orientation data corresponding to each of the images, which are stored in the multi view-point image storage unit 104, are used as learning data. For example, this classifier can be used for the estimation of orientation of a three-dimensional object of the support vector regression method discussed in nonpatent literature 3. The values predicted by the classifier use three values, that is, roll, pitch, and yaw as estimated values of the orientation of the object 112. To be more precise, the direction of the axis of rotation (Rx, Ry, Rz) and the value of the rotation angle "theta" that represent the rotational component of the imaging orientation data are estimated.
For example, by using a great number of images (gray scale images of 100 x 100 pixels) stored in the multi view-point image storage unit 104 and according to main component analysis, an eigenspace of the image group is obtained. Although the eigenspace is of an arbitrary dimension, it is desirable to determine the dimension using a cumulative contribution ratio. For example, a number of dimensions where the cumulative contribution ratio is 90% or greater is used as a number of dimensions d (<100 x 100) in the eigenspace. Then, data generated by projecting each of the great number of images which are stored, to an eigenspace of d-dimension is used as input data. The direction of the axis of rotation (Rx, Ry, Rz) that indicates a rotational component of the imaging orientation data corresponding to each of the great number of images and the rotation angle "theta" are used as target values when the learning of the support vector regression method is performed. If a clipped image of the object 112 by the image normalization unit is input, in the classifier based on the support vector regression method, an orientation of the measurement object body 112 is output. The classifier learning unit 702 is not limited to a classifier using the support vector regression method. For example, a classifier that estimates the orientation of the object 112 from an image of the object 112 which has been captured such as a parametric eigenspace method discussed in nonpatent literature 4 can also be used.
In step S806, the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage based on estimation capability of the classifier at the time the classifier has performed learning according to a classifier learning unit 702. For example, a predetermined number of points are approximately uniformly arranged on the surface of the sphere in the virtual three-dimensional space. A plurality of view points, from which imaging is performed with an angle toward the center of the sphere, is prepared. The plurality of view points are hereinafter referred to as virtual view points. Then, regarding each of the virtual view points, out of the plurality pieces of imaging orientation data stored in the multi view-point image storage unit 104, two view points closest to the virtual view point are selected. These view points are hereinafter referred to as imaging-completed adjacent view points. An inner product value in the z'-axis direction in the camera coordinate system indicates the proximity of the view point. In this manner, the imaging-missing view point is calculated.
Fig. 10 is a flowchart illustrating processing performed by the missing position and orientation calculation unit. Details of actual processing performed by a missing position and orientation calculation unit 123 in step S806 will be described.
In step S1001, a predetermined number of points are approximately uniformly arranged on the surface of the sphere in the virtual three-dimensional space. A plurality of view points, from which imaging is performed with an angle toward the center of the sphere, is prepared. The plurality of view points are hereinafter referred to as virtual view points. Then, regarding each of the virtual view points, out of the plurality pieces of imaging orientation data stored in the multi view-point image storage unit 104, two view points closest to the virtual view point are selected. These view points are hereinafter referred to as imaging-completed adjacent view points. Regarding the proximity of the view point, it may be determined that the higher the value of the inner product in the z'-axis direction in the camera coordinate system, the closer the view point. Then, by using the image of the measurement object body 112 corresponding to the selected two imaging-completed adjacent view points, an estimated image of the measurement object body 112 when imaging is performed from a position corresponding to the virtual view point is generated. The estimated image is a virtual image of the measurement object body 112 when the measurement object body is observed from the above-described virtual view point. Any method can be used as the generation method of the estimated image so long as it can generate the above-described virtual image. Further, as is with the processing of the clip processing unit 702, a gray-scale image having the width and height of w pixels can be generated.
In step S1002, the image of the measurement object body 112 which is captured from the generated virtual view point is input to the classifier of the classifier learning unit 702, and the result of the orientation estimation performed by the classifier is obtained. To be more precise, a composite image of a measurement object body corresponding to imaging from a certain virtual view point is prepared, and with respect to the composite image, the classifier that has undergone the learning is applied, and a result of the orientation estimation of the object in the composite image is calculated.
In step S1003, if the result of the orientation estimation is not similar to the imaging orientation of an imaging system 71 when the imaging of the object is performed from a corresponding virtual view point, then it is determined that imaging is missing. Whether the orientation and the imaging orientation are similar or not, can also be determined according to whether a difference between the imaging orientation corresponding to the virtual view point and the result of the orientation estimation is equal to or smaller than a predetermined value. The difference between the two orientations can be considered as a rotation angle of one orientation when it is rotated so that it matches the other orientation.
Fig. 14 is a schematic diagram of an example of missing view point determination based on the result of the orientation estimation.
For example, from two images 1401 and 1402 corresponding to the imaging-completed adjacent view points, an image 1403 captured from a virtual view point which is a point corresponding to the middle of them is generated by using, for example, morphing. Then, the image is input to a discriminant function (e.g., multivalued nonlinear kernel regression function) which outputs the orientation of the object (direction of the axis of rotation and rotation angle) learned by using the acquired image. Whether the imaging data taken from the middle of the view points is missing is determined based on whether the orientation of the object which is output (estimated value) is within a predetermined range. In other words, it is determined whether an estimated orientation output 1404 of the middle of the view points (output of the discriminant function) is correct as a middle of the view points, and the missing view points are calculated. A curve in Fig. 14 schematically shows values (multivalues) of the correct orientation. If the orientation estimated output 1404 is an output close to the curve as illustrated in Fig. 14, it is determined that an image captured from a view point corresponding to the virtual view point is not missing. If the orientation estimated output 1404 is greatly different from the curve, it is determined that an image captured from a view point corresponding to the virtual view point is missing for the learning of the classifier.
In step S1004, an imaging orientation corresponding to the virtual view point which has been determined that imaging from that point is missing is obtained. For example, the position (Pxi, Pyi, Pzi) on the surface of the virtual sphere is obtained as the imaging orientation corresponding to the virtual view point.
In step S807, it is determined whether a missing position and orientation exists. If it is determined that imaging from view points corresponding to all the points arranged approximately uniformly on the surface of the sphere is sufficient (NO in step S807), the missing position and orientation calculation unit 105 determines that a multi view-point image of the measurement object body 112 is sufficiently captured for the learning of the classifier, and the processing proceeds to step S810. On the other hand, if it is determined that imaging from a view point corresponding to one of the points arranged approximately uniformly on the surface of the sphere is missing (YES in step S405), the data (Pxi, Pyi, Pzi) of the imaging orientation corresponding to the virtual view point determined as missing is transmitted to the missing view-point image generation unit 106, and the processing proceeds to step S808.
In step S808, the missing view-point image generation unit 106 generates an image presenting the missing view point based on the data of the imaging orientation transmitted from the missing position and orientation calculation unit 105.
In step S809, the display unit 107 displays the image generated by the missing view-point image generation unit 106.
In step S810, data of the classifier in a state where there are no missing view points is stored in the multi view-point image storage unit 104. To be more precise, a number of dimensions of the eigenspace related to the eigenspace on which imaging is performed and a basis vector corresponding to the number of dimensions, a plurality of support vectors, and a plurality of linear combination coefficients corresponding to the plurality of support vectors are stored in the multi view-point image storage unit 104.
According to the present embodiment, from images of a measurement object body, a multi view-point image composed of images necessary in the learning of the classifier that estimates the orientation can be generated. Further, since the classifier that performs arbitrary view point object recognition performs the learning by using the multi view-point image, the orientation of the object can be estimated by using the classifier.
According to the second exemplary embodiment, an example of a method for generating a multi view-point image used for learning of a classifier that recognizes orientation of an object whose image is captured is described.
According to an image generated by the missing view-point image generation unit 106 displayed on the display unit 107, the user performs imaging from the missing view point. However, in stead of the user performing the imaging, the imaging unit 101 can be mounted on a robot arm or the like. Then, the robot arm or the like can be moved to the missing view point and the imaging can be performed from the missing view point. In this case, data of the imaging orientation corresponding to the virtual view point which is determined as missing by the missing position and orientation calculation unit 105 is transmitted to an operation control unit of the robot arm or the like. Then, the operation control unit moves the robot arm based on the transmitted data of the imaging orientation. When the movement is completed, imaging by the imaging unit 101 is performed. In this way, the imaging from the missing view point can be performed and a multi view-point image sufficient for the learning of the classifier can be automatically obtained.
According to a third exemplary embodiment, an example of an information processing method for generating a multi view-point image used for generating a three-dimensional model of a measurement object body will be described.
Fig. 11 is a configuration diagram according to the present embodiment. The present embodiment includes the imaging unit 101, the image input unit 102, the relative position and orientation calculation unit 103, an image normalization unit 701, the multi view-point image storage unit 104, a three-dimensional model generation unit 1101, the missing position and orientation calculation unit 105, the missing view-point image generation unit 106, and the display unit 107. Further, on the a measurement base 111 set in the real space, the object 112 which is a measurement object is set on the support 113. In the periphery of the object 112, a plurality of markers 121, 122, . . . , 128 are arranged. Further, the eight markers of the markers 121 to 128 correspond to the color 1, color 2, . . . , color 8 respectively. Further, the color 9 is set for regions other than the regions of the markers. The object 112 which is a measurement object does not include the colors 1 to 9. In the following description, regarding the above-described configuration, only components different from those of the first exemplary embodiment are described and description of similar portions are not repeated.
The imaging unit 101 performs imaging of the object 112. The captured image is transmitted to the processing unit 12.
The image input unit 102 inputs the image transmitted from the imaging unit 101 to the processing unit 12. The image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103.
The relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101. The relative position and orientation of the object 112 and the imaging unit 101 calculated by the relative position and orientation calculation unit 103 is transmitted to the multi view-point image storage unit 104.
The image normalization unit 701 clips only the object 112 which is a measurement object from the input image. The clipped image is transformed into a gray scale image and normalized into a predetermined size.
The multi view-point image storage unit 104 stores the imaging orientation data obtained by the relative position and orientation calculation unit 103 in association with the image according to which the measurement of the imaging orientation has been performed.
The three-dimensional model generation unit 1101 generates a three-dimensional model of the measurement object body by using the images of the measurement object body 112 captured from a plurality of view points and stored in the multi view-point image storage unit 104.
Based on the generated three-dimensional model, the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage according to a great amount of imaging orientation data obtained by imaging performed by the current stage.
The missing view-point image generation unit 106 generates an image to be displayed from the imaging orientation data of the missing view points calculated by the missing position and orientation calculation unit 105.
The display unit 107 displays the display image generated by the missing view-point image generation unit 106.
Fig. 12 is a flowchart illustrating a multi view-point image generation method according to the present embodiment.
In step S1201, the image input unit 102 inputs the image transmitted from the imaging unit 101. The image input by the image input unit 102 is transmitted to the relative position and orientation calculation unit 103, and then the processing proceeds to step S1202.
In step S1202, the relative position and orientation calculation unit 103 calculates a relative position and orientation of the object 112 which is a measurement object and the imaging unit 101.
In step S1203, the image normalization unit 701 clips only the object 112 which is the measurement object from the input image. The clipped image is transformed into a gray scale image and normalized into a predetermined size.
In step S1204, the multi view-point image storage unit 104 stores the image whose object is clipped and has undergone the normalization processing in association with the imaging orientation data obtained by the relative position and orientation calculation unit 103.
In step S1205, the three-dimensional model generation unit 1101 generates a three-dimensional model of the measurement object body by using the images of the measurement object body 112 captured from a plurality of view points and stored in the multi view-point image storage unit 104. For example, by using a technique discussed in patent literature 4, a three-dimensional model of the object 112 captured from a plurality of view points is generated.
In step S1206, based on the three-dimensional model generated by the three-dimensional model generation unit 1101, the missing position and orientation calculation unit 105 calculates an imaging-missing view point at the current stage.
Fig. 15 is a flowchart illustrating processing performed by the missing position and orientation calculation unit. Details of the actual processing performed by the missing position and orientation calculation unit 105 in step S1206 will be described.
In step S1501, first, a predetermined number of points are approximately uniformly arranged on the surface of a sphere in the virtual three-dimensional space and a virtual view point corresponding to each of such point is provided. Subsequently, two imaging-completed adjacent view points are selected for each virtual view point. Then, by using an image of the measurement object body 112 corresponding to the selected two imaging-completed adjacent view points, an estimated image of the measurement object body 112 is generated in a case where the imaging of the measurement object body is performed from a position corresponding to the virtual view point. The estimated image can also be generated, for example, by using view morphing method discussed in nonpatent literature 5.
In step S1502, a rendering image of the three-dimensional model which is generated by using the multi view-point image obtained by the current stage is generated. The rendering image of the three-dimensional model is generated by rendering the three-dimensional model generated by the three-dimensional model generation unit 1101 under a condition that the three-dimensional model is observed from each virtual view point. The rendering is a 3D-CG technique.
In step 1503, the estimated image and the rendering image corresponding to each virtual point are compared. If similarity of the two images is low, it is determined that imaging from that virtual view point is missing. The comparison of the two images is based on, for example, comparison of normalization correlation. If the value of the normalization correlation is equal to or lower than a predetermined value, it is determined that the similarity of the two images is low.
Fig. 13 is a schematic diagram of an example of the missing view point determination based on the comparison of two images.
For example, from two images 1301 and 1302 corresponding to the imaging-completed adjacent view points, an image 1303 captured from a virtual view point which is a point corresponding to the middle of them is generated by using, for example, a view-point morphing method. Further, a rendering image 1304 is generated by rendering the three-dimensional model generated by using the multi view-point images obtained by the current stage according to projection under the condition that the observation is made from the middle of the view points. Then, by comparing the two images, based on whether their similarity (e.g., normalization correlation value of the result of the edge extraction) is equal to or greater than a predetermined value, whether imaging data of that middle of the view points is missing or not is determined. In other words, if a difference 1305 between the estimated image 1303 which is generated according to the view point morphing method or the like and the rendering image 1304 is great, it is determined that a corresponding view point is missing.
In step S1504, an imaging orientation corresponding to the virtual view point about which it has been determined that imaging from that point is missing is obtained. For example, the position (Pxi, Pyi, Pzi) on the surface of the virtual sphere is obtained as the imaging orientation corresponding to the virtual view point.
In step S1207, it is determined whether a missing position and orientation exists. If it is determined that imaging from view points corresponding to all the points arranged approximately uniformly on the surface of the sphere is sufficient (NO in step S807), the missing position and orientation calculation unit 105 determines that imaging of a multi view-point image of the measurement object body 112 is sufficiently captured for the generation of the three-dimensional model, and the processing proceeds to step S1210. On the other hand, if it is determined that the imaging from a view point corresponding to one of the points arranged approximately uniformly on the surface of the sphere is missing (YES in step S405), the data (Pxi, Pyi, Pzi) of the imaging orientation is transmitted to the missing view-point image generation unit 106, and the processing proceeds to step S1208.
In step S1208, the missing view-point image generation unit 106 generates an image indicating the missing view point based on the data of the imaging orientation sent from the missing position and orientation calculation unit 105.
In step S1209, the display unit 107 displays the image generated by the missing view-point image generation unit 106.
In step S1210, the three-dimensional model generation unit 1101 generates a three-dimensional model of the measurement object body 112 from a plurality of images with no missing view points and stores the generated three-dimensional model in the multi view-point image storage unit 104.
A multi view-point image composed of a great number of images necessary for generating a three-dimensional model for generating a three-dimensional model of an object can be generated.
By mounting the imaging unit 101 on a robot arm or the like and moving the robot arm or the like to a missing view point and performing the imaging from the missing view point, a multi view-point image sufficient for the generation of the three-dimensional model can be automatically obtained.
According to the third exemplary embodiment, an example of a multi view-point image generation method used for generating a three-dimensional model of a measurement object body is described.
Other Exemplary Embodiment
Note that the present invention can be applied to an apparatus comprising a single device or to system constituted by a plurality of devices.
Furthermore, the invention can be implemented by supplying a software program, which implements the functions of the foregoing embodiments, directly or indirectly to a system or apparatus, reading the supplied program code with a computer of the system or apparatus, and then executing the program code. In this case, so long as the system or apparatus has the functions of the program, the mode of implementation need not rely upon a program.
Accordingly, since the functions of the present invention are implemented by a computer, the program code installed in the computer also implements the present invention. In other words, the claims of the present invention also cover a computer program for the purpose of implementing the functions of the present invention.
In this case, so long as the system or apparatus has the functions of the program, the program may be executed in any form, such as an object code, a program executed by an interpreter, or scrip data supplied to an operating system.
Example of storage media that can be used for supplying the program are a floppy disk, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a CD-RW, a magnetic tape, a non-volatile type memory card, a ROM, and a DVD (DVD-ROM and a DVD-R).
As for the method of supplying the program, a client computer can be connected to a website on the Internet using a browser of the client computer, and the computer program of the present invention or an automatically-installable compressed file of the program can be downloaded to a recording medium such as a hard disk. Further, the program of the present invention can be supplied by dividing the program code constituting the program into a plurality of files and downloading the files from different websites. In other words, a WWW (World Wide Web) server that downloads, to multiple users, the program files that implement the functions of the present invention by computer is also covered by the claims of the present invention.
It is also possible to encrypt and store the program of the present invention on a storage medium such as a CD-ROM, distribute the storage medium to users, allow users who meet certain requirements to download decryption key information from a website via the Internet, and allow these users to decrypt the encrypted program by using the key information, whereby the program is installed in the user computer.
Besides the cases where the aforementioned functions according to the embodiments are implemented by executing the read program by computer, an operating system or the like running on the computer may perform all or a part of the actual processing so that the functions of the foregoing embodiments can be implemented by this processing.
Furthermore, after the program read from the storage medium is written to a function expansion board inserted into the computer or to a memory provided in a function expansion unit connected to the computer, a CPU or the like mounted on the function expansion board or function expansion unit performs all or a part of the actual processing so that the functions of the foregoing embodiments can be implemented by this processing.
While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all modifications, equivalent structures, and functions.
This application claims priority from Japanese Patent Application No. 2010-040596 filed February 25, 2010, which is hereby incorporated by reference herein in its entirety.

Claims (20)

  1. An information processing apparatus characterized in that the apparatus comprises:
    a relative position and orientation calculation means configured to calculate, based on a plurality of images captured by imaging an object from a plurality of view points by an imaging means, a relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points;
    a missing position and orientation calculation means configured to calculate a missing position and orientation of the imaging means from which imaging by the imaging means is missing based on the calculated plurality of relative positions and orientations; and
    a missing view-point image generation means configured to generate an image used for display of the calculated missing position and orientation on a display means.
  2. The information processing apparatus according to claim 1, characterized in that the missing position and orientation calculation means calculates the relative position and orientation, with respect to the object whose imaging by the imaging means is missing relative to the imaging means, as the missing position and orientation.
  3. The information processing apparatus according to claim 1 or 2, characterized in that the apparatus further comprises a learning means for classifier configured to allow a classifier that estimates orientations of the plurality of objects to learn the plurality of captured images, wherein the missing position and orientation calculation means calculates the missing position and orientation based on the orientations of the plurality of the calculated relative positions and the orientation of the object estimated by the classifier.
  4. The information processing apparatus according to claim 3, characterized in that the apparatus further comprises an image normalizing means configured to normalize the plurality of captured images in a unified format, and
    the learning means for classifier allows learning of the plurality of captured images which have been normalized.
  5. The information processing apparatus according to claim 3 or 4, characterized in that the missing position and orientation calculation means calculates the missing position and orientation by comparing the orientations of the plurality of the calculated relative positions and orientations and a set position and orientation which is set based on the plurality of captured images learned by the classifier.
  6. The information processing apparatus according to claim 3 or 4, characterized in that the apparatus further comprises an estimated image generation means configured to generate an estimated image in a case where imaging is performed from a set position and orientation which is set based on the plurality of captured images learned by the classifier, and
    a projection image generation means configured to generate a rendering image of a three-dimensional model of the object stored in advance in a case where observation is performed from the set position and orientation, and
    wherein the missing position and orientation calculation means calculates the missing position and orientation by comparing the estimated image and the rendering image.
  7. The information processing apparatus according to claim 5 or 6, characterized in that with respect to the estimated image generation means, the set position and orientation is an intermediate position and orientation of positions and orientations used for imaging the plurality of captured images learned by the classifier.
  8. The information processing apparatus according to claim 6 or 7, characterized in that the plurality of captured images leaned by the classifier are two captured images.
  9. The information processing apparatus according to any one of claims 1 to 8, characterized in that the captured image includes a plurality of markers arranged on the object and in a periphery of the object, and three-dimensional positions of the plurality of markers in a real space are known.
  10. The information processing apparatus according to claim 4, characterized in that each of the plurality of markers has a color different from a color of the object and the plurality of markers which are different from the marker.
  11. The information processing apparatus according to claim 4 or 5, characterized in that the relative position and orientation calculation means calculates the relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points by detecting positions of the plurality of markers in the captured image.
  12. The information processing method according to claim 6, characterized in that positions of the detected plurality of markers are four or more.
  13. The information processing method according to claim 1 characterized in that the relative position and orientation calculation means calculates the relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points by comparing characteristics of the three-dimensional model of the object stored in advance and characteristics of the object in the captured image.
  14. The information processing apparatus according to any one of claims 1 to 13, characterized in that the missing position and orientation calculation means comprises:
    an approximately uniform direction calculation means configured to calculate a plurality of approximately uniform directions in which imaging of the object can be approximately uniformly performed having the object at a center;
    a similar direction determination means configured to determine the approximately uniform direction which is similar to a direction of imaging based on the plurality of the calculated relative positions and orientations; and
    a dissimilar direction calculation means configured to calculate a direction whose imaging by the imaging means is missing based on the approximately uniform direction which is determined as not similar.
  15. The information processing apparatus according to any one of claims 1 to 14, characterized in that the missing position and orientation calculation means calculates, if the calculated relative position and orientation from a position in a vicinity of a view point which can be used in imaging a complex structure of a three-dimensional model of the object stored in advance is small in number, a relative position and orientation of the view point as a missing position and orientation.
  16. The information processing apparatus according to any one of claims 1 to 15, characterized in that the apparatus further comprises a three-dimensional model generation means configured to generate the three-dimensional model based on the plurality of captured images.
  17. The information processing apparatus according to any one of claims 1 to 16, characterized in that the apparatus further comprises an imaging moving means used for moving the imaging means.
  18. An information processing method performed by an information processing apparatus, characterized in that the method comprises:
    a relative position and orientation calculation means of the information processing apparatus calculating, based on a plurality of images captured by imaging an object from a plurality of view points by an imaging means, a relative position and orientation with respect to the object relative to the imaging means for each of the plurality of view points;
    a missing position and orientation calculation means of the information processing apparatus calculating a missing position and orientation of the imaging means from which imaging by the imaging means is missing based on the calculated plurality of relative positions and orientations; and
    a missing view-point image generation means of the information processing apparatus generating an image used for display of the calculated missing position and orientation on a display means.
  19. A computer-executable program configured to allow a computer to execute the information processing method according to claim 18.
  20. A storage medium storing a computer-executable program configured to allow a computer to execute the information processing method according to claim 18.
PCT/JP2011/000965 2010-02-25 2011-02-22 Information processing method and information processing apparatus WO2011105044A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/580,560 US9429418B2 (en) 2010-02-25 2011-02-22 Information processing method and information processing apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2010040596 2010-02-25
JP2010-040596 2010-02-25

Publications (1)

Publication Number Publication Date
WO2011105044A1 true WO2011105044A1 (en) 2011-09-01

Family

ID=44506477

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2011/000965 WO2011105044A1 (en) 2010-02-25 2011-02-22 Information processing method and information processing apparatus

Country Status (3)

Country Link
US (1) US9429418B2 (en)
JP (1) JP5812599B2 (en)
WO (1) WO2011105044A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103295011A (en) * 2012-02-08 2013-09-11 索尼公司 Information processing apparatus, information processing method and computer program
CN109565580A (en) * 2016-05-25 2019-04-02 佳能株式会社 Information processing equipment, image generating method, control method and program

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5812599B2 (en) * 2010-02-25 2015-11-17 キヤノン株式会社 Information processing method and apparatus
US9129438B2 (en) * 2011-01-18 2015-09-08 NedSense Loft B.V. 3D modeling and rendering from 2D images
JP5571128B2 (en) * 2012-06-28 2014-08-13 株式会社東芝 Measurement support apparatus, method and program
JP6338369B2 (en) * 2013-12-26 2018-06-06 キヤノン株式会社 Information processing apparatus and information processing method
JP6664579B2 (en) * 2014-06-20 2020-03-13 ヤフー株式会社 Learning device, learning method and learning program
JP6193195B2 (en) * 2014-09-17 2017-09-06 株式会社東芝 Movement support apparatus, method and program
US9704300B2 (en) * 2015-03-06 2017-07-11 Siemens Medical Solutions Usa, Inc. Detection of anatomy orientation using learning-based regression
EP3295368A1 (en) * 2015-05-13 2018-03-21 Google LLC Deepstereo: learning to predict new views from real world imagery
US10102654B1 (en) * 2015-07-28 2018-10-16 Cascade Technologies, Inc. System and method for a scalable interactive image-based visualization environment of computational model surfaces
JP2017049052A (en) * 2015-08-31 2017-03-09 日本ユニシス株式会社 System, method and program for generating three-dimensional image data of object
CN105513049B (en) * 2015-11-25 2018-12-21 中国联合网络通信集团有限公司 A kind of image forming method and image formation system
KR102016413B1 (en) * 2016-01-05 2019-09-02 한국전자통신연구원 Apparatus and method for scanning item
JP6164319B2 (en) * 2016-03-02 2017-07-19 ソニー株式会社 Information processing apparatus, information processing method, and computer program
US10486061B2 (en) 2016-03-25 2019-11-26 Zero Latency Pty Ltd. Interference damping for continuous game play
US10421012B2 (en) 2016-03-25 2019-09-24 Zero Latency PTY LTD System and method for tracking using multiple slave servers and a master server
US9916496B2 (en) 2016-03-25 2018-03-13 Zero Latency PTY LTD Systems and methods for operating a virtual reality environment using colored marker lights attached to game objects
JP6431495B2 (en) * 2016-03-25 2018-11-28 本田技研工業株式会社 Teacher data generation method
US10717001B2 (en) * 2016-03-25 2020-07-21 Zero Latency PTY LTD System and method for saving tracked data in the game server for replay, review and training
JP6821326B2 (en) * 2016-05-12 2021-01-27 キヤノン株式会社 Information processing equipment, measurement systems, information processing methods and programs
US10751609B2 (en) 2016-08-12 2020-08-25 Zero Latency PTY LTD Mapping arena movements into a 3-D virtual world
JP6945297B2 (en) * 2016-12-05 2021-10-06 株式会社日立製作所 Device diagnostic device, device diagnostic system and device diagnostic method
US10977825B2 (en) 2017-02-03 2021-04-13 Sumitomo Mitsui Construction Co., Ltd. Position measurement device and position measurement method
JP6983524B2 (en) * 2017-03-24 2021-12-17 キヤノン株式会社 Information processing equipment, information processing methods and programs
JP7164883B2 (en) * 2017-05-31 2022-11-02 株式会社フジキン Semiconductor manufacturing equipment management system, method, and computer program
JP6946087B2 (en) * 2017-07-14 2021-10-06 キヤノン株式会社 Information processing device, its control method, and program
JP6474179B2 (en) * 2017-07-30 2019-02-27 国立大学法人 奈良先端科学技術大学院大学 Learning data set creation method, and object recognition and position and orientation estimation method
JP6734253B2 (en) 2017-12-20 2020-08-05 ファナック株式会社 Imaging device including a visual sensor for imaging a workpiece
JP7133926B2 (en) 2018-01-15 2022-09-09 キヤノン株式会社 Information processing device, system, information processing method
JP2019192145A (en) 2018-04-27 2019-10-31 ソニー株式会社 Information processing device, information processing method and program
JP7036401B2 (en) * 2018-04-27 2022-03-15 Awl株式会社 Learning server, image collection support system for insufficient learning, and image estimation program for insufficient learning
US20220198751A1 (en) * 2019-03-15 2022-06-23 Mimaki Engineering Co., Ltd. Three-dimensional-body data generation device, three-dimensional-body data generation method, program, and modeling system
KR102198851B1 (en) 2019-11-12 2021-01-05 네이버랩스 주식회사 Method for generating three dimensional model data of an object
US11800056B2 (en) 2021-02-11 2023-10-24 Logitech Europe S.A. Smart webcam system
US11800048B2 (en) 2021-02-24 2023-10-24 Logitech Europe S.A. Image generating system with background replacement or modification capabilities

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005195335A (en) * 2003-12-26 2005-07-21 Fuji Xerox Co Ltd Three-dimensional image photographing equipment and method

Family Cites Families (93)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5327221A (en) * 1988-02-16 1994-07-05 Canon Kabushiki Kaisha Device for detecting positional relationship between two objects
US5340992A (en) * 1988-02-16 1994-08-23 Canon Kabushiki Kaisha Apparatus and method of detecting positional relationship using a weighted coefficient
US5319444A (en) * 1988-02-16 1994-06-07 Canon Kabushiki Kaisha Position detecting method and apparatus
US4920491A (en) * 1988-05-16 1990-04-24 General Electric Company Enhancement of image quality by utilization of a priori information
EP0358514B1 (en) * 1988-09-09 2000-12-13 Canon Kabushiki Kaisha Position detecting method and apparatus
US5225892A (en) * 1990-02-05 1993-07-06 Canon Kabushiki Kaisha Positional deviation detecting method
EP0455443B1 (en) * 1990-05-01 1997-11-12 Canon Kabushiki Kaisha Positional deviation detecting method and apparatus
US5291563A (en) * 1990-12-17 1994-03-01 Nippon Telegraph And Telephone Corporation Method and apparatus for detection of target object with improved robustness
US5625408A (en) * 1993-06-24 1997-04-29 Canon Kabushiki Kaisha Three-dimensional image recording/reconstructing method and apparatus therefor
JP2883265B2 (en) * 1993-09-24 1999-04-19 キヤノン株式会社 Image processing device
US6463176B1 (en) * 1994-02-02 2002-10-08 Canon Kabushiki Kaisha Image recognition/reproduction method and apparatus
US6118475A (en) * 1994-06-02 2000-09-12 Canon Kabushiki Kaisha Multi-eye image pickup apparatus, and method and apparatus for measuring or recognizing three-dimensional shape
JP3513219B2 (en) * 1994-07-12 2004-03-31 キヤノン株式会社 Corresponding point matching method and apparatus
US6839081B1 (en) * 1994-09-09 2005-01-04 Canon Kabushiki Kaisha Virtual image sensing and generating method and apparatus
US6640004B2 (en) * 1995-07-28 2003-10-28 Canon Kabushiki Kaisha Image sensing and image processing apparatuses
DE69635101T2 (en) * 1995-11-01 2006-06-01 Canon K.K. Method for extracting objects and image recording apparatus using this method
JP3869876B2 (en) * 1995-12-19 2007-01-17 キヤノン株式会社 Image measuring method and image measuring apparatus
US6445814B2 (en) * 1996-07-01 2002-09-03 Canon Kabushiki Kaisha Three-dimensional information processing apparatus and method
JP3679512B2 (en) * 1996-07-05 2005-08-03 キヤノン株式会社 Image extraction apparatus and method
US6453069B1 (en) * 1996-11-20 2002-09-17 Canon Kabushiki Kaisha Method of extracting image from input image using reference image
JPH10191020A (en) * 1996-12-20 1998-07-21 Canon Inc Object image segmenting method and device
EP0977151B1 (en) * 1998-07-31 2007-11-07 Canon Kabushiki Kaisha Image processing apparatus, image processing method, and storage medium
JP2000209425A (en) * 1998-11-09 2000-07-28 Canon Inc Device and method for processing image and storage medium
JP2000194859A (en) 1998-12-25 2000-07-14 Canon Inc Object shape extraction method, object shape extraction device and recording medium
JP2000209430A (en) * 1999-01-18 2000-07-28 Canon Inc Contour extraction device and method and storage medium
US6674905B1 (en) * 1999-01-22 2004-01-06 Canon Kabushiki Kaisha Image processing method, image processing apparatus, and storage medium
JP4306006B2 (en) * 1999-03-30 2009-07-29 コニカミノルタセンシング株式会社 Three-dimensional data input method and apparatus
US7054850B2 (en) * 2000-06-16 2006-05-30 Canon Kabushiki Kaisha Apparatus and method for detecting or recognizing pattern by employing a plurality of feature detecting elements
US6826299B2 (en) * 2000-07-31 2004-11-30 Geodetic Services, Inc. Photogrammetric image correlation and measurement system and method
US7088860B2 (en) * 2001-03-28 2006-08-08 Canon Kabushiki Kaisha Dynamically reconfigurable signal processing circuit, pattern recognition apparatus, and image processing apparatus
US20020164067A1 (en) * 2001-05-02 2002-11-07 Synapix Nearest neighbor edge selection from feature tracking
JP4085671B2 (en) * 2002-03-29 2008-05-14 コニカミノルタホールディングス株式会社 Data processing method, data processing program, and recording medium
US7031512B2 (en) * 2002-04-18 2006-04-18 Stmicroelectronics, Inc. Method and system for 3D smoothing within the bound of error regions of matching curves
JP2003346185A (en) * 2002-05-24 2003-12-05 Olympus Optical Co Ltd Information display system and personal digital assistant
JP2004139294A (en) * 2002-10-17 2004-05-13 Hitachi Ltd Multi-viewpoint image processing program, system, and marker
JP2004170329A (en) * 2002-11-22 2004-06-17 Nippon Steel Corp Bump electrode and ball inspection method for the same
WO2004055735A1 (en) * 2002-12-16 2004-07-01 Canon Kabushiki Kaisha Pattern identification method, device thereof, and program thereof
US8737720B2 (en) * 2003-04-30 2014-05-27 Deere & Company System and method for detecting and analyzing features in an agricultural field
EP2955662B1 (en) * 2003-07-18 2018-04-04 Canon Kabushiki Kaisha Image processing device, imaging device, image processing method
US7643025B2 (en) * 2003-09-30 2010-01-05 Eric Belk Lange Method and apparatus for applying stereoscopic imagery to three-dimensionally defined substrates
JP4262082B2 (en) * 2003-12-24 2009-05-13 キヤノン株式会社 Image processing apparatus, control method therefor, program, and storage medium
US7292735B2 (en) * 2004-04-16 2007-11-06 Microsoft Corporation Virtual image artifact detection
GB2415344B (en) * 2004-06-14 2010-10-06 Canon Europa Nv Texture data compression and rendering in 3D computer graphics
EP1607716A3 (en) * 2004-06-18 2012-06-20 Topcon Corporation Model forming apparatus and method, and photographing apparatus and method
JP4217664B2 (en) * 2004-06-28 2009-02-04 キヤノン株式会社 Image processing method and image processing apparatus
EP1774466A4 (en) * 2004-07-30 2009-10-21 Canon Kk Image processing method and apparatus, image sensing apparatus, and program
JP2006287749A (en) * 2005-04-01 2006-10-19 Canon Inc Imaging apparatus and control method thereof
FI20055168A0 (en) * 2005-04-13 2005-04-13 Gen Electric Tomografiamenetelmä
US7945938B2 (en) * 2005-05-11 2011-05-17 Canon Kabushiki Kaisha Network camera system and control method therefore
EP1904977B1 (en) * 2005-07-08 2010-12-15 Wisconsin Alumni Research Foundation Backprojection reconstruction method for ct imaging
DE602006009191D1 (en) * 2005-07-26 2009-10-29 Canon Kk Imaging device and method
JP4508049B2 (en) 2005-09-05 2010-07-21 株式会社日立製作所 360 ° image capturing device
US8542928B2 (en) * 2005-09-26 2013-09-24 Canon Kabushiki Kaisha Information processing apparatus and control method therefor
US7599547B2 (en) * 2005-11-30 2009-10-06 Microsoft Corporation Symmetric stereo model for handling occlusion
JP2007156528A (en) 2005-11-30 2007-06-21 Canon Inc Point density equally arranging device, method and program therefor, and storage medium
JP4764273B2 (en) * 2006-06-30 2011-08-31 キヤノン株式会社 Image processing apparatus, image processing method, program, and storage medium
US8072482B2 (en) * 2006-11-09 2011-12-06 Innovative Signal Anlysis Imaging system having a rotatable image-directing device
US8542236B2 (en) * 2007-01-16 2013-09-24 Lucasfilm Entertainment Company Ltd. Generating animation libraries
JP4845755B2 (en) * 2007-01-30 2011-12-28 キヤノン株式会社 Image processing apparatus, image processing method, program, and storage medium
US8000522B2 (en) * 2007-02-02 2011-08-16 General Electric Company Method and system for three-dimensional imaging in a non-calibrated geometry
JP4999570B2 (en) * 2007-06-18 2012-08-15 キヤノン株式会社 Facial expression recognition apparatus and method, and imaging apparatus
JP5084398B2 (en) * 2007-08-24 2012-11-28 キヤノン株式会社 Measuring apparatus, measuring method, and program
JP5448326B2 (en) * 2007-10-29 2014-03-19 キヤノン株式会社 Gripping device and gripping device control method
JP5553310B2 (en) * 2007-12-20 2014-07-16 コーニンクレッカ フィリップス エヌ ヴェ Image encoding method for stereoscopic rendering
JP5058838B2 (en) * 2008-02-01 2012-10-24 キヤノン株式会社 Information processing apparatus and method
JP5243612B2 (en) * 2008-10-02 2013-07-24 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Intermediate image synthesis and multi-view data signal extraction
WO2010048632A1 (en) * 2008-10-24 2010-04-29 Real D Stereoscopic image format with depth information
US8634635B2 (en) * 2008-10-30 2014-01-21 Clever Sys, Inc. System and method for stereo-view multiple animal behavior characterization
KR101260613B1 (en) * 2008-12-26 2013-05-03 닛뽕빅터 가부시키가이샤 Image encoding device, image encoding method, program thereof, image decoding device, image decoding method, and program thereof
US8306265B2 (en) * 2009-01-12 2012-11-06 Eastman Kodak Company Detection of animate or inanimate objects
JP5599572B2 (en) * 2009-03-12 2014-10-01 富士フイルム株式会社 Case image retrieval apparatus, method and program
CA2772607A1 (en) * 2009-09-01 2011-03-10 Prime Focus Vfx Services Ii Inc. System and process for transforming two-dimensional images into three-dimensional images
JP5812599B2 (en) * 2010-02-25 2015-11-17 キヤノン株式会社 Information processing method and apparatus
JP5588196B2 (en) * 2010-02-25 2014-09-10 キヤノン株式会社 Recognition device, control method therefor, and computer program
JP5665333B2 (en) * 2010-03-10 2015-02-04 キヤノン株式会社 Information processing apparatus and information processing apparatus control method
US9053393B2 (en) * 2010-03-19 2015-06-09 Canon Kabushiki Kaisha Learning method and apparatus for pattern recognition
US9400503B2 (en) * 2010-05-20 2016-07-26 Irobot Corporation Mobile human interface robot
JP2012038106A (en) * 2010-08-06 2012-02-23 Canon Inc Information processor, information processing method and program
JP5615088B2 (en) * 2010-08-18 2014-10-29 キヤノン株式会社 Image processing apparatus and method, program, and imaging apparatus
JP5587137B2 (en) * 2010-10-29 2014-09-10 キヤノン株式会社 Measuring apparatus and measuring method
JP5743499B2 (en) * 2010-11-10 2015-07-01 キヤノン株式会社 Image generating apparatus, image generating method, and program
JP5767464B2 (en) * 2010-12-15 2015-08-19 キヤノン株式会社 Information processing apparatus, information processing apparatus control method, and program
JP5798318B2 (en) * 2010-12-15 2015-10-21 キヤノン株式会社 Distance data generation device, position and orientation measurement device, distance data generation device control method, and program
US9177381B2 (en) * 2010-12-22 2015-11-03 Nani Holdings IP, LLC Depth estimate determination, systems and methods
CN103430218A (en) * 2011-03-21 2013-12-04 英特尔公司 Method of augmented makeover with 3d face modeling and landmark alignment
FR2981772B1 (en) * 2011-10-21 2017-12-22 Thales Sa METHOD FOR 3D RECONSTRUCTION OF AN OBJECT OF A SCENE
JP5865043B2 (en) * 2011-12-06 2016-02-17 キヤノン株式会社 Information processing apparatus and information processing method
JP2013162487A (en) * 2012-02-08 2013-08-19 Canon Inc Image display apparatus and imaging apparatus
JP6027764B2 (en) * 2012-04-25 2016-11-16 キヤノン株式会社 Mirror system and control method thereof
US9141868B2 (en) * 2012-06-26 2015-09-22 Xerox Corporation Contemporaneously reconstructing images captured of a scene illuminated with unstructured and structured illumination sources
JP6143469B2 (en) * 2013-01-17 2017-06-07 キヤノン株式会社 Information processing apparatus, information processing method, and program
JP6433149B2 (en) * 2013-07-30 2018-12-05 キヤノン株式会社 Posture estimation apparatus, posture estimation method and program
JP6338369B2 (en) * 2013-12-26 2018-06-06 キヤノン株式会社 Information processing apparatus and information processing method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005195335A (en) * 2003-12-26 2005-07-21 Fuji Xerox Co Ltd Three-dimensional image photographing equipment and method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103295011A (en) * 2012-02-08 2013-09-11 索尼公司 Information processing apparatus, information processing method and computer program
CN109565580A (en) * 2016-05-25 2019-04-02 佳能株式会社 Information processing equipment, image generating method, control method and program
US11012674B2 (en) 2016-05-25 2021-05-18 Canon Kabushiki Kaisha Information processing apparatus, image generation method, control method, and program

Also Published As

Publication number Publication date
US9429418B2 (en) 2016-08-30
US20120321173A1 (en) 2012-12-20
JP2011198349A (en) 2011-10-06
JP5812599B2 (en) 2015-11-17

Similar Documents

Publication Publication Date Title
WO2011105044A1 (en) Information processing method and information processing apparatus
US9940756B2 (en) Silhouette-based object and texture alignment, systems and methods
US11710282B2 (en) Augmented reality content rendering via Albedo models, systems and methods
JP6768156B2 (en) Virtually enhanced visual simultaneous positioning and mapping systems and methods
US9928656B2 (en) Markerless multi-user, multi-object augmented reality on mobile devices
US9177381B2 (en) Depth estimate determination, systems and methods
US10410089B2 (en) Training assistance using synthetic images
US11222471B2 (en) Implementing three-dimensional augmented reality in smart glasses based on two-dimensional data
JP4829141B2 (en) Gaze detection apparatus and method
CN112889091A (en) Camera pose estimation using fuzzy features
KR102386444B1 (en) Image depth determining method and living body identification method, circuit, device, and medium
JP6897082B2 (en) Computer program for face orientation estimation, face orientation estimation device and face orientation estimation method
JP4631973B2 (en) Image processing apparatus, image processing apparatus control method, and image processing apparatus control program
JP6420605B2 (en) Image processing device
KR20190011492A (en) Device for providing content and method of operating the same
CN108028904A (en) The method and system of light field augmented reality/virtual reality in mobile equipment
JP2020042575A (en) Information processing apparatus, positioning method, and program
JP2006215743A (en) Image processing apparatus and image processing method
US11410398B2 (en) Augmenting live images of a scene for occlusion
JP2003346185A (en) Information display system and personal digital assistant
JP6931267B2 (en) A program, device and method for generating a display image obtained by transforming the original image based on the target image.
JP6606340B2 (en) Image detection apparatus, image detection method, and program
KR101844367B1 (en) Apparatus and Method for Head pose estimation using coarse holistic initialization followed by part localization
WO2022102015A1 (en) Image information acquisition device, image information acquisition method, and computer program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11747016

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 13580560

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11747016

Country of ref document: EP

Kind code of ref document: A1