Object region data generating method, object region data generating apparatus, approximation polygon generating method, and approximation polygon generating apparatus

Information

  • Patent Application
  • 20010025298
  • Publication Number
    20010025298
  • Date Filed
    March 16, 2001
    23 years ago
  • Date Published
    September 27, 2001
    22 years ago
Abstract
In an object region data generating method for moving picture, an object region is approximated with a polygon in each frame. Each vertex of the polygon in one frame is associated with each vertex of the polygon in adjacent frame based on the distance between the vertexes. A trajectory of the vertex is obtained on a time-series variation of the associated vertexes. The object region data is generated based on the trajectory of the vertex.
Description


CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is based upon and claims the benefit of priority from the prior Japanese Patent Application No. 2000-077161, filed Mar. 17, 2000, the entire contents of which are incorporated herein by reference.



BACKGROUND OF THE INVENTION

[0002] The present invention relates an object region data generating method and apparatus for generating data that indicates a region of an object in a moving picture. In addition, the present invention relates to an approximation polygon generating method and apparatus for generating an approximation polygon having an arbitrary object region approximated in one frame of a moving picture or a still picture.


[0003] In recent years, with rapid advancement of image processing technology, a moving picture or a still picture is generally processed as digital data. With this digitization, a technique for efficiently compressing moving picture or still picture data with its large data quantity has been established. In addition, with the development of network technology such as Internet or satellite broadcasting, CATV and the like, a moving picture or still picture database which stores a large amount of moving picture or still picture or a video-on-demand system will be put into a practical stage. Further, an automatic supervisory system from a remote site becomes a major object. In this way, when a moving picture or a still picture is used, there is a variety of demands to classify such moving picture or still picture according to what exists in a screen, and extract, process, and retrieve an object in a moving picture or a still picture.


[0004] In order to respond to these demands, technical research for determining what is drawn in a moving picture or a still picture is made. However, both of a background and a plurality of objects exist in a moving picture or a still picture. In order to identify a specific object, it is required to identify such background from objects as preprocessing. At this time, it is required to represent object region data in any shape, and data on object position, size, and shape must be precisely expressed. As a technique for this purpose, for example, there is developed an alpha map representing an image with high brightness of a portion at which an object exists and with low brightness of a portion at which no object exists or a bounding box approximating an object region in rectangular shape. However, alpha map has disadvantages that a large amount of data is required, and a large calculation amount of detection of a position or size of an object is required. In addition, in such bounding box, it is impossible to express a shape of an object.


[0005] As a technique to solve these disadvantages, there is a method for approximating an object in a polygonal shape. In this case, only the coordinate of each vertex of a polygon is described, and thus, a remarkably smaller amount of data than that in alpha map is required. In addition, this method is the most suitable to expression of object region data because it has characteristics in which the position, size, and shape of an object is expressed faithfully.


[0006] However, conventionally, as a method for approximating an object in a polygonal shape, there has been mainly used a technique in which an inertial main axis of an object is obtained, a point at which the inertial main axis cross an object contour is defined as a vertex, and then, a point on the object contour on which a distance from a line connecting vertexes to each other is the most distant is added as a vertex to obtain a polygon with a required number of vertexes. In this technique, it is required to search a point on an object contour every time an attempt is made to add one vertex. Thus, there has been a disadvantage that a large calculation amount of polygon approximation is required. In addition, a vertex always exists on an object contour, and thus, a segment of line connecting vertexes to each other enters the inside of an object at a protrusion portion of the object. Conversely, a segment of line connecting vertex to each other is expelled to the outside of an object at a recess portion of the object. Thus, there is a problem that a large amount of error between a polygon and an object is required.


[0007] In addition, with respect to expression of an object in a moving picture as well, polygon approximation is excellent as compared with alpha map or bounding box or the like from the aspect of data amount or expression of the position, size, and shape of the object for the reason similar to that in still picture. However, if a time-series variation of an object is expressed by polygon approximation, the movement of each vertex that is polygon approximated is described. However, in order to serially record a position of the vertex of each frame, there is a problem that a large amount of data is required, which is inconvenient.



BRIEF SUMMARY OF THE INVENTION

[0008] Accordingly, it is an object of the present invention to provide an object region data generating method and apparatus generating a trajectory of a vertex of an approximation polygon that expresses a region of a desired object in a moving picture with high speed and high precision.


[0009] It is another object of the present invention to provide an object region data generating method and apparatus describing a region of a desired object in a moving picture with a small amount of data, and facilitating generation of such object or data processing.


[0010] It is still another object of the present invention to provide an approximation polygon generating method and apparatus generating an approximation polygon that approximates a region of an arbitrary object in one frame of a moving picture or a still picture with high precision and high speed.


[0011] According to the present invention, there is provided a method of generating object region data relating to object regions in frames of moving picture, the frames lined up in a time-series variation, the method comprising:


[0012] approximating the object region in each of the frames with a polygon, the polygon having vertexes;


[0013] associating the vertexes in each of the frames with the respective vertexes in an adjacent frame of each of the frames;


[0014] obtaining trajectories linking the vertexes associated together based on the time-series variation of the frames; and


[0015] generating the object region data based on the trajectories.


[0016] According to the present invention, there is provided another method of generating object region data relating to an object region in moving picture data having a plurality of frames, the method comprising:


[0017] approximating the object region in each of the plurality of frames with a polygon;


[0018] estimating a plurality of vertexes of the polygon in a given frame based on trajectory data indicating vertexes of the polygon from a first frame to an immediately preceding frame of the given frame;


[0019] modifying the plurality of vertexes estimated so as to be associated with the object region in the given frame;


[0020] obtaining trajectory data indicating vertexes of the polygon from the first frame to the given frame; and


[0021] generating the object region data based on the trajectory data indicating vertexes of the polygon from the first frame to the given frame.


[0022] According to the present invention, there is provided a further method of generating a polygon that approximates an object region in an image, the method comprising:


[0023] extracting a contour of the object region to generate an initial polygon;


[0024] obtaining a characteristic quantity of each of vertexes of the initial polygon;


[0025] selecting the vertex to be deleted based on the characteristic quantity;


[0026] deleting the selected vertex;


[0027] correcting a position of vertexes adjacent to the deleted vertex, thereby performing processing for updating the polygon; and


[0028] repeatedly performing the processing until a predetermined ending condition has been met, thereby generating the polygon that approximates the object region.


[0029] According to the present invention, there is provided an apparatus for generating object region data relating to object regions in frames of moving picture, the frames lined up in a time-series variation, the apparatus comprising:


[0030] an approximation unit configured to approximate the object region in each of the frames with a polygon, the polygon having vertexes;


[0031] an association unit configured to associate the vertexes in each of the frames with the respective vertexes in an adjacent frame of each of the frames;


[0032] a trajectory unit configured to obtain trajectories linking the vertexes associated together based on the time-series variation of the frames; and


[0033] a data generation unit configured to generate the object region data based on the trajectories.


[0034] According to the present invention, there is provided another apparatus for generating object region data relating to an object region in moving picture data having a plurality of frames, the apparatus comprising:


[0035] an approximation unit configured to approximate the object region in each of the plurality of frames with a polygon;


[0036] an estimation unit configured to estimate a plurality of vertexes of the polygon in a given frame based on trajectory data indicating vertexes of the polygon from a first frame to an immediately preceding frame of the given frame;


[0037] a modifying unit configured to modify the plurality of vertexes estimated so as to be associated with the object region in the given frame;


[0038] a trajectory unit configured to obtain trajectory data indicating vertexes of the polygon from the first frame to the given frame; and


[0039] a data generation unit configured to generate the object region data based on the trajectory data indicating vertexes of the polygon from the first frame to the given frame.


[0040] According to the present invention, there is provided a still another apparatus for generating a polygon that approximates an object region in an image, the apparatus comprising:


[0041] an extraction unit configured to extract a contour of the object region to generate an initial polygon;


[0042] a characteristic quantity unit configured to obtain a characteristic quantity of each of vertexes of the initial polygon;


[0043] a selector configured to select the vertex to be deleted based on the characteristic quantity;


[0044] a deletion unit configured to delete the selected vertex;


[0045] a correction unit configured to correct a position of vertexes adjacent to the deleted vertex, thereby performing processing for updating the polygon; and


[0046] a controller configured to repeatedly perform the processing until a predetermined ending condition has been met, thereby generating the polygon that approximates the object region.


[0047] According to the present invention, there is provided an article of manufacture comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means comprising:


[0048] first computer readable program code means for causing a computer to approximate the object region in each of the frames with a polygon, the polygon having vertexes;


[0049] second computer readable program code means for causing a computer to associate the vertexes in each of the frames with the respective vertexes in an adjacent frame of each of the frames;


[0050] third computer readable program code means for causing a computer to obtain trajectories linking the vertexes associated together based on the time-series variation of the frames; and


[0051] fourth computer readable program code means for causing a computer to generate the object region data based on the trajectories.


[0052] According to the present invention, there is provided another article of manufacture comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means comprising:


[0053] first computer readable program code means for causing a computer to approximate the object region in each of the plurality of frames with a polygon;


[0054] second computer readable program code means for causing a computer to estimate a plurality of vertexes of the polygon in a given frame based on trajectory data indicating vertexes of the polygon from a first frame to an immediately preceding frame of the given frame;


[0055] third computer readable program code means for causing a computer to modify the plurality of vertexes estimated so as to be associated with the object region in the given frame;


[0056] fourth computer readable program code means for causing a computer to obtain trajectory data indicating vertexes of the polygon from the first frame to the given frame; and


[0057] fifth computer readable program code means for causing a computer to generate the object region data based on the trajectory data indicating vertexes of the polygon from the first frame to the given frame.


[0058] According to the present invention, there is provided a still another article of manufacture comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means comprising:


[0059] first computer readable program code means for causing a computer to extract a contour of the object region to generate an initial polygon;


[0060] second computer readable program code means for causing a computer to obtain a characteristic quantity of each of vertexes of the initial polygon;


[0061] third computer readable program code means for causing a computer to select the vertex to be deleted based on the characteristic quantity;


[0062] fourth computer readable program code means for causing a computer to delete the selected vertex;


[0063] fifth computer readable program code means for causing a computer to correct a position of vertexes adjacent to the deleted vertex, thereby performing processing for updating the polygon; and


[0064] sixth computer readable program code means for causing a computer to repeatedly perform the processing until a predetermined ending condition has been met, thereby generating the polygon that approximates the object region.


[0065] According to the present invention, the size, shape, movement, and deformation of a region for a desired object in a moving picture is approximated by a polygon with high precision and high speed. In addition, a region for a desired object in a moving picture is described with a small amount of data, making it possible to facilitate such generation and data processing. As a result, in a moving picture database or a supervisory system and the like, the present invention is efficiently used for object recognition, classification or search and the like.







BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING

[0066]
FIG. 1 is a view showing an example of an object region data generating apparatus according to a first embodiment of the present invention;


[0067]
FIG. 2 is a view showing an example of processing procedures in the first embodiment;


[0068]
FIG. 3 is a flow chart showing an example of polygon generating processing;


[0069]
FIG. 4 is a view illustrating a polygon generating processing;


[0070]
FIG. 5 is a view illustrating vertex deletion processing and moving processing of the adjacent vertexes in a processing for generating an approximation polygon;


[0071]
FIG. 6 is a flow chart showing an example of procedures for vertex association processing;


[0072]
FIG. 7 is a view illustrating a vertex association process;


[0073]
FIG. 8A and FIG. 8B are views each illustrating function approximation processing;


[0074]
FIG. 9 is a view showing an example of a data structure of object region data;


[0075]
FIG. 10 is a view showing an example of a data structure of approximation polygon data contained in object region data;


[0076]
FIG. 11 is a view showing an example of a data structure of vertex trajectory data contained in approximation polygon data;


[0077]
FIG. 12 is a flow chart showing an example of processing procedures in a second embodiment of the present invention; and


[0078]
FIG. 13 is a view illustrating processing in the second embodiment of the present invention.







DETAILED DESCRIPTION OF THE INVENTION

[0079] A preferred embodiment of an object region data generating method and apparatus and an approximation polygon generating method and apparatus according to the present invention will now be described with reference to the accompanying drawings.


[0080]
FIG. 1 shows a configuration example of an object region data generating apparatus according to the first embodiment of the present invention. As shown in FIG. 1, this object region data generating apparatus comprises an image storage device 100, a polygon generator 101, a vertex association unit 102, a function approximation unit 103, and an object region data storage device 104.


[0081] The image storage device 100 stores image data such as moving picture data or still picture data, and comprises a hard disk or optical disk and a semiconductor memory or the like.


[0082] The polygon generator 101 generates a polygon that represents a region of an object to be extracted from among moving picture data with respect to each frame at which such object exists. The term “object” used here denotes, for example, a specific person or animal in an image or any other creatures or manufactures or structural materials such as car or building (or parts of these materials including human head, car chassis, or gate of building). An object is specified by an operator or an object extracting algorithm, such as used for a stereoscopic display. Hereinafter, a polygon representing an area for an object is called an approximation polygon. An approximation polygon is represented by the coordinate (X-Y coordinate group) of each vertex of such polygon.


[0083] There are two methods for generating an approximation polygon: a method generating polygons with their identical vertexes in number over all frames with regard to identical objects and a method in which the number of vertexes in a polygon may be different from another every frame.


[0084] The vertex association unit 102 associates corresponding vertexes of approximation polygons between adjacent frames in a linking manner with regard to the vertexes of approximation polygons generated over a plurality of frames with regard to each object, and generates a trajectory of the vertexes of the approximation polygons over a plurality of frames with regard to each object. That is, this vertex association processing is performed for each object to be extracted from moving picture data from a frame in which such each object appears over a frame in which it disappears.


[0085] For example, when a vertex of an approximation polygon in which a region of an object in a frame is approximated is associated with a vertex of the corresponding polygon of the adjacent frame, thereby obtaining a trajectory of the vertex, a distance between the vertex in one frame and the associated vertex of the adjacent frame may be determined so as to be minimum. Before obtaining this distance, two polygons may be aligned with each other so that their centers of gravity coincides with each other.


[0086] In addition, for example, a position of a vertex of the adjacent frame is estimated from a trajectory of the already associated vertex so that a distance between the estimated vertex and a vertex in the adjacent frame may be determined so as to be minimum.


[0087] Further, for example, the characteristic quantity in vertexes of a polygon is calculated so that a vertex having its closest characteristic quantity in the adjacent frame may be associated with another vertex.


[0088] The function approximation unit 103 performs function approximation for each object with regard to the trajectory of each vertex in an approximation polygon (a series of the coordinates of the corresponding vertexes over a plurality of frames from a frame in which an object appears to a frame in which an object disappears). For example, with respect to each vertex of an approximation polygon of one object, a trajectory of a vertex is expressed as a function of a frame number “f” or a time stamp “t” assigned to a moving picture for each of the X and Y coordinates. Function approximation methods include approximation using a straight line or approximation using a spline curve and the like. The result of function approximation is expressed by a parameter value such as a polynomial coefficient of an approximation function, for example. In this manner, the trajectory of vertexes over a plurality of frames is approximated by a function and the function is expressed by only parameters, thereby a data quantity to be recorded is reduced.


[0089] The object region data storage device 104 stores in accordance with a predetermined format, object region data generated by the function approximation unit 103, the data containing data obtained by expressing a time series trajectory of each coordinate of each vertex of the approximation polygon of each object in a function approximation manner (for example, parameter value such as polynomial coefficient of approximation function). As with the image storage device 100 and the like, the object region data storage device 104 also comprises a hard disk or optical disk, a semiconductor memory and the like, for example.


[0090] The object region data generated with respect to each object is used when an object is recognized or classified in a moving picture database or a supervisory system and the like.


[0091] The image storage device 100 and the object region data storage device 104 may comprise different storage devices or identical storage devices.


[0092]
FIG. 2 is a flow chart showing an example of the flow of processing using an object region data generating apparatus according to the present embodiment.


[0093] A variety of processing procedures include:


[0094] (1) a method of performing processing sequentially for each frame by reading one frame to perform a series of processing functions (polygon generation processing, vertex association processing, function approximation processing) at steps S11 to S13, and reading a next frame;


[0095] (2) a method of first performing processing for performing polygon generation processing at step S11 with regard to a plurality of frames, performing vertex association processing at step S12 in batch, and performing function approximation processing at step S13; and


[0096] (3) a method of first performing polygon generation processing at step S11 with regard to all the target frames, performing vertex association processing at step S12 in batch, and performing function approximation processing at step S13 in batch.


[0097] The object region data generating apparatus according to the present embodiment can be implemented using software, of course. In this case, a program may be executed on a computer by creating the program in accordance with the flow chart shown in FIG. 2, for example, and installing it in the computer.


[0098] Hereinafter, a description is mainly given to a case in which function approximation using the function approximation unit 103 is performed, thereby reducing a data quantity, and recording object region data. It is possible to record the coordinate value of each vertex of an approximation graphics for each frame in object region data without performing function approximation using the function approximation unit 103 (in the latter case, the function approximation unit 103 may not be provided).


[0099] Hereinafter, each of the processing functions in the present embodiment will be described in more detail.


[0100] First, polygon generation processing using the polygon generator 101 will be described with reference to FIG. 3 to FIG. 5.


[0101]
FIG. 3 shows an example of processing procedures for generating an approximation polygon that indicates a region of an object from a frame in which an object exists.


[0102] First, at step S21, an initial polygon precisely indicating a region of an object with a plurality of vertexes is generated.


[0103] As input images, there may be used a bit map image (alpha map) in which a region in which an object exists is defined as “1” or a region in which an object does not exist is defined as “0”; an edge image obtained by removing a sharp edge from the image; or any other format image as long as a region in which an object exists or a region in which an object does not exist can be identified from each other. An example of input image is shown in an image 301 shown in FIG. 4.


[0104] One point on an object contour is searched from the input image, and that point is defined as a vertex of an initial polygon. When this vertex is defined as a start point, while an object contour is traced with a short distance, the vertex on the contour is defined as a vertex, and is sequentially added to the initial polygon. In general, although the initial polygon is formed as a polygon with a very large number of vertexes, a polygon approximation error (for example, difference between an area for an actual object region and an area for the approximation polygon or a value obtained by dividing this difference by the area for the actual object region or the area for the approximation polygon) is small. An example of the initial polygon obtained by the input image 301 is shown in an image 302 shown in FIG. 4. In this case, a large amount of data is required, and then, processing for reducing such amount of data is performed.


[0105] At step S22, a characteristic quantity is calculated with regard to each vertex of the obtained initial polygon. The characteristic quantity estimates how much polygon approximation error increases when such vertex is deleted. The characteristic quantity may be an angle formed at such vertex and two vertexes adjacent thereto or an area for a triangle formed at such vertex and two vertexes adjacent thereto.


[0106] An example of a relationship between a vertex and its characteristic quantity is shown in an image 304 shown in FIG. 5. In the image 304, an angle formed at a vertex ti and two vertex adjacent thereto is defined as θi, and an area for a triangle formed at the vertex ti and two vertex adjacent thereto is defined as si.


[0107] Where an area si for a triangle is used as a characteristic quantity of vertex ti=(xi, yi), when the adjacent vertexes are defined as ti−1=(xi−1, yi−1), ti+1=(xi+1, yi+1), the characteristic quantity si is represented by:




s


i
=|(xi+1−xi)·(yi−1, yi)−(xi−1, xi)·(yi+1, yi)|/2



[0108] If a polygon before deleting a vertex ti well approximates an object, an approximation error increased by deleting the vertex ti is proportional to the characteristic quantity si. Therefore, if the characteristic quantity si is small, even if the vertex ti is deleted, the approximation error is less affected.


[0109] At step S23, one vertex at which the smallest polygon approximation error occurs when the vertex is deleted (that is, a vertex having its minimum characteristic quantity si) is selected from the characteristic quantity obtained at step S22, and such vertex is deleted.


[0110] If a deleted point corresponds to a protrusion portion of a polygon, there is high possibility that the approximation error produced by deleting a vertex corresponds to an error produced by a polygon entering the inside of an object. In this case, the two vertexes adjacent to the deleted vertex are moved to the outside of the polygon, thereby reducing an approximation error to the minimum.


[0111] Conversely, if a deleted point corresponds to a recess portion of a polygon, there is high possibility that the approximation error produced by deleting a vertex corresponds to an error produced by the polygon being expelled to the outside of the object. In this case, therefore, two vertexes adjacent to the deleted vertex is moved to the inside of the polygon, thereby reducing an approximation error to the minimum.


[0112] Although a movement vector of the adjacent vertexes may be arbitrary, if the movement quantity is too large, the approximation error increases conversely. Because of this, for example, this vector may be determined from a polygon approximation error or its peripheral vertex coordinates.


[0113] In the case of determination from the polygon approximation error, while a movement vector is changed, the best movement vector is searched for (for example, a movement vector with its minimum error is adopted).


[0114] In the case of determination from its peripheral vertex coordinate, for example, when a vertex ti is deleted, a movement vector vi of an adjacent vertexes ti−1 and ti+1 is determined as a vector vertical to a straight line connecting a vertex ti+1 to a vertex ti−1, the vector having a length that is half of a distance between the straight line connecting the vertex ti+1 to the vertex ti−1 and a vertex ti. When a vector connecting the vertex ti to the vertex ti−1 is defined as ai, and a vector connecting the vertex ti+1 to the vertex ti−1 is defined as bi, the movement vector vi is expressed as follows.




v


i


={a


i
−(ai·bi/|bi|2bi}/2



[0115] where (ai·bi/|bi|2bi denotes a positive projection of ai and bi.


[0116] An example of a movement vector of the adjacent vertex when a vertex is deleted is shown in an image 305 shown in FIG. 5, and an example when the adjacent vertexes are moved by movement vector is shown in an image 306 shown in FIG. 5.


[0117] The vertexes are deleted, and the adjacent vertexes are moved, whereby there is a need to update the characteristic quantity of a vertex. At step S24, the characteristic quantity of a vertex to be updated is re-calculated. The vertex to be updated is a vertex moved at step S23 (vertexes ti−1 and ti+1 shown an example shown in FIG. 5) and the adjacent vertexes thereto (ti−2 and ti+2 in an example shown in FIG. 5).


[0118] At step S25, it is determined whether or not termination conditions are met.


[0119] A variety of termination conditions will be described below.


[0120] For example, there is a method of using a termination condition as to whether the number of vertexes in an approximation polygon is many more than a specified number. In this case, if the polygon includes the specified number of vertexes or less as a result of deletion of vertexes, processing is terminated. If vertexes equal to or greater than the specified number of vertexes still remains, it returns to step S23 at which steps S23 to S25 are repeated until the polygon includes the specified number of vertexes or less.


[0121] In addition, apart from a method of judgment according to whether or not a polygon includes a specified number of vertexes or less, for example, there is a method for specifying a termination condition according to whether or not a polygon approximation error exceeds a threshold. In this case, if the polygon approximation error exceeds such threshold, processing terminates.


[0122] Further, an AND or OR combination or the like of a plurality of conditions may be employed.


[0123] Where function approximation is performed using the function approximation unit 103, it is preferable that the number of vertexes be equal (there are a method in which the number of vertexes of all objects are identical to each other and a method in which the number of vertexes of objects may be different from each other). Conversely, if function approximation is not performed using the function approximation unit 103, the number of vertexes may not be equal.


[0124] An example of an approximation polygon that represents a region of an object obtained by this processing is shown in an image 303 shown in FIG. 4.


[0125] Now, vertex association processing in the adjacent frames using the vertex association unit 102 will be described with reference to FIG. 6 and FIG. 7.


[0126]
FIG. 6 shows an example of processing procedures for associating each vertex of an approximation polygon that represents a region for each object extracted for each frame.


[0127] First, at step S71, when an approximation polygon representing a region of an object in a frame is defined as Pi, and an approximation polygon in a next frame (adjacent frame) of such frame is defined as Pi+1, the centers of gravity of these polygons Pi and Pi+1 are obtained. Examples of the approximation polygons Pi and Pi+1, and the center of gravity obtained therefor are shown in the images 311 and 321 in FIG. 7.


[0128] If there are plural objects (its approximation polygons) in the adjacent frame, a distance between the center of gravity of the approximation polygon Pi of the object targeted in the frame and that of each of the approximation polygons in the adjacent frame is calculated, and then, an approximation polygon having its center of gravity that is the shortest in distance among them is defined as an approximation polygon Pi+1 that corresponds to the object. Of course, the approximation polygons may be associated with each other using one variation of another method.


[0129] Next, consider a polygon P′i in which the center of gravity of the approximation polygon Pi in the frame is moved to be positioned at the center of gravity of the approximation polygon Pi+1 in the adjacent frames. An example of a relationship between each of the polygons Pi, Pi+1, and P′i and each of their centers of gravity is shown in images 312 and 322 shown in FIG. 7.


[0130] However, if a distance between the center of gravity of the approximation polygon Pi and that of the approximation polygon Pi+1 is short, the center of gravity of the approximation polygon Pi may be defined as P′i without causing movement such that the center of gravity of the approximation polygon Pi coincides with that of the approximation polygon Pi+1.


[0131] At step S73, each vertex of the polygon Pi is associated with each vertex of Pi+1. An example in which these vertexes are associated with each other is shown in images 313 and 323 shown in FIG. 7.


[0132] Hereinafter, examples of a variety of vertex association processing functions will be shown below.


[0133] (Method 1: Method for determining based on distance between vertexes)


[0134] A distance between a vertex and each vertex of the approximation polygon Pi+1 in the adjacent frame is obtained for each polygon P′i which has been moved to a position at which the center of gravity thereof coincides with that of the approximation polygon Pi+1, and a vertex of the polygon Pi+1 with its shortest distance is defined as a vertex corresponding to such vertex.


[0135] As a result of the above processing, if one or more of the vertexes of the polygon Pi+1 cannot be associated with vertexes of the polygon P′i among the vertexes of the polygon Pi+1, such vertex is associated with a vertex that is the closest to the vertexes of the polygon P′i (this applies to methods 2 and 3).


[0136] (Method 2: Method for determining association based on the shape of a polygon)


[0137] The characteristic quantity of each vertex of an approximation polygon in each frame is obtained, and the vertexes with their close characteristic quantities are associated with each other. The characteristic quantities include an area or a vertex angle of a triangle produced at the vertex and the adjacent two vertexes thereto.


[0138] (Method 3: Method for estimating a vertex position of a frame to be associated based on the trajectory of vertex)


[0139] The trajectory of j-th vertex of the vertexes of polygons P1, . . . , Pi associated in frames is approximated by a time function Tj (t); a vertex position in the frame i+1 is estimated based on the function Tj (t); and a vertex that is the closest to the estimated vertex position, of the vertexes of the polygon Pi+1 in the frame, is defined as a vertex that corresponds to the j-th vertex.


[0140] (Method 4)


[0141] Where the number of vertexes in the polygon P′i is equal to that of the polygon Pi+1, in the above methods (1 to 3) if plural vertexes of the polygon P′i are associated with same vertex in the polygon Pi+1 as a result of the processing (that is, if one or more vertexes in the polygon Pi+1 cannot be associated with the vertexes of the polygon P′i), such association may not be used. In addition, processing may be continued in order to find out another association such that the vertexes of the polygon P′i are associated with those of the polygon Pi+1 one-to-one. For example, if plural vertexes of the polygon P′i are to be associated with the same vertex of the polygon Pi+1, one proper vertex of the plural vertexes of the polygon P′i is excluded (to be associated with the vertex of the polygon Pi+1), and the remaining vertexes are tried to be associated with one that is not associated with the vertex of the polygon P′i of the vertexes of the polygon Pi+1


[0142] (Method 5)


[0143] In the above methods (1 to 4), in the case of limitation to association such that there does not occur a portion at which the sequence of vertex association is replaced with vertex association (for example, a case in which the vertexes of the polygons in the mutually adjacent frames correspond to each other in the clockwise sequence), when association is obtained such that the sequence of vertex association is not reversed, such association is not be used. Then, processing is continuous so as to find out another association in the sequential order. For example, a part or whole of the association is changed, processing is repeated in a trial and error manner.


[0144] (Method 6)


[0145] Where the number of vertexes in a polygon P′i is equal to that of a polygon Pi+1, if the sequence of vertexes is limited to association in a predetermined order such as the clockwise sequence, for example, there is obtained a shortest distance between each of the vertexes of the polygon P′i and each of those of the polygon Pi+1 the vertex of the polygon P′i is associated with the vertex of the polygon P′i+1, and further, the right adjacent vertex of the associated vertex of the polygon P′i is associated with the right adjacent vertex of the associated vertex of the polygon Pi+1, and subsequently, such association may be repeated similarly in the clockwise direction. In addition, instead of obtaining one vertex first associated, there is obtained a sum of a distance between vertexes in the case of associating a predetermined number (a plurality) of vertexes adjacent to each other in the polygon P′i with the same number of vertexes adjacent to each other in the polygon Pi+1, a correspondence of the predetermined number of vertexes in which such sum is minimum is adopted. Further, the right adjacent vertex of the associated predetermined number of vertexes of the polygon P′i is associated with that of the polygon Pi+1, and subsequently, association may be repeated similarly in the clockwise direction. Apart from use of a distance between vertexes for determining a first vertex association, as described above, there can be used a method for using a characteristic quantity or a method for using an estimated vertex position.


[0146] (Method 7)


[0147] In addition, if the number of vertexes in the polygon P′i is equal to that in the polygon Pi+1, in one variation of the above method 1, there is obtained a distance between each of the vertexes of the polygon P′i which is obtained by moving the approximation polygon in the frame and each of the vertexes of the approximation polygon Pi+1 in the adjacent frame, so that there may be adopted a combination of association of vertexes such that a sum of distances between all of the vertexes is minimized, of combinations in which the vertexes of the polygon P′i are associated with those of the polygon Pi+1 by one-to-one. With respect to the above methods 2 and 3, a similar variation is applicable. In method 2, the sum of a difference in characteristic quantity may be employed, and in method 3, the sum of a distance between an estimated vertex position and an association vertex position may be employed.


[0148] Further, at this time, it is possible to consider a condition in which any sequence is not changed as described above, a condition based on one-to-one correspondence, or a condition based on one-to-one correspondence without changing the sequence of vertexes.


[0149] With respect to one object, if the number of vertexes in an approximation polygon that represents a region for the object may differ depending on a frame in a duration from a frame in which an object appears to a frame in which an object disappears, the following processing is performed so that the above case can be processed in a manner similar to a case in which the number of vertexes in the polygon P′i is equal to that in the polygon Pi+1.


[0150] It is assumed that the number of vertexes for obtaining a trajectory, i.e., the number of vertexes of an approximation polygon that represents a region for an object is defined as “s”, and the number of vertexes in the approximation polygon in each frame is equal to or smaller than “s”. When a frame has a number of vertexes s−Δs (Δs>0), one of the vertexes of the approximation polygon in that frame is regarded as a multiplexed vertex of the Δs+1 number of vertexes with their identical coordinates (alternatively, Δs vertexes are regarded as a multiplexed vertex of two vertexes with their identical coordinates or the other association using a similar technique is performed), whereby processing (methods 1 to 7) can be used in a manner similar to a case in which the number of vertexes is equal. There are a method for performing processing after determining vertexes to be processed in advance as being multiplexed, and a method in which a vertex processed as being multiplied is determined as a result of processing.


[0151] It is assumed that the number of vertexes for obtaining a trajectory, i.e., the number of vertexes of an approximation polygon that represents a region for an object is defined as “s”, and the number of vertexes in the approximation polygon in each frame is larger than “s”. When a frame has a number of vertexes s+Δs (Δs>0), the coordinates with regard to unassociated Δs number of vertexes of the approximation polygon of that frame is specially recorded as an isolate point; or such data is aborted as an isolate point (the vertexes are processed as an absence), whereby processing (methods 1 to 7) can be used in a manner similar to a case in which the number of vertexes is equal (there is a method for performing processing after determining vertexes to be regarded in advance as an isolate point, and a method in which a vertex regarded as an isolate point are determined as a result of processing).


[0152] In addition, if the number of vertexes for obtaining a trajectory, i.e., the number of vertexes of an approximation polygon that represents a region for an object is defined as “s”, and the number of vertexes in the approximation polygon in each frame changes from a value smaller than “s” to a value larger than “s”, the above processing may be selectively performed in accordance with whether or not the number of vertexes of the approximation polygon in the frame is equal to or smaller than “s” or exceeds “s”.


[0153] Now, function approximation processing using a function approximation unit 103 will be described below.


[0154] When the above polygon approximation and vertex association completes with regard to all frames in which objects exist, the trajectory of each coordinate of each vertex in the approximation polygon of each object is approximated by using a function, thereby reducing an amount of data. As function approximation, there may be employed any approximation technique such as linear approximation, quadric approximation, or spline function approximation.


[0155]
FIG. 8A and FIG. 8B are views each illustrating in more detail an outline of processing for approximating the trajectory of a vertex in an approximation polygon using the function approximation unit 103.


[0156] In FIG. 8A, reference numeral 331 designates one frame in a moving picture targeted for processing. Reference numeral 332 denotes each vertex when a region for an object is approximated by a polygon. Here, approximation is assumed to be done by a rectangle (four vertexes) in a series of frames for clarify. In addition, a description will be given, assuming that a spline function is used for function approximation.


[0157] In function approximation processing, a trajectory obtained when each vertex of an approximation polygon that represents each object is arranged in time series is approximated based on a spline function for each of the X and Y coordinates. In this example, the spline function in the X and Y coordinates is required for each of the four vertexes that are vertexes of an approximation polygon. Thus, a total of eight spline functions are generated.


[0158] Processing using the function approximation unit 103 include: a method for performing processing every time the coordinate value of the vertex of the approximation polygon in each frame concerning a region for the object is obtained (for example, a method for performing approximation every time the coordinate value of the vertex in each frame is obtained, and obtaining an approximation error, and then, properly setting an approximation interval, i.e., a frame interval so that such approximation error is lowered within a predetermined range); or a method for performing processing after the coordinate values of vertexes with regard to all frames concerning a region for the object are obtained.


[0159] In addition, depending on a method for performing vertex association processing using the vertex association unit 102, there are a case in which a trajectory of vertexes over all frames from a frame in which a region for the object appears to a frame in which it disappears is obtained (for example, a case in which the number of vertexes of the association polygon is equal over all the frames, and a vertex is associated with another vertex by one-to-one among all adjacent frames); and a case in which a trajectory of a vertex existing only at an interval in some of all the frames from a frame in which a region for the object appears to a frame in which it disappears (for example, a case in which the number of vertexes in an approximation polygon is different depending on the frames, and the trajectory of certain vertexes in a frame with a small number of vertexes is interrupted; or a case in which the trajectory of certain vertexes in such frame is interrupted because the corresponding vertexes in a frame correspond to each other by one-to-one, even if the number of vertexes in an approximation is equal over the all frames). In the latter case, the frame in which the trajectory of vertexes is interrupted is always defined as a node of an approximation function.


[0160] In FIG. 8B, reference numeral 333 represents an approximated spline function in a three-dimensional manner.


[0161] Data indicating the thus obtained approximation function is recorded in the object region data storage device 104 in accordance with a predetermined data format.


[0162] Hereinafter, the data format of object region data stored in the object region data storage device 104 will be described. A description will be given by exemplifying a case in which a vertex is approximated by a spline function. Of course, this is applied to a case in which such vertex is approximated by another function.


[0163]
FIG. 9 shows an example of the data format of object region data for recording a spline function that represents a region of an object in a moving picture.


[0164] ID number 400 is an identification number assigned to each object.


[0165] A starting frame number 401 and an ending frame number 402 are first and last frame numbers that define the existence of an object given by the ID number. Specifically, these numbers correspond to the numbers of frames in which an object appears and disappear in a moving picture. However, instead of the numbers of frames in which an object appears and disappears in a moving picture in reality, for example, an arbitrary frame number later than that assigned when an object appears in a moving picture may be assigned as a starting frame number. Similarly, an arbitrary frame number later than the starting frame number and earlier than that assigned when an object disappear in a moving picture may be assigned as an ending frame number. The starting/ending frame numbers can be substituted by the starting/ending time stamps.


[0166] Approximation polygon data 403 is trajectory data on vertexes for expressing an approximation polygon (for example, parameter for spline function or the like). Where one object is formed in a plurality of object regions, the approximation polygon data 403 is provided in plurality.


[0167]
FIG. 10 shows an example of a data structure of approximation polygon data 403.


[0168] The number of vertexes 431 indicates the number of vertexes for expressing an approximation polygon.


[0169] A pair of vertex trajectory data 432-1 and 432-2 are data regions concerning a spline function for expressing a trajectory of points representative of an approximation polygon. The spline function data on the X and Y coordinates is required for one vertex. Therefore, the vertex trajectory data for specifying such spline function exists by vertex number M×2. The sequence of vertexes to be recorded may be a predetermined sequence such as a sequential order of the vertexes, for example.


[0170]
FIG. 11 shows an example of a data structure of vertex trajectory data 432-1 and 432-2.


[0171] A node frame number 4320 represents a node of a spline function. This number indicates that polynomial data 4323 is effective up to this node. The number of polynomial coefficient data varies depending on the maximum order of the spline function. If the maximum order is defined as K, the number of coefficient data is defined as K+1. Thus, reference is made to a polynomial order 4321. The polynomial order 4321 is followed by polynomial coefficient data 4322 in number that corresponds to K+1.


[0172] In addition, a spline function is expressed by a polynomial between each of adjacent nodes, thus requiring polynomials in number that corresponds to the number of nodes. Therefore, data 4323 containing a node frame number 4320, polynomial coefficient 4322 or the like is repeatedly described in plurality. Where a node frame number is equal to a last frame, it means that the data 4323 is the last polynomial coefficient data. Thus, it is found that trajectory data ends.


[0173] As in the above mentioned variations, if the coordinate is recorded without causing association with regard to a vertex of an approximation polygon in an adjacent frame, the coordinate of such vertex and its frame number are described in data 4323.


[0174] In the meantime, a variety of other items of data may be added to the object region data shown in FIG. 9. For example, there may be added a variety of attribute data associated with an object of the ID number or data (such as file name, URL or any other address) indicating a method of access to associated information data associated with an object of the ID number. Associated information data may be characters, voices, still pictures, moving pictures or a combination thereof; or data describing programs or computer activities.


[0175] Where function approximation is not performed using a function approximation unit 103, for example, the coordinate value of each vertex of an approximation graphics in each frame may be recorded as it is in approximation polygon data 403 contained in the object region data shown in FIG. 9.


[0176] Now, a method for providing moving picture data or object region data will be described below.


[0177] Where the object region data created by the processing according to the present embodiment is provided for a user, it is required to provide object region data by using any method from a creator to a user. The data is provided in a variety of modes as exemplified below.


[0178] (1) moving picture data and its object region data are provided simultaneously by being recorded in one or plural recording media; and


[0179] (2) moving picture data is provided first, and then, object region data is provided simultaneously by being recorded in one or plural recording media.


[0180] The above modes are associated with a case in which the data are mainly provided by a recording medium. Otherwise, there can be provided a mode in which part of whole of the data is provided via a communication medium.


[0181] As has been described above, according to the present embodiment, there is provided an object region data generating method and apparatus for generating object region data that describes data concerning a region for an arbitrary object in moving picture data, wherein the region for the object in the moving picture data is approximated by using an approximation polygon with regard to each of the plural frames, each of plural vertexes of the approximation polygon in a preceding frame is approximated with a vertex of the approximation polygon in the current frame having a shortest distance between vertexes after the approximation polygon in the current frame is move to make the centers of gravity of both approximation polygons coincide with each other, whereby a trajectory of the associated vertexes is obtained by arranging the vertexes along the sequence of the frames. The object region data is generated based on the trajectory of the obtained plural vertexes.


[0182] According to the present embodiment, there is also provided an object region data generating method and apparatus for generating object region data that describes data concerning a region for an arbitrary object in moving picture data, wherein the region for the object in the moving picture data is approximated by using an approximation polygon with regard to each of the plural frames, each of plural vertexes of the approximation polygon in a preceding frame is approximated with a vertex of the approximation polygon in the current frame which is closest to an estimated position of the vertex based on a trajectory of the associated vertexes, whereby a trajectory of the associated vertexes is obtained by arranging the vertexes along the sequence of the frames. The object region data is generated based on the trajectory of the obtained plural vertexes.


[0183] According to the present embodiment, there is provided an object region data generating method and apparatus for generating object region data that describes data concerning a region for an arbitrary object in moving picture data, wherein the region for the object in the moving picture data is approximated by using an approximation polygon with regard to each of the plural frames, each of plural vertexes of the approximation polygon in a preceding frame is approximated with a vertex of the approximation polygon in the current frame having a closest characteristic quantity, whereby a trajectory of the associated vertexes is obtained by arranging the vertexes along the sequence of the frames. The object region data is generated based on the trajectory of the obtained plural vertexes.


[0184] A vertex association may be limited to a one-to-one association or such an association that the sequence of vertexes is not changed.


[0185] According to the present embodiment, there is provided an approximation polygon generating method and apparatus for generating an approximation polygon that approximates a region of an arbitrary object in one frame of a moving picture or an image, wherein a contour of a region of the object in one frame of a moving picture or a still picture is extracted; an initial polygon representing a region of the object is generated based on the extracted contour; a vertex to be deleted is selected based on the characteristic quantity of each vertex of a polygon at that time; the thus selected vertex is deleted; vertexes adjacent to the deleted vertex are moved; and the characteristic quantity of each vertex of the polygon is updated, and the processing is performed until a predetermined ending condition has been met, thereby generating an approximation polygon that approximates the region for the object in the frame.


[0186] As the characteristic quantity, there may be used an area for a triangle produced by the deleted vertex and the adjacent two vertexes thereto or an angle produced by the above deleted vertex and the adjacent two vertexes thereto.


[0187] As the ending condition, there may be used whether or not the number of vertexes in a polygon becomes smaller than a specified value or whether or not the approximation error value of the approximation polygon and the actual object is greater than the value.


[0188] According to the present embodiment, an object in a moving picture is represented as a trajectory of vertexes in an approximation polygon as follows.


[0189] A region in which an object exists in each frame is first detected, and an initial polygon is generated from a contour of such region. A vertex less affected by deletion is searched for in a polygon based on the characteristic quantity of each vertex, and is deleted. At this time, an increase of an error caused by vertex deletion is reduced by moving the adjacent vertexes. This processing is repeated until the ending condition has been met, thereby producing a polygon that represents object region data with a smaller number of vertexes.


[0190] Since a point on an object contour is searched for with a large amount of calculation only when an initial polygon is produced, this algorithm operates at a higher speed than a conventional algorithm. In addition, there is an advantage that an approximation error is small because a polygon is produced while an vertex is dynamically changed.


[0191] Region data with regard to each frame is generated, and vertexes are associated with each other between the adjacent frames, whereby a time-series variation of object is described as a trajectory of vertexes. For example, association is performed so that a distance between the corresponding vertexes is minimized. Then, the trajectory of each vertex is approximated by function, whereby an amount of data is reduced.


[0192] In addition, if an attempt is made to generate a trajectory of a vertex every time the object of each frame is inputted, the object region data on a frame to be generated is estimated based on the object region data on the already created frame, and the estimated object region data is deformed according to an object, whereby the trajectory of a vertex is produced.


[0193] In this manner, according to the present embodiment, it is possible to approximate the size, shape, movement, deformation or the like, of a region of a desired object in a moving picture, as a polygon, with high precision and high speed. In addition, the region of the desired object in the moving picture is described in a small amount of data, making it possible to facilitate generation of such object and data processing. As a result, in a moving picture database or a supervisory system and the like, this approximated polygon is used for object recognition, classification, search or the like.


[0194] In addition, according to the present invention, preferable function approximation is performed even if there is a possibility that the number of vertexes is different depending on each frame or there is a high possibility that vertexes between frames are associated with each other by object deformation or movement.


[0195] As has been described above, it is possible to approximate the size, shape, movement, deformation or the like, of a region of a desired object in a moving picture, as a polygon, with high precision and high speed. In addition, the region of the desired object in the moving picture is described in a small amount of data, making it possible to object generation or data processing. As a result, in a moving picture database or a supervisory system and the like, this approximated polygon is used for object recognition, classification, search or the like.


[0196] In addition, according to the present embodiment, preferable function approximation is performed even if there is a possibility that the number of vertexes is different depending on each frame or there is a plural possibilities that the vertexes between frames are associated with each other due to an object deformation or movement.


[0197] Further, the region for the object in the moving picture is described as a parameter for a curve that approximates a time series trajectory (trajectory of the coordinate of vertexes at which a frame number or a time stamp is defined as a variable). In this manner, the region of the object in the moving picture is expressed by a function parameter, and thus, there can be produced object region data in a small amount of data, the data being easily processed. In addition, an approximation polygon is easily generated from the parameter for the approximation parameter.


[0198] The second embodiment of the object region data generating method and apparatus and the approximation polygon generating method and apparatus according to the present invention will now be described with reference to the accompanying drawings.


[0199] The present embodiment is basically similar to the first embodiment. Hereinafter, differences from the first embodiment will be mainly described.


[0200] In the first embodiment, vertexes of an approximation polygon is obtained from a region of the object in each frame, and then, a correspondence of the vertexes of the approximation polygon is obtained in the adjacent two frames. In the second embodiment, each of the corresponding vertexes of an approximation polygon in a targeted frame is obtained by using the trajectory of each of the corresponding vertexes of the approximation polygon in the already obtained preceding frames. For example, this embodiment is efficient in a case in which processing is performed while a moving picture is inputted in a stream moving picture.


[0201] A configuration example of an object region data generating apparatus according to the present embodiment is similar to that shown in FIG. 1.


[0202] In addition, an example of processing flow using the object region data generating apparatus according to the present embodiment is similar to that shown in FIG. 2. However, the polygon generation processing at step S11 is performed only for the first frame in which an object appears. Alternatively, with respect to a first predetermined number of frames (several frames), a correspondence of a vertex of an approximation polygon of the current frame and that over a plurality of frames may be obtained in accordance with the processing at steps S11 and S12 in the first embodiment. Subsequently, the vertexes of the approximation polygon in the subsequent frames are obtained one after another in accordance with the processing at step S12 in the present embodiment.


[0203] Hereinafter, vertex association processing using a vertex association unit 102 in the present embodiment will be described with reference to FIG. 12 and FIG. 13.


[0204] At step S81, with respect to an object, the trajectory of each vertex of an approximation polygons in a plurality of already generated frames (for example, all frames or a predetermined number of closest frames) is approximated as a time-series function, as designated by reference numeral 601 shown in FIG. 13. This function approximation may be performed by using any approximation technique such as linear approximation, quadric approximation or spline approximation.


[0205] Next, at step S82, the vertex position in a frame to be processed is estimated based on an approximated function, thereby generating a vertex (vertexes) of a polygon estimated in the frame to be processed. An example of the estimated approximation polygon is designated by reference numeral 602 shown in FIG. 13.


[0206] If an object in each frame moves in a small quantity, a vertex (vertexes) of an approximation polygon in the closest frame is (are) employed intact as a vertex (vertexes) of an estimated polygon without performing function approximation.


[0207] In addition, if an object in each frame moves in large quantity, but deformation is small in quantity, the center of gravity of the approximation polygon in a frame to be generated is estimated from that of each of the approximation polygons in a plurality of the already generated frames, and then, a polygon obtained by moving an approximation polygon in the closest frame so that its center of gravity coincides with the estimated center of gravity may be used as an estimated approximation polygon.


[0208] At an initial stage of processing, if a small number of frames have been processed, the processing at steps S81 and S82 may be performed by a small number of such frames. Alternatively, an approximation polygon in the immediately preceding frame may be used as an estimated polygon without performing approximation at step S81. In addition, if an approximation polygon is first obtained for only one frame or a small number of frames, an approximation in the immediately preceding frame may be used as an estimated polygon without performing approximation at step S81.


[0209] Next, at step S83, an object region for a frame to be processed is read, and each vertex of the estimated approximation polygon is moved in accordance with such object region. An example of the object region for a inputted frame to be processed is designated by reference numeral 603 shown in FIG. 13.


[0210] There are a variety of moving methods including moving an object to a position on an object contour that is the closest from such estimated position for each vertex or moving an object by using a parameter optimizing method such as gradient approach or Newton approach so as to minimize an approximation error.


[0211] If a plurality of objects (i.e., approximation polygons) exist in the adjacent frames, as in the first embodiment, an object region determined to be associated with an object in the targeted frame is used.


[0212] An example wherein vertexes are thus moved to be associated with an object is designated by reference numeral 604 shown in FIG. 13. In addition, an example of the moved and generated approximation polygon is designated by reference numeral 605 shown in FIG. 13.


[0213] The present embodiment is similar to the first embodiment in function approximation processing using the function approximation unit 103.


[0214] In addition, it is possible to record the coordinate value of each vertex of the approximation polygon in each frame as it is in the object region data without performing function approximation using the function approximation unit 103. In the latter case, the function approximation unit 103 may not be provided.


[0215] As has been described above, according to the present embodiment, there is provided an object region data generating method and apparatus for generating object region data that describes data concerning a region of an arbitrary object in moving picture data over a plurality of continuous frames, wherein, based on data indicating a trajectory of position data on a plurality of vertexes of an approximation polygon that approximates the region for the object from the already generated starting frame over the immediately preceding frame, position data on a plurality of vertexes of the approximation polygon that approximates the region for the object in the frame to be processed is estimated, and is modified so as to be associated with a region for the object in the frame to be processed, whereby processing is performed for generating data indicating a trajectory of position data on a plurality of vertexes of an approximation polygon that approximates a region for the object from the starting frame to the target frame, and the object region data is generated based on a trajectory of the plurality of vertexes obtained by repeatedly performing such processing over the plurality of frames.


[0216] In order to estimate position data on vertexes of an approximation polygon in a frame to be processed, the trajectory of position data on vertexes of the approximation polygon over a plurality of already generated frames is approximated by function so as to estimate the position data on vertexes of the approximation polygon in the already generated closest frame.


[0217] In order to estimate position data on vertexes of the approximation polygon, the position data on the vertex of the approximation polygon in the already generated closest frame may be used.


[0218] In order to estimate position data on vertexes of the approximation polygon, the position of the center of gravity of the frame may be estimated from the center of gravity of the approximation polygon over a plurality of the already generated frames so as to use the position data on the vertexes of the approximation polygon when the approximation polygon in the already generated closest frame is moved to the positional center of gravity.


[0219] As described above, according to the present invention, it is possible to approximate the size, shape, movement, or deformation and the like, of a region of a desired object in a moving picture, as a polygon, with high precision and high speed. In addition, the region for the desired object in the moving picture is described in a small amount of data, making it possible to facilitate object generation or data processing. As a result, in a moving picture database or a supervisory system and the like, the approximated polygon is used for object recognition, classification, search or the like.


[0220] Further, according to the present invention, preferable function approximation is performed even if there is a possibility that the number of vertexes differs depending on each frame or there is a high possibility that vertexes between frames are associated with each other by object deformation and movement or the like.


[0221] The present invention according to the apparatus is established as that according to the method, and vice versa.


[0222] Furthermore, the present invention can be implemented as a computer readable recording medium that records a program for causing a computer to executing predetermined program codes (or causing a computer to function as predetermined means or causing a computer to implement a predetermined function).


[0223] As has been described above, according to the present invention, it is possible to approximate a region of a desired object in a moving picture as a polygon with high precision and high speed and the region of the desired object in the moving picture is described in small amount of data, making it possible to facilitate object generation or data processing.


[0224] Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the present invention in its broader aspects is not limited to the specific details, representative devices, and illustrated examples shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.


Claims
  • 1. A method of generating object region data relating to object regions in frames of moving picture, the frames lined up in a time-series variation, the method comprising: approximating the object region in each of the frames with a polygon, the polygon having vertexes; associating the vertexes in each of the frames with the respective vertexes in an adjacent frame of each of the frames; obtaining trajectories linking the vertexes associated together based on the time-series variation of the frames; and generating the object region data based on the trajectories.
  • 2. The method according to claim 1, wherein the vertexes in each of the frames are associated with the respective vertexes in the adjacent frame such that a distance between the vertexes in each of the frames and the respective vertexes in the adjacent frame is minimum.
  • 3. The method according to claim 2, wherein the distance between the vertexes in each of frames and the respective vertexes in the adjacent frame is calculated after a center of gravity of the polygon in the each of the frames coincides with a center of gravity of the polygon in the adjacent frame.
  • 4. The method according to claim 2, wherein each of trajectories is approximated by a predetermined function, and the object region data is generated by using a parameter for the function.
  • 5. The method according to claim 2, wherein the object region data includes position data of the vertexes of each of the frames and association data indicting correspondence of the vertexes between the frames.
  • 6. A method according to claim 1, wherein the associating the vertexes with the respective vertexes comprises estimating vertexes in the adjacent frame based on the vertexes in the frames preceding to the adjacent frame, and selecting the vertexes in the adjacent frame, that are closest to the estimated vertexes.
  • 7. The method according to claim 6, wherein each of the trajectories is approximated by a predetermined function, and the object region data is generated by using a parameter for the function.
  • 8. The method according to claim 6, wherein the object region data includes position data of the vertexes of each of the frames and association data indicting correspondence of the vertexes between the frames.
  • 9. The method according to claim 1, wherein the vertexes in each of frames have a characteristic quantity and the vertex in each of the frames and the vertex in the adjacent frame which have the closest characteristic quantity are associated with each other.
  • 10. The method according to claim 9, wherein each of the trajectories is approximated by a predetermined function, and the object region data is generated by using a parameter for the function.
  • 11. The method according to claim 9, wherein the object region data includes position data of the vertexes of each of the frames and association data indicting correspondence of the vertexes between the frames.
  • 12. A method of generating object region data relating to an object region in moving picture data having a plurality of frames, the method comprising: approximating the object region in each of the plurality of frames with a polygon; estimating a plurality of vertexes of the polygon in a given frame based on trajectory data indicating vertexes of the polygon from a first frame to an immediately preceding frame of the given frame; modifying the plurality of vertexes estimated so as to be associated with the object region in the given frame; obtaining trajectory data indicating vertexes of the polygon from the first frame to the given frame; and generating the object region data based on the trajectory data indicating vertexes of the polygon from the first frame to the given frame.
  • 13. The method according to claim 12, wherein each of the plurality of trajectories is approximated by a predetermined function, and the object region data is generated by using a parameter for the function.
  • 14. The method according to claim 124 wherein the object region data includes position data of the vertexes of each of the plurality of frames and association data indicting correspondence of the vertexes between the plurality of frames.
  • 15. A method of generating a polygon that approximates an object region in an image, the method comprising: extracting a contour of the object region to generate an initial polygon; obtaining a characteristic quantity of each of vertexes of the initial polygon; selecting the vertex to be deleted based on the characteristic quantity; deleting the selected vertex; correcting a position of vertexes adjacent to the deleted vertex, thereby performing processing for updating the polygon; and repeatedly performing the processing until a predetermined ending condition has been met, thereby generating the polygon that approximates the object region.
  • 16. An apparatus for generating object region data relating to object regions in frames of moving picture, the frames lined up in a time-series variation, the apparatus comprising: an approximation unit configured to approximate the object region in each of the frames with a polygon, the polygon having vertexes; an association unit configured to associate the vertexes in each of the frames with the respective vertexes in an adjacent frame of each of the frames; a trajectory unit configured to obtain trajectories linking the vertexes associated together based on the time-series variation of the frames; and a data generation unit configured to generate the object region data based on the trajectories.
  • 17. The apparatus according to claim 16, wherein the association unit associates the vertexes in each of the frames with the respective vertexes in the adjacent frame such that a distance between the vertexes in each of the frames and the respective vertexes in the adjacent frame is minimum.
  • 18. The apparatus according to claim 16, wherein the association unit comprises an estimation unit configured to estimate vertexes in the adjacent frame based on the vertexes in the frames preceding to the adjacent frame, and a selector configured to select the vertexes in the adjacent frame, that are closest to the estimated vertexes.
  • 19. The apparatus according to claim 16, wherein the vertexes in each of frames have a characteristic quantity and the vertex in each of the frames and the vertex in the adjacent frame which have the closest characteristic quantity are associated with each other.
  • 20. An apparatus for generating object region data relating to an object region in moving picture data having a plurality of frames, the apparatus comprising: an approximation unit configured to approximate the object region in each of the plurality of frames with a polygon; an estimation unit configured to estimate a plurality of vertexes of the polygon in a given frame based on trajectory data indicating vertexes of the polygon from a first frame to an immediately preceding frame of the given frame; a modifying unit configured to modify the plurality of vertexes estimated so as to be associated with the object region in the given frame; a trajectory unit configured to obtain trajectory data indicating vertexes of the polygon from the first frame to the given frame; and a data generation unit configured to generate the object region data based on the trajectory data indicating vertexes of the polygon from the first frame to the given frame.
  • 21. An apparatus for generating a polygon that approximates an object region in an image, the apparatus comprising: an extraction unit configured to extract a contour of the object region to generate an initial polygon; a characteristic quantity unit configured to obtain a characteristic quantity of each of vertexes of the initial polygon; a selector configured to select the vertex to be deleted based on the characteristic quantity; a deletion unit configured to delete the selected vertex; a correction unit configured to correct a position of vertexes adjacent to the deleted vertex, thereby performing processing for updating the polygon; and a controller configured to repeatedly perform the processing until a predetermined ending condition has been met, thereby generating the polygon that approximates the object region.
  • 22. An article of manufacture comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means comprising: first computer readable program code means for causing a computer to approximate the object region in each of the frames with a polygon, the polygon having vertexes; second computer readable program code means for causing a computer to associate the vertexes in each of the frames with the respective vertexes in an adjacent frame of each of the frames; third computer readable program code means for causing a computer to obtain trajectories linking the vertexes associated together based on the time-series variation of the frames; and fourth computer readable program code means for causing a computer to generate the object region data based on the trajectories.
  • 23. The article of manufacture according to claim 22, wherein the second computer readable program code means causes a computer to associate vertexes in each of the frames with the respective vertexes in the adjacent frame such that a distance between the vertexes in each of the frames and the respective vertexes in the adjacent frame is minimum.
  • 24. The article of manufacture according to claim 22, wherein the second computer readable program code means causes a computer to estimate vertexes in the adjacent frame based on the vertexes in the frames preceding to the adjacent frame, and select the vertexes in the adjacent frame, that are closest to the estimated vertexes.
  • 25. The article of manufacture according to claim 22, wherein the vertexes in each of frames have a characteristic quantity and the vertex in each of the frames and the vertex in the adjacent frame which have the closest characteristic quantity are associated with each other.
  • 26. An article of manufacture comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means comprising: first computer readable program code means for causing a computer to approximate the object region in each of the plurality of frames with a polygon; second computer readable program code means for causing a computer to estimate a plurality of vertexes of the polygon in a given frame based on trajectory data indicating vertexes of the polygon from a first frame to an immediately preceding frame of the given frame; third computer readable program code means for causing a computer to modify the plurality of vertexes estimated so as to be associated with the object region in the given frame; fourth computer readable program code means for causing a computer to obtain trajectory data indicating vertexes of the polygon from the first frame to the given frame; and fifth computer readable program code means for causing a computer to generate the object region data based on the trajectory data indicating vertexes of the polygon from the first frame to the given frame.
  • 27. An article of manufacture comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means comprising: first computer readable program code means for causing a computer to extract a contour of the object region to generate an initial polygon; second computer readable program code means for causing a computer to obtain a characteristic quantity of each of vertexes of the initial polygon; third computer readable program code means for causing a computer to select the vertex to be deleted based on the characteristic quantity; fourth computer readable program code means for causing a computer to delete the selected vertex; fifth computer readable program code means for causing a computer to correct a position of vertexes adjacent to the deleted vertex, thereby performing processing for updating the polygon; and sixth computer readable program code means for causing a computer to repeatedly perform the processing until a predetermined ending condition has been met, thereby generating the polygon that approximates the object region.
Priority Claims (1)
Number Date Country Kind
2000-077161 Mar 2000 JP