The present application, claims priority of the Indian provisional patent applications with serial number 1236/CHE/2012 and 1237/CHE/2012 filed on Apr. 30, 2012, and that applications are incorporated in its entirety at least by reference.
1. Technical Field
The embodiments herein, generally relate to image processing systems and methods and particularly relates to a method and system for recognizing an image and searching the image data in real time. The embodiments herein more particularly relates to a method and system for expediting real time image recognition on a user mobile device by performing feature extraction and enumerating feature ranking.
2. Description of the Related Art
An image generally has one or more enclosed/closed contours, which is a fixed or random area in the image depicting an object, a logo or a HD picture. The enclosed contour is made up of various regions whose characteristics changes with a variation in the scale of the enclosed contour, variation in the rotation of an angle of view of the enclosed contour or depends on affinity variation. The conventional methods use High Definition (HD) cameras to capture an image of an enclosed contour or logo. The captured image is processed through a plurality of applications installed on end user device like a PC, a laptop and a Smartphone. The ED cameras capture the image of a logo with a fixed background to ensure the accuracy of processing of the image.
The existing method classifies an image on the basis of intensity; color and visual orientation to provide recognition and description of successive frames of the captured images. Further classification of the image is done to map ideal features of the plurality of images. However, the mapping of the ideal image features is affected variably when the images in the successive frames render wide differences in respective of intensity, color, visual orientation and the like.
The existing technologies perform image recognition by segmenting the captured image into one or more connected regions. The segmented objects comprise significant information which is then processed by a processor application. However, the processing of the segmented connected regions is significantly affected by the variations in the image parameters such as a scale of the image, a view angle of the image and an affinity of the image. The variation in the plurality of the image parameters leads to improper outcome after processing of images. Since a segmented zone comprises one or more colors, the segmentation of the image on the basis of an area leads to inefficient processing. Similarly, processing of an image/logo with frequently varying background is highly cumbersome. As the background of the logo frequently changes in Television and motion pictures, the captured image of the logo suffers aberrations and frequent color variations. The processing of the logo becomes difficult as the background of the logo on the Television is dynamic in nature. Furthermore the recognition of the logo becomes difficult on the Television due to the raster lines created by Television, uneven lighting conditions and noise. Also the image of the Television logo which is captured by the smartphone is of low quality so the processing of the captured image is inappropriate.
Further the current technologies explain only a generic image recognition methods using mobile device and does not explain a contour or shape based image recognition. Another prior art uses key points based on only image processing, but not image recognition. The prior arts also tail to provide details regarding differentiating various shapes in a particular image.
In the view of foregoing, there is a need for a method and system for recognizing the images and enable image searching in a mobile device. There is also a need for a method and system for extracting contour data of a particular image area in a captured image with high efficiency. Further is a need for a method and system for recognizing the image based on a logo with dynamic background and to provide the relevant contents based on the recognized logo to the mobile device.
The above mentioned shortcomings, disadvantages and problems are addressed herein and which will be understood by reading and studying the following specification.
The primary objective of the embodiments herein is to provide a method and system for real time image recognition on a mobile device based on a ranking based procedure.
Another objective of the embodiments herein is to provide a method and system to track a plurality of feature points in the captured image and designate ranks to the tracked/matched feature points.
Another objective of the embodiments herein is to provide a method and system for image recognition based on an enclosed contour in a captured image on a mobile device.
Another objective of the embodiments herein is to provide a method and system for Image recognition of an enclosed contour based on a color pattern.
Another objective of the embodiments herein is to provide a method and system to segment a captured close contour where the edges are not prominent on the basis of color pattern.
Another objective of the embodiments herein is to provide a method and system for-image recognition based on the enclosed contour which is shape variant, angle variant, and affine invariant.
Another objective of the embodiments herein is to provide a method and system to match the features of the plurality of the images in the successive frames offline on the mobile phone.
Another objective of the embodiments herein is to provide a method and system for real time recognition of a logo on a plurality of Television channels with varying background through a mobile device.
Another objective of the embodiments herein is to provide a method and system for retrieval of relevant digital content based on the recognized logo.
These and other objects and advantages of the present embodiments will become readily apparent from the following detailed description taken in conjunction with the accompanying drawings.
The various embodiments of the present invention provide a method for real time image recognition on a mobile device. The method comprises of installing an image recognition application in the mobile device, capturing a plurality of images using the mobile device and recognizing a plurality of images in successive frames by ranking one or more feature points of the captured images through the image recognition application. The method of ranking one or more feature points of the captured images comprises of generating a random forest for the plurality of images, storing the generated random forest in a training module in an application server, passing the random forest to the mobile device, passing the captured images through an image recognition process on the mobile device, obtaining a plurality of features points in the captured images using a feature based algorithm, matching the image captured through the mobile device with the plurality of images stored in the random forest, designating a rank for the tracked feature points in the images, incrementing the designated ranks based on a repetition of the feature points in the images of the successive frames, determining one or more stable features of the images by ranking the features points based on a threshold and repetition, applying Ransac on the identified stable features, recognizing the matched image and delivering the content based on the recognized image.
According to an embodiment herein, the incremented ranks for the tracked feature points are matched with a predetermined threshold value in each frame through at least one of an inliers count and a Ransac percentage count.
According to an embodiment herein, the stable features comprise one or more feature points whose incremented rank equalize or cross the predetermined threshold value.
According to another embodiment herein, the method comprises recognizing an image based on an enclosed contour in a image. The method comprises of capturing the image of the enclosed contour through the mobile device, subjecting the captured image to the image recognition application, analyzing a color pattern of the enclosed contour through the image recognition application, extracting a shape of the enclosed contour from the identified color pattern, segmenting the enclosed contour into a plurality of connected regions based on the identified color pattern and the shape and transforming and normalizing the identified shapes to recognized the image contour.
According to an embodiment herein, the method of extracting the shape of the enclosed contour from the color pattern comprises of binarizing the image of the enclosed contour based on one or more image dependent techniques, performing blob segmentation of the image after binarization, normalizing each segmented blob for scaling and orientation, passing the segmented blob to a Zernike moment generator and storing the Zernike moments as descriptors to define the shape.
According to an embodiment herein, binarizing the image is performed based on at least one of a color; brightness threshold and adaptive threshold.
According to an embodiment herein, the method of normalizing the identified shape comprises of segmenting the binarized enclosed contour to fit into an elliptic region, obtaining the elliptical properties of the shape of the segmented and binarized contour, calculating the central moments, calculating the elliptical values derived, computing a new normalized contour, subjecting the new normalized contour to a descriptor computation process by convolving the normalized contour with one or more Zernike polynomials.
According to an embodiment herein, convolution of the normalized contour with the one or more Zernike polynomials provides a 36 dimensional contour descriptor. Here both magnitude component and a phase component are included to represent the contour shape in the form of descriptor.
According to another embodiment, herein, the method of extracting the shape of the enclosed contour is based on a scale space.
According to another embodiment, herein, the real time image recognition further comprising providing information on at least one logo included in at least one digital content in the mobile device. The method of providing information on logo, for instance, television logo comprises capturing the image of the logo from the digital content through the mobile device, extracting one or more features from the image of the logo, passing the extracted features through a K-dimensional tree, matching the extracted features with a plurality of pre-stored logos stored in a Random Forrest, recognizing the matched image based on stability of features on one or more preceding frames and delivering a content based on the recognized image of the logo to the mobile device. The logo is at least one of a symbol, text or a graphical image which represents an identity of a producer, content distributor or broadcasting network of the digital content. The method of recognizing logo further comprising initializing an image recognition application installed in the mobile device, recognizing the image of the logo by the image recognizing application, obtaining a key ID corresponding to the logo and getting the contents of the recognized image from an application server to the image recognition application based on the key ID.
According to an embodiment herein, the contents of the recognized logo is downloaded from the application server or streamed through the application server.
According to an embodiment herein, the digital content is a program content with varying background broadcasted on a television channel.
According to an embodiment herein, the method of generating the random forest for the plurality of images comprises calculating the feature points of the pre-stored training images, describing and labeling a data set for the one or more images, clustering the labeled data set using a K-means clustering, creating a K-dimensional tree for the clustered data based on the calculated feature points, generating an XML code and parsing the clustered data from the application server to the mobile device in the form of extensible markup language (XML).
According to an embodiment herein, the random forest is an ensemble classifier comprising a plurality of decision trees and adapted to provide a class, where the class is a mode of the classes output by one or more individual trees.
According to an embodiment herein, extracting one or more features from the image comprises calculating one or more feature points for the Image using a feature based algorithm.
Embodiments herein provide a system for real, time Image recognition on a mobile device. The system comprising a mobile device equipped with a camera, an image recognition application Installed in the mobile device, an application server, a processor means and a training module provided in the application server. The image recognition application in the mobile device is adapted for recognizing the plurality of images in successive frames and matching the captured image with one or more pre-stored images. The processor means is adapted for obtaining a plurality of features points in the captured images using a feature based algorithm, matching the plurality of feature points with the plurality of images stored in the random forest, designating a rank for the tracked feature points in the images, incrementing the designated ranks based on the repetition of the feature points in the images of successive frames and determining one or more stable features of the images, matching the stable features with the features belonging to the plurality of images stored in the random forest and recognizing the images based on the stable features.
The training module provided in the application server is adapted for storing a plurality of pre-loaded images and generating a random forest for the plurality of images.
According to an embodiment herein, tire processor means is further adapted for initiating the image recognition application to identify the image of an enclosed contour, analyzing a color pattern, a brightness threshold and an adaptive threshold of the enclosed contour, extracting a shape of the enclosed contour, segmenting the enclosed contour into a plurality of connected regions based on the shape and transforming and normalizing the identified shapes.
According to an embodiment herein, the image recognition application is a software application installed in the mobile device through which the captured image is analyzed and processed.
Embodiments herein further provide a system for identifying a logo on a Television with a varying background. The system comprises a mobile device equipped with a camera with which the user captures images of one or more television logos/normal logos, an image recognition application installed in the mobile device adapted for recognizing the image of the logo, obtaining a key ID corresponding to the recognized logo and extracting contents for the recognized logo. The system further comprises an application server and a training module provided in the application server adapted for storing a plurality of training images of logos and constructing a random forrest for facilitating the logo search.
These and other aspects of the embodiments herein will be better appreciated and understood when considered in conjunction with the following description and the accompanying drawings. It should be understood, however, that the following descriptions, while indicating preferred embodiments and numerous specific details thereof are given by way of illustration and not of limitation. Many changes and modifications may be made within the scope of the embodiments herein without departing from the spirit thereof, and the embodiments herein include ail such modifications.
The other objects, features and advantages will occur to those skilled in the art from the following description of the preferred embodiment and the accompanying drawings in which;
Although the specific features of the present embodiments are shown in some drawings and not in others. This is done for convenience only as each feature may be combined with any or all of the other features in accordance with the present embodiments.
In the following detailed description, a reference is made to the accompanying drawings that form a part hereof, and in which the specific embodiments that may be practiced is shown by way of illustration. These embodiments are described in sufficient detail to enable those skilled in the art to practice the embodiments and it is to be understood that the logical, mechanical and other changes may be made without departing from the scope of the embodiments. The following detailed description is therefore not to be taken in a limiting sense.
The various embodiments of the present invention provide a method for real time image recognition on a mobile device. The method comprises of installing an image recognition application in the mobile device, capturing a plurality of images using the mobile device and recognizing a plurality of images in successive frames by ranking one or more feature points of the captured images through the image recognition application. The method of ranking one or more feature points of the captured images comprises of generating a random forest for the plurality of images, storing the generated random forest in a training module in an application server, passing the random forest to the mobile device, passing the captured images through an image recognition process on the mobile device, obtaining a plurality of features points in the captured images using a feature based algorithm, matching the image captured through the mobile device with the plurality of images stored in the random forest, designating a rank for the tracked feature points in the images, incrementing the designated ranks based on a repetition of the feature points in the images of the successive frames, determining one or more stable features of the images by ranking the features points based on a threshold and repetition, applying Ransac on the identified stable features, recognizing the matched image and delivering the content based on the recognized image.
The incremented ranks for the tracked feature points are matched with a pre-determined threshold value in each frame through at least one of an inliers count and a Ransac percentage count. Here the stable features comprise one or more feature points whose incremented rank equalize or cross the predetermined threshold value.
The method of recognizing an image based on an enclosed contour in an image comprises of capturing the image of the enclosed contour through the mobile device, subjecting the captured image to the image processor application, analyzing a color pattern of the enclosed contour through the image recognition application, extracting a shape of the enclosed contour from the identified color pattern, segmenting the enclosed contour into a plurality of connected regions based on the identified color pattern and the shape and transforming and normalizing the identified shapes. Extracting the shape of the enclosed contour from the color pattern comprises of binarizing the image of the enclosed contour based on one or more image dependent techniques, performing blob segmentation of the image after binarization, normalizing each segmented blob for scaling and orientation, passing the segmented blob to a Zernike moment generator and storing the Zernike moments as descriptors to define the shape.
The binarization of the image is performed based on at least one of a color, brightness threshold and adaptive threshold.
Normalizing the identified shape of the enclosed contour comprises of segmenting the binarized enclosed contour to lit into an elliptic region, obtaining the elliptical properties of the shape of the segmented and binarized contour, calculating the central moments, calculating the elliptical values derived, computing a new normalized contour, subjecting the new normalized contour to a descriptor computation process by convolving the normalized contour with one or more Zernike polynomials. The convolution of the normalized contour with the one or more Zernike polynomials provides a 36 dimensional contour descriptor. Here both magnitude component and a phase component are included to represent the contour shape in the form of descriptor.
In one embodiment herein, the method of extraction of the shape of the enclosed contour is based on a scale space.
The real time image recognition further comprises of providing information on at least one logo included in at least one digital content in the mobile device. The method of providing Information on logo, for instance, television logo comprises capturing the image of the logo from the digital content through the mobile device, extracting one or more features from the image of the logo, passing the extracted features through a K-dimensional tree, matching the extracted features with a plurality of pre-stored logos stored in a Random Forrest, recognizing the matched image based on stability of features on one or more preceding frames and delivering a content based on the recognized image of the logo to the mobile device. The logo is at least one of a symbol, text or a graphical image which represents an identity of a producer, content distributor or broadcasting network of the digital content. The method of recognizing logo further comprising initializing an image recognition application installed in the mobile device, recognizing the image of the logo by the image recognizing application, obtaining a key ID corresponding to the logo and getting the contents of the recognized image from an application server to the image recognition application based on the key ID.
The contents of the recognized logo is downloaded from the application server or streamed through the application server. The digital content is a program content: with varying background broadcasted on a television channel.
The method of generating the random forest, for the plurality of images comprises calculating the feature points of the pre-stored training images, describing and labeling a data set for the one or more images, clustering the labeled data set using a K-means clustering, creating a K-dimensional tree for the clustered data based on the calculated feature points, generating an XML code and parsing the clustered data from the application server to the mobile device in the form of extensible markup language (XML). The random forest is an ensemble classifier comprising a plurality of decision trees and adapted to provide a class, where the class is a mode of the classes output by one or more individual trees.
Embodiments herein provide a system for real time image recognition on a mobile device. The system comprising a mobile device equipped with a camera a plurality of images, an image recognition application installed in the mobile device, an application server, a processor means and a training module provided in the application server. The image recognition application in the mobile device is adapted for recognizing the plurality of images in successive frames and matching the captured image with one or more pre-stored images. The processor means is adapted for obtaining a plurality of features points in the captured images using a feature based algorithm, matching the plurality of feature points with the plurality of images stored in the random forest, designating a rank for the tracked feature points in the images, incrementing the designated ranks based on the repetition of the feature points in the images of successive frames determining one or more stable features of the images, matching the stable features with the features belonging to the plurality of images stored in the random forest and recognizing the images based on the stable features.
The training module provided in the application server is adapted for storing a plurality of pre-loaded images and generating a random forest for the plurality of images.
The processor means is further adapted for initiating the image recognition application to identify the image of an enclosed contour, analyzing a color pattern, a brightness threshold and an adaptive threshold of the enclosed contour, extracting a shape of the enclosed contour, segmenting the enclosed contour into a plurality of connected regions based on the shape and transforming and normalizing the identified shapes.
The image recognition application is a software application installed in the mobile device through which the captured image is analyzed and processed.
Embodiments herein further provide a system for identifying a logo on a Television with a varying background. The system comprises a mobile device equipped with a camera with which the user captures images of one or more television logos, an image recognition application installed in the mobile device adapted for recognizing the image of the logo, obtaining a key ID corresponding to the recognized logo and extracting contents for the recognized logo. The system further comprises an application server and a training module provided in the application server adapted for storing a plurality of training images of logos and constructing a random forrest for facilitating the logo search.
The image recognition application 103 installed in the mobile device provides recognition of a plurality of images in the successive frames. The image recognition application 103 matches the captured image with the image provided by the training module 106 in the application server 105. Alternatively, instead, of recognizing the captured image in the mobile device, the image recognition application 103 uploads the captured images to the application server 105 and the matching process of the uploaded image is performed in the application server 105.
According to an embodiment herein, the processing and identification of an image is done in real time in high end mobile devices 101 such as smart phones. For the low end mobile devices 101, the captured images are sent to the application server for processing. The training module 106 provided in the application server stores a plurality of pre-loaded images and constructs a random forest tree for the plurality of images. The images are further recognized based on the stable features.
According to an embodiment herein, the central moments are computed by given equations:
The Elliptical values derived from above mentioned equations are further computed using the following equations:
From a major axis length (2a), a minor axis length (2b) and an angle (θ) are used to compute a new normalized image (x′, y′):
According to an embodiment herein, for high end smart phones the processing and identification of a logo is done on real time within the smart phone itself. For the tow end mobile devices the image processing is done at the application server. The method for recognizing an image of a Television logo within the mobile device comprises installing and initiating an image recognition application. The user captures the image of a Television logo in real time. The captured logo is then processed through the image recognition application. The image recognition application calculates the features of the image though the feature based algorithm. The features are transferred to the K-dimensional tree for searching. The K-dimensional tree is stored in the training module of the application server. The image recognition application uses BestBin Search method for searching the features in the K-dimensional tree. The outlines and mismatched features are removed by the image recognition application by using a Ransac method. The image matching is done on the basis of the non-linear and Ransac percentage count. The content of the image of the logo is sent to the application server on the basis of a key ID. The key ID is generated corresponding to the individual Television logo. The contents for the recognized logo is downloaded or streamed through the application server. The contents for the recognized logo is any one of a single tone audio, a multiple tone audio, a two-dimensional image (2D), a 2D video, a three-dimensional image (3D), a 3D video and a text.
The foregoing description of the specific embodiments will so fully reveal the general nature of the embodiments herein that others can, by applying current knowledge, readily modify and/or adapt for various applications such specific embodiments without departing from the generic concept, and therefore, such adaptations and modifications should and are intended to be comprehended within the meaning and range of equivalents of the disclosed embodiments, it is to be understood that the phraseology or terminology employed herein is for the purpose of description and not of limitation. Therefore, while the embodiments herein have been described in terms of preferred embodiments, those skilled in the art will recognize that the embodiments herein can be practiced with modification.
Number | Date | Country | Kind |
---|---|---|---|
1236/CHE/2012 | Apr 2012 | IN | national |
1237/CHE/2012 | Apr 2012 | IN | national |