The present application claims the priority to Chinese Patent Application No. 201310292056.4, titled “BANKNOTE RECOGNITION AND CLASSIFICATION METHOD AND SYSTEM”, filed on Jul. 11, 2013 with the State Intellectual Property Office of People's Republic of China, which is incorporated herein by reference in its entirety.
The disclosure relates to the technical field of banknote recognition systems, and particularly to a method for recognizing and classifying banknotes and a system thereof.
At present, in banknote processing devices for the financial field such as cash circulator, banknote sorter, etc, a banknote recognition system has two main parts: a banknote classification learning system and a banknote recognition system, of which schematic structural diagrams are shown in
For higher robustness of the banknote recognition system, i.e., to eliminate interference exerted by the quality of samples to be recognized on the recognition result as far as possible, abundant and diversified samples to be learned are normally input in the learning of the banknote classifier. When selecting samples, besides considering banknote samples in brand new condition, banknote samples in various conditions and banknote samples with contamination, incompletion, crack and folds in varying degrees need to be considered. Thus there is a large number of samples to be selected. Difficulty of the sample selection lies in the difficulty in collecting all types of banknotes in circulation, and particularly, for developing an algorithm with respect to foreign banknotes, it is almost impossible to collect a complete set of banknote samples. Generally speaking, the banknote samples to be learned mainly include the following types for selection: brand new condition, 80%-90% new condition, 70%-80% new condition, 0-70% new condition, contamination in varying degrees, incompletion in varying degrees, crack in varying degrees, and fold in a certain region, and if 30 banknotes are to be selected as samples for one type, 240 actually circulating banknotes satisfying the conditions are totally needed. If all needed types of banknotes may be completely collected to design a classifier, precision of the classifier may be ensured, and if the types of the samples are inadequate, it is possible that the precision of the classifier does not satisfy application requirement. However, to completely collect the various needed types of banknote samples in circulation, a large number of human resources and material resources may be needed, thereby affecting cost and efficiency for developing banknote recognition product; in other words, without extra cost spent to collect and screen the samples, the designed classifier may have decreased precision.
Hence, under the situation of limited number of selectable banknote samples, it is urgent for those skilled in the art to provide a method for recognizing and classifying banknotes and a system thereof to reduce extra cost while ensuring an improved classifier precision.
In view of this, a method for recognizing and classifying banknotes and a system thereof are provided according to the disclosure to conquer a conventional problem that extra cost may not be reduced while ensuring classifier precision due to the case that adequate variety of needed samples can not be ensured when actual samples are collected.
To achieve the above purpose, the technical solutions provided according to the disclosure are as follows.
A method and for recognizing and classifying banknotes includes:
acquiring sample information of brand new banknotes to be learned and banknote sample information to be recognized;
establishing, according to a preset rule, a banknote sample signal degeneration model;
inputting the sample information into the banknote sample signal degeneration model to acquire various banknote sample information corresponding to the brand new banknotes to be learned;
inputting the various banknote sample information to perform classifier learning, and outputting a banknote classification model; and
performing sample signal preprocessing and feature extraction on the sample information to be recognized, performing classification decision on the banknote to be recognized by using the classification model, and outputting a final classification result.
The banknote sample signal degeneration model includes: a banknote condition degeneration model established based on linear change of image brightness and a banknote image degeneration model established based on randomness of a statistic model.
The banknote image degeneration model includes signal degeneration models for banknote contamination, banknote incompletion, banknote crack, and banknote fold or deflection, and the banknote condition degeneration model includes degeneration models for banknotes in brand new condition, banknotes in 80%-90% new condition, banknotes in 70%-80% new condition, and banknotes in 0-70% new condition.
The establishing a banknote condition degeneration model according to a preset rule includes:
analyzing a gray distribution f(x)=ax+b of an image for a banknote of a specific denomination of a specific currency, and dividing, according to gray similarity, the banknote of the specific denomination of the currency into five regions;
selecting a set of samples in brand new condition, and performing statistics on average gray value G for each banknote in the sample set;
selecting a set of samples in one of the conditions, and performing statistics on average gray value g for respective regions of each sample;
matching the average gray values G to the average gray values g respectively;
combining every two of the formulas f(x)=ax+b for the five regions to calculate a and b for each formula; and
selecting a set of samples in brand new condition, and calculating average gray value for each region of all banknote images, where each average gray value corresponds to a mapping to the gray distribution f(x)=ax+b.
The establishing a banknote contamination degeneration model according to a preset rule includes:
presetting that a banknote contamination region is circular and a stain is circular, and each banknote only have one contamination region; and
determining, according to statistics analysis, that probability density curves for a position of the contamination region and a position of the stain in the contamination region are constants, i.e., the probability density curves are in uniform distribution X˜U(a,b) and probability density curve for a size of the contamination region and probability density curves for size, density and gray value of the stain are in normal distribution X˜N(μ,σ2).
The establishing a banknote incompletion degeneration model according to a preset rule includes:
determining, according to statistics analysis, a position, a size and a shape of an incompletion, where a probability density curve of the position of the incompletion is a constant;
a probability density curve of the size of the incompletion is in normal distribution; and
the shape of the incompletion is polygon which is any one of trigon to octagon, convex polygon or concave polygon, and a probability density curve of the shape of the incompletion is a constant.
The establishing a banknote folding or deflection degeneration model includes:
dividing the banknote into two columns and two rows to form four uniform rectangular regions each having a long side and a short side which belong to edges of the banknote;
randomly selecting one of the regions, randomly selecting one point of the short side of the region, and randomly selecting another point of the long side of the region;
determining whether a distance between the two points, i.e., the distances x (a distance on the long side) and y (a distance on the short side) from the two points to the vertex, satisfy a constraint condition of √{square root over (x2+y2)}<k, x<m, y<n, if the distance between the two points satisfies the constraint condition, proceeding to a next step, and if the distance between the points does not satisfy the constraint condition, returning to the previous step; and
filing a deflection region, which has an edge being a straight line determined by the two points and has a point beyond the edge, with background color.
The establishing a banknote crack degeneration model according to a preset rule includes:
randomly acquiring a line segment s with a length of L on the boundary of the banknote, where L is in uniform distribution, Lε(0,MaxL), and MaxL is a maximum length of the boundary of the banknote;
determining a position of another point N, where a distance between the point N and a midpoint M of the line segment s is 1, and an angle between the line segment MN and the line segment s is, where lε(0,Maxl), the angle αε(π/3,2π/3), and α and l are in normal distribution; and
determining a triangle region bounded by the point N and the segment line s as the crack region, and filling the crack region with the background color.
A system for recognizing and classifying banknotes is disclosed according to the disclosure. The system includes:
an acquiring unit configured to acquire sample information of brand new banknotes to be learned and banknote sample information to be recognized;
a model establishing unit configured to establish a banknote sample signal degeneration model according to a preset rule;
an inputting unit configured to input the sample information into the banknote sample signal degeneration model to acquire various banknote sample information corresponding to the brand new banknotes to be learned;
a classifier learning unit configured to input the various banknote sample information to perform classifier learning, and output a banknote classification model; and
a classification result outputting unit configured to perform sample signal preprocessing and feature extraction on the sample information to be recognized, perform classification decision on the banknote to be recognized by using the classification model, and output a final classification result.
It can be known from above technical solutions that compared with conventional technology, a method for recognizing and classifying banknotes and a system thereof are disclosed according to the disclosure, and the method includes: acquiring sample information of brand new banknotes to be learned and banknote sample information to be recognized; establishing, according to a preset rule, a banknote sample signal degeneration model; inputting the sample information into the banknote sample signal degeneration model to acquire various banknote sample information corresponding to the brand new banknotes to be learned; inputting the various banknote sample information to perform classifier learning, and outputting a banknote classification model; performing sample signal preprocessing and feature extraction on the sample information to be recognized, performing classification decision on the banknote to be recognized by using the classification model, and outputting a final classification result. In the method, large amount of existing samples which are reliable and easily accessible are used to statistically establish a sample signal degeneration model which satisfies application requirement, to simulate the states of banknotes such as brand new condition, 80%-90% new condition, 70%-80% new condition, 0-70% new condition, contamination in varying degrees, incompletion in varying degrees, crack in varying degrees and folds in some regions, then classifier learning is performed, and classification recognition is performed on the sample to be recognized, thereby accurately acquiring a classification result, and decreasing cost and efficiency for developing banknote recognition product while ensuring improvement of classifier precision.
To illustrate the technical solutions according to the embodiments of the present disclosure or technical solutions in conventional technology more clearly, the drawings involved in the embodiments of the present disclosure or in the conventional technology are introduced briefly in the following. Apparently, the drawings described below are only embodiments of the disclosure, and persons of ordinary skills in the art can derive other drawings according to the drawings without any creative effort.
Technical solutions of the embodiments of the present disclosure are illustrated completely and clearly with the following drawings of the embodiments of the disclosure. Apparently, the described embodiments are merely a few rather than all of the embodiments of the present disclosure. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.
A method for recognizing and classifying banknotes and a system thereof are disclosed according to the disclosure. The method includes: acquiring sample information of brand new banknotes to be learned and banknote sample information to be recognized; establishing, according to a preset rule, a banknote sample signal degeneration model; inputting the sample information into the banknote sample signal degeneration model to acquire various banknote sample information corresponding to the brand new banknote to be learned; inputting the various banknote sample information to perform classifier learning, and outputting a banknote classification model; performing sample signal preprocessing and feature extraction on the sample information to be recognized, performing classification decision on the banknote to be recognized by using the classification model, and outputting a final classification result. In the method, large amount of existing samples which are reliable and easily accessible are used to statistically establish a sample signal degeneration model which satisfies application requirement, to simulate the states of banknotes such as brand new condition, 80%-90% new condition, 70%-80% new condition, 0-70% new condition, contamination in varying degrees, incompletion in varying degrees, crack in varying degrees and folds in some regions, then classifier learning is performed, and classification recognition is performed on the sample to be recognized, thereby accurately acquiring a classification result, and decreasing cost and efficiency for developing banknote recognition product while ensuring improvement of classifier precision.
Step 101 includes: acquiring sample information of brand new banknotes to be learned and banknote sample information to be recognized.
For higher robustness of the recognition system, abundant and diversified samples need to be input to design a classifier. However, in the field of circulation, in particular to design an algorithm for recognizing foreign banknotes, it is almost impossible to completely collect all needed types of banknote samples to be learned. Hence, in this solution, since brand new banknote samples are easily accessible, it is designed to acquire the sample information of brand new banknotes to be learned. According to these brand new banknotes, various banknotes are simulated. Generally speaking, banknote samples to be learned may be roughly selected from following types: brand new condition, 80%-90% new condition, 70%-80% new condition, 0-70% new condition, contamination in varying degrees, incompletion in varying degrees, crack in varying degrees, and fold in a certain region.
Step 102 includes: establishing a banknote sample signal degeneration model according to a preset rule.
Based on the acquired sample information of brand new banknotes to be learned, the banknote sample signal degeneration model is established according to the preset rule. The establishment of the degeneration model includes establishing a banknote condition degeneration model based on linear change of image brightness, and establishing a banknote image degeneration model based on randomness of a statistic model.
The banknote image degeneration model includes signal degeneration models for banknote contamination, banknote incompletion, banknote crack, and banknote fold or deflection, and the banknote condition degeneration model includes degeneration models for banknotes in brand new condition, banknotes in 80%-90% new condition, banknotes in 70%-80% new condition, and banknotes in 0-70% new condition.
Step 103 includes: inputting the sample information into the banknote sample signal degeneration model, to acquire various banknote sample information corresponding to the brand new banknotes to be learned.
Step 104 includes inputting the various banknote sample information to perform classifier learning, and outputting a banknote classification model.
Step 105 includes: performing sample signal preprocessing and feature extraction on the sample information to be recognized, performing classification decision on the banknote to be recognized by using the classification model, and outputting a final classification result.
After a banknote is used for a period, due to characteristics of paper, paper fiber suffers a certain degree of wear or a certain accumulation of dirt, thus the banknote image has a decreased overall gray value. Through statistical analysis, gray value for each pixel point of the image changes linearly, i.e., y=f(x), theoretically f(x) may be fitted through a certain amount of sample data, but the form of f(x) is hard to be determined, and through lots of sample experiments, an intuitive method adaptive to engineering implementation is provided.
A sample set with abundant and diversified banknotes is used to establish parameters for a condition degeneration model. It is assumed that f(x)=ax+b. However, the mapping may not be applicable to each point in the banknote image. Through analysis, in a region with originally high gray value, change in gray value is relatively more significant, and in a region with originally low gray value, change in gray value is relatively less significant, that is, f(x) is different in different gray regions.
Step 201: analyzing a gray distribution f (x)=ax+b of an image for a banknote of a specific denomination of a specific currency, and dividing, according to gray similarity, the banknote of the specific denomination of the specific currency into five regions.
As shown in
Step 202: selecting a set of samples in brand new condition, and performing statistics on average gray value G for each banknote in the sample set,
i.e., G1i, G2i, G3i, G4i and G5i, where i=1, 2, 3, . . . , n, and the sample set has n samples.
Step 203: selecting a set of samples in one of the conditions, and performing statistics on average gray value g for respective regions of each sample.
For example, taking banknotes in 80%-90% new condition as an example, statistics is performed on average gray values g11i, g112, g13i, g114, and g15i for regions of respective samples, where i=1, 2, 3, . . . , n, and the sample set has n samples.
Step 204: matching the average gray values G to the average gray values g respectively.
(1) f1(x) in region 1 is fitted, the average gray values acquired in step 2 and step 3 are respectively matched, {G1i, g11i}, where i=1, 2, 3, . . . , n, and there are total n groups of data.
(2) At least two groups of data are needed to calculate a1 and b1, every two groups of data which are acquired in step (1) are combined, and respective values of a1m and b1m are calculated, i.e., {G11, g111} and {G12, g112} are combined to acquire {a11, b11}, {G11, g111} and {G12, g112} are combined to acquire {a12, b12}, by that analogy, {a11, b11}, {a12, b12}, . . . {a1m, b1m} are acquired, where m=n/2.
(3) A data distribution of the set of {a1m, b1m} is analyzed, abnormal data is removed, a median of data in the set (or an average value, or a value determined by another rule) is used as (a1, b1).
(4) Similarly, steps (1)-(3) are repeated, thus values in other regions, i.e., (a2, b2), (a3, b3), (a4, b4) and (a5, b5) are calculated.
Step 205: combining every two of the formulas f(x)=ax+b for the five regions to calculate a and b for each formula.
Step 206: selecting a set of samples in brand new condition, and calculating average gray value for each region of all banknote images, where each average gray value corresponds to a mapping to the gray distribution f(x)=ax+b.
That is, the average gray values are G1, G2, G3, G4 and G5, of which mappings are G1−f1(x), G2−f2(x), G3−f3(x), G4−f4(x) and G5−f5(x).
According to the established banknote condition degeneration model, through statistics and fitting for a large number of data, for banknote images with similar textures, 0-255 gray levels may be divided into 16 gray segments each corresponding to a degeneration mapping, i.e., (0x00-0x0F)−f1(x), (0x10-0x1F)−f2(x), (0x20-0x2F)−f3(x), (0x30-0x3F)−f4(x) (0x40-0x4F)−f5(x), (0x50-0x5F)−f6(x), (0x60-0x6F)−f7(x), (0x70-0x7F)−f8(x), (0x80-0x8F)−f9(x), (0x90-0x9F)−f10(x), (0xA0-0xAF)−f11(x), (0xB0-0xBF)−f12(x), (0xC0-0xCF)−f13(x), (0xD0-0xDF)−f14(x), (0xE0-0xEF)−f15(x) and (0xF0-0xFF)−f16(x).
After the banknote condition degeneration model is established, steps of simulating condition of banknote images with insufficient samples are as follows. It is assumed that the banknote image to be processed is in brand new condition.
A first step includes: dividing, according to a gray distribution of a banknote image, the image into a plurality of regions, and calculating average gray value for each region.
A second step includes: determining a corresponding degeneration function according to the average gray value for each region acquired in the first step. For example, as shown in
A third step includes: performing corresponding degeneration mappings on gray vales for respective pixel points in each region in turn to acquire gray values for the respective pixel points after degeneration until all pixel points of the image are mapped.
Contamination, incompletion, crack and fold may be seen as special image noise for establishing relevant models, which are different from traditional noise; the noise generated from the traditional noise model is in the form of singular random points, and the noise generated from the noise model proposed in these embodiments is in the form of points in a random region, which have a special feature as well as a certain randomness.
The contamination noise mainly has features of shape, size and position of a contamination region, density of stains in the region, and shape, size and gray value for each stain.
Step 301 includes: presetting that the banknote contamination region is circular and the stain is circular, and each banknote only have one contamination region.
Step 302 includes: determining, according to statistics analysis, that probability density curves for the position of the contamination region and a position of the stain in the contamination region are constants, i.e., the probability density curves are in uniform distribution X˜U(a,b), and a probability density curve for a size of the contamination region and probability density curves for size, density and gray value of the stain are in normal distribution X˜N(μ,σ−2).
The probability density curve for the position of the contamination region is a constant, i.e., the contamination region may appear, with equal probability, at any position of the banknote.
Through statistics analysis on the size of the contamination region and the density of the stains in the region, the size (radius) of the contamination region is in normal distribution with an average value of μ11 and a variance of σ11, the density of the stains is irrelevant to the size of the contamination region, and the probability density of the stains satisfies an independent normal distribution with an average value of μ12 and a variance of σ12.
The probability density curve of the position of the stain in the region is a constant, i.e., the stain appears, with equal probability, at any position of the region.
Probability density curves for the size of the stain and the gray value of the stain are in independent normal distribution respectively; the size of the stain has an average value of μ13 and a variances of σ13, the gray value of the stain has an average value of μ4 and a variance of σ14.
A first step includes: randomly generating a special position in a banknote region according to a probability density curve of a position of a contamination region.
A second step includes: randomly generating, according to a probability density curve of a size of the contamination region, a radius value, and determining the contamination region and the size thereof by using the position of the point generated in the first step as a center of a circle.
A third step includes: randomly generating a density value according to a probability density function of density of stains in the contamination region, and determining a quantity of the stains in the region.
A fourth step includes: determining position, size and gray value for each stain in the region, marking each stain in the region sequentially, and randomly determining corresponding values according to probability density curves respectively.
The fourth step includes following sub-steps:
(1) randomly generating coordinate values for a stain in the region according to the probability density curve of the position of the stain in the region;
(2) randomly generating a radius of the stain according to the probability density curve of the size of the stain, and determining the position and the size of the stain by using the coordinate point in step (1) as the center of the stain;
(3) randomly generating the gray value of the stain according to the probability density curve of the gray value of the stain; and
(4) determining whether the stain is the last point in the contamination region; if the stain is the last point in the contamination region, proceeding to a fifth step; and if the stain is not the last point in the contamination region, returning to step (1) and continually generating a stain.
The fifth step includes fusing the generated noise with the original image.
Step 401 includes: determining, according to statistics analysis, a position, a size and a shape of an incompletion, where a probability density curve of the position of the incompletion is a constant,
i.e., the incompletion appears, with equal probability, at any position of the banknote.
Step 402 includes: determining that a probability density curve of the size of the incompletion is in normal distribution with an average value of μ21, and a variance of σ21.
Step 403 includes: determining that the shape of the incompletion is polygon which is any one of trigon to octagon, convex polygon or concave polygon, and a probability density curve of the shape of the incompletion is a constant, i.e., the incompletion is in any shape with equal probability.
A first step includes: randomly determining a special position in the banknote region according to the probability density curve of a position of an incompletion region.
A second step includes: randomly generating a radius of the incompletion region according to the probability density curve of a size of the incompletion, and using the coordinates of the position acquired in the first step as a circle center of the region.
A third step includes: determining a shape of the incompletion region. Specially, the third step are implemented as following steps:
(1) determining a circle region with the circle center generated in the first step and the radius generated in the second step;
(2) randomly generating n, where the incompletion is a polygon with n edges, and n is in uniform distribution, where nε[3,8],nεZ;
(3) evenly dividing the circle region acquired in step (1) into n fan regions by using the circle center as a center;
(4) acquiring one random point from each fan region, where the point locates, with equal probability, at any position of the fan region; and
(5) jointing, through straight lines, the n points to form a closed polygon.
A fourth step includes: filling the region within the closed polygon with background color (black) and using the region as a banknote incompletion.
The banknote is normally deflected at the edge portion, and the deflected portion of the banknote is generally small. According to this character, the folding (deflection) noise model may be established according to following steps.
Step 501 includes: dividing the banknote into two columns and two rows to form four uniform rectangular regions each having a long side and a short side which belong to edges of the banknote.
Step 502 includes: randomly selecting one of the regions, randomly selecting one point of the short side of the region, and randomly selecting another point of the long side of the region.
Step 503 includes: determining whether a distance between the two points. i.e., the distances x (a distance on the long side) and y (a distance on the short side) from the two points to the vertex, satisfy a constraint condition of √{square root over (x2+y2)}<k, x<m, y<n; if the distance between the two points satisfies the constraint condition, proceeding to a next step; and if the distance between the points does not satisfy the constraint condition, returning to the previous step.
Step 504 includes: filing a deflection region, which has an edge being a straight line determined by the two points and has a point beyond the edge, with background color.
Step 601 includes: randomly acquiring a line segment s with a length of L on the boundary of the banknote, where L is in uniform distribution, Lε(0,MaxL), and MaxL is a maximum length of the boundary of the banknote.
Step 602 includes: determining a position of another point N, wherein a distance between the point N and a midpoint M of the line segment s is 1, and an angle between the line segment MN and the line segment s is a, wherein lε(0,Maxl), the angle αε(π/3,2π/3), and α and l are in normal distribution.
Step 603 includes: determining a triangle region bounded by the point N and the segment line s as the crack region, and filling the crack region with the background color.
Based on the foregoing embodiments disclosed according to the disclosure, a system for recognizing and classifying banknotes is disclosed according to the disclosure.
In conclusion, a method and for recognizing and classifying banknotes and a system thereof are disclosed according to the disclosure. The method includes: acquiring sample information of brand new banknotes to be learned and banknote sample information to be recognized; establishing, according to a preset rule, a banknote sample signal degeneration model; inputting the sample information into the banknote sample signal degeneration model to acquire various banknote sample information corresponding to the brand new banknote to be learned; inputting the various banknote sample information to perform classifier learning, and outputting a banknote classification model; performing sample signal preprocessing and feature extraction on the sample information to be recognized, performing classification decision on the banknote to be recognized by using the classification model, and outputting a final classification result. In the method, large amount of existing samples which are reliable and easily accessible are used to statistically establish a sample signal degeneration model which satisfies application requirement, to simulate the states of banknotes such as brand new condition, 80%-90% new condition, 70%-80% new condition, 0-70% new condition, contamination in varying degrees, incompletion in varying degrees, crack in varying degrees and folds in some regions, then classifier learning is performed, and classification recognition is performed on the sample to be recognized, thereby accurately acquiring a classification result, and decreasing cost and efficiency for developing banknote recognition product while ensuring improvement of classifier precision.
According to the description of the disclosed embodiments, the disclosure may be implemented or used by the person skilled in the art. Various modifications made to these embodiments are apparent for persons skilled in the art, and a normal principle defined in the disclosure may be implemented in other embodiments without departing from spirit or scope of the disclosure. Therefore the disclosure is not limited to the embodiments described in the disclosure but confirms to a widest scope in accordance with principles and novel features disclosed in the disclosure.
Number | Date | Country | Kind |
---|---|---|---|
201310292056.4 | Jul 2013 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2014/071202 | 1/23/2014 | WO | 00 |