This application is a national stage application of PCT/EP2020/064920 filed May 28, 2020, the disclosures of which are incorporated herein by reference in their entirety.
The present invention relates to methods of performing biometric authentication, and systems and computer programs adapted to carry out such methods of performing biometric authentication.
Biometric authentication is a security process that uses biological or biometric characteristics of a user (or person) U1 to verify (or authenticate/test/check) the identity of that user U1. Many methods of performing such biometric authentication are known. Such authentication may be based, for example, on one or more of: facial characteristics; voice characteristics; fingerprints; eye characteristics (e.g. iris or retina patterns); etc. During the biometric authentication, an input X is obtained from (or is provided by) the user U1 whose identity is to be authenticated, where the input X is based on (or represents) one or more biological or biometric characteristics of the user U1—this input X could be obtained or derived, for example, based on one or more of: one or more captured images of the user's face or eye(s); fingerprint data obtained from a fingerprint scanner; capture/recording of the user's voice (which may or may not be predetermined words spoken); etc. depending on which biometric characteristics are to be used by the authentication system. The objective of the biometric authentication is then to determine whether the input X was provided by (or is based on or relates to or was obtained from) a predetermined user UD, e.g. whether the captured image(s) of the user's face or eye(s) corresponds to the face or eye(s) of the predetermined user UD; whether the obtained fingerprint data corresponds to one or more fingerprints of the predetermined user UD; whether the captured voice corresponds to the voice of the predetermined user UD; etc. If it is determined that the user being tested, U1, is the predetermined user UD, then the user U1 may be processed accordingly as if they were the predetermined user UD (e.g. pass a passport check; be granted access to a facility or to data or to a device; be permitted to perform certain actions; etc.); if it is determined that the user being tested, U1, is not the predetermined user UD, then the user U1 may be processed accordingly as if they were not the predetermined user UD (e.g. fail a passport check; be denied access to a facility or to data or to a device; be prevented from performing certain actions; etc.). For example, a device (e.g. a computer or smartphone) may belong to a particular person UD, and it may be desirable to perform biometric authentication of a person U1 wishing to login or make use of the device to check whether that person U1 is the owner UD, and then allow only the authenticated owner UD to login and make use of the device.
Associated with biometric authentication are false rejections and false acceptances. A false rejection occurs if the biometric authentication does not accept a correctly claimed identity, i.e. the user U1 being tested is indeed the predetermined user UD but the biometric authentication incorrectly concludes that the user U1 being tested is not the predetermined user UD. A false acceptance occurs if the biometric authentication accepts an incorrectly claimed identity, i.e. the user U1 being tested is not the predetermined user UD but the biometric authentication incorrectly concludes that the user U1 being tested is the predetermined user UD. False rejections relate to the (in)convenience experienced by legitimate users—after a false rejection, the legitimate user may, for example, need to repeat the biometric authentication or liaise with a system administrator. False acceptances relate to the security of the biometric authentication—after a false acceptance, an unauthorized or incorrect user may, for example, be able to access data or devices that they may not normally be able/allowed to access, or may be able perform tasks that they may not be normally be able/allowed to perform.
It would be desirable to provide a method of conducting biometric authentication that provides for low rates of both false rejections and false acceptances.
According to a first aspect of the invention, there is provided a method of performing biometric authentication for a first user, the method comprising:
performing one or more first tests, wherein for each first test, performing said first test comprises:
wherein performing the second test comprises:
In some embodiments of the first aspect, for each first test, the respective first log-likelihood ratio is r(X(j))=log p(X(j)|λg)−log p(X(j)|λi), where X(j) is the respective first input for said first test, λg is the first model and Ai the second model. In such embodiments, for each first test, determining that the first user is not the predetermined user when the respective first log-likelihood ratio for the first likelihood and the second likelihood does not exceed the respective first threshold may comprise one of: (a) calculating r(X(j)) and determining that the first user is not the predetermined user if r(X(j))<θL(j), where θL(j) is a respective predetermined threshold for the first test; or (b) calculating
and determining that the first user is not the predetermined user if
where θL(j) is a respective predetermined threshold for the first test; or (c) calculating a metric based on a ratio between the first likelihood and the second likelihood and comparing the metric to a threshold based on the respective first threshold. Additionally or alternatively, in such embodiments, for each first test, determining that the first user is the predetermined user when the respective first log-likelihood ratio exceeds the respective second threshold may comprise one of: (a) calculating r(XU)) and determining that the first user is the predetermined user if r(X(j))>θR(j), where θR(j) is a respective predetermined threshold for the first test; or (b) calculating
and determining that the first user is the predetermined user if
where θR(j) is a respective predetermined threshold for the first test; or (c) calculating a metric based on a ratio between the first likelihood and the second likelihood and comparing the metric to a threshold based on the respective second threshold.
In some embodiments of the first aspect, the second log-likelihood ratio is r(X(N))=log p(X(N)|λg)−log p(X(N)|λi), where X(N) is the second input, λg is the first model and λi the second model. In such embodiments, determining that the first user is the predetermined user when the second log-likelihood ratio for the third likelihood and the fourth likelihood exceeds the third threshold may comprise one of: (a) calculating r(X(N)) and determining that the first user is the predetermined user if r(X(N))>θ(N), where θ(N) is a predetermined threshold; or (b) calculating
and determining that the first user is the predetermined user if
where θ(N) is a predetermined threshold; or (c) calculating a metric based on a ratio between the third likelihood and the fourth likelihood and comparing the metric to a threshold based on the third threshold.
In some embodiments of the first aspect, the one or more biometric characteristics are based on one or more of: a face of the first user; a voice of the first user; one or more fingerprints of the first user; one or more eyes of the first user.
According to a second aspect of the invention, there is provided a system arranged to perform biometric authentication for a first user, the system adapted to:
perform one or more first tests, wherein for each first test, performing said first test comprises:
wherein performing the second test comprises:
In some embodiments of the second aspect, for each first test, the respective first log-likelihood ratio is r(X(j))=log p(X(j)|λg)−log p(X(j)|λi), where X(j) is the respective first input for said first test, λg is the first model and λi the second model. In such embodiments, for each first test, determining that the first user is not the predetermined user when the respective first log-likelihood ratio for the first likelihood and the second likelihood does not exceed the respective first threshold may comprise one of: (a) calculating r(X(j)) and determining that the first user is not the predetermined user if r(X(j))<θL(j), where θL(j) is a respective predetermined threshold for the first test; or (b) calculating
and determining that the first user is not the predetermined user if
where θL(j) is a respective predetermined threshold for the first test; or (c) calculating a metric based on a ratio between the first likelihood and the second likelihood and comparing the metric to a threshold based on the respective first threshold. Additionally or alternatively, in such embodiments, for each first test, determining that the first user is the predetermined user when the respective first log-likelihood ratio exceeds the respective second threshold may comprise one of: (a) calculating r(X(j)) and determining that the first user is the predetermined user if r(X(j))>θR(j), where θR(j) is a respective predetermined threshold for the first test; or (b) calculating
and determining that the first user is the predetermined user if
where θR(j) is a respective predetermined threshold for the first test; or (c) calculating a metric based on a ratio between the first likelihood and the second likelihood and comparing the metric to a threshold based on the respective second threshold.
In some embodiments of the second aspect, the second log-likelihood ratio is r(X(N))=log p(X(N)|λg)−log p(X(N)|λi), where X(N) is the second input, λg is the first model and λi the second model. In such embodiments, determining that the first user is the predetermined user when the second log-likelihood ratio for the third likelihood and the fourth likelihood exceeds the third threshold may comprise one of: (a) calculating r(X(N)) and determining that the first user is the predetermined user if r(X(N))>θ(N), where θ(N) is a predetermined threshold; or (b) calculating
and determining that the first user is the predetermined user if
where θ(N) is a predetermined threshold; or (c) calculating a metric based on a ratio between the third likelihood and the fourth likelihood and comparing the metric to a threshold based on the third threshold.
In some embodiments of the second aspect, the one or more biometric characteristics are based on one or more of: a face of the first user; a voice of the first user; one or more fingerprints of the first user; one or more eyes of the first user.
According to a third aspect of the invention, there is provided a computer program which, when executed by one or more processors, causes the one or more processors to carry out the method according to the first aspect or any embodiment thereof. The computer program may be stored on a computer readable medium.
Embodiments of the invention will now be described, by way of example only, with reference to the accompanying drawings, in which:
In the description that follows and in the figures, certain embodiments of the invention are described. However, it will be appreciated that the invention is not limited to the embodiments that are described and that some embodiments may not include all of the features that are described below. It will be evident, however, that various modifications and changes may be made herein without departing from the broader spirit and scope of the invention as set forth in the appended claims.
1—Underlying Mathematical Basis
As mentioned, the objective of biometric authentication is to determine, given an input X that is provided by (or that originated from) a first user U1 and that is based on (or represents) one or more biological or biometric characteristics of the first user U1, whether the first user U1 is a predetermined or specific user UD. This may be formulated as a hypothesis test in which the null hypothesis H0 is that X originated from (or was provided by or is based on or corresponds to) the predetermined user UD, and in which the alternative hypothesis H1 is that X did not originate from (or was not provided by or is not based on or does not correspond to) the predetermined user UD. One possible test is to compare the likelihoods p(X|H0) and p(X|H1) where p(X|Hj) for j=0.1 is the probability density function for the hypothesis Hj evaluated for the input X, for example by calculating the log-likelihood ratio r(X)=log p(X|H0)−log p(X|H1). In the following, the natural logarithm is used, but it will be appreciated that other logarithms could be used (with the examples and equations updated accordingly). The value of r(X) may be compared with a threshold θ, and H0 is accepted (i.e. it is concluded that the user U1 is the predetermined user UD) if r(X)>θ, whereas H1 is accepted and H0 is rejected (i.e. it is concluded that the user U1 is not the predetermined user UD) if r(X)<θ. It will be appreciated that the decision on whether to accept or reject H0 if r(X)=θ is a design choice.
Determination of the likelihoods p(X|H0) and p(X|H1) may be based on models λg and λi respectively, where λg is a first model in which input is obtained from the predetermined user UD, and wherein λi is a second model in which input is obtained from one or more users other than the predetermined user UD, i.e. λg is a model of the predetermined user UD and λi is a model of one or more “imposters” (i.e. users other than the predetermined user UD). The models λg and λi then represent the hypotheses H0 and H1 respectively. The models may then be used to calculate the likelihoods p(X|H0) and p(X|H1) as p(X|λg) and p(X|λi) respectively, so that a log-likelihood ratio can be calculated as r(X)=log p(X|λg)−log p(X|λi).
Examples of, and the nature of, the models λg and λi shall be described later.
Given random variables Xg and Xi that represent, respectively, test inputs X of a “genuine” user (i.e. the predetermined user UD) and of one or more “imposters” (i.e. users other than the predetermined user UD), the random variables r(Xg) and r(Xi) may be assumed to have respective Gaussian distributions, i.e. r(Xg)˜(μg,σg2) and r(Xi)˜(μi,σi2). Examples of how the respective means μg and μi and the respective variances σg2 and σi2 a may be estimated shall be described later. The respective probability density functions ƒg and ƒi for the random variables r(Xg) and r(Xi) are then:
Using the well-known error function
the cumulative distribution functions Fg and Fi for the random variables r(Xg) and r(Xi) are:
The analysis below assumes that the following design choice is made: H0 is rejected if r(X)=θ. However, as mentioned above, the skilled person will appreciate that this is a mere design choice and that the analysis below can be easily adapted to examples in which H0 is accepted if r(X)=θ.
A Type I error (or a false rejection or false positive) happens when H0 is true but is rejected, i.e. the user U1 being tested is the predetermined user UD, but the biometric authentication incorrectly concludes that the user U1 is not the predetermined user UD. The probability, pFP, of a Type I error may therefore be expressed as:
(where t parameterizes this expression for pFP and represents the parameterized threshold θ).
As mentioned above, pFP relates to the (in)convenience experienced by a genuine user—i.e. the user U1 being tested is indeed the predetermined user UD, but the biometric authentication incorrectly concludes that the user U1 is not the predetermined user UD (and therefore is inconvenienced) with a probability of pFP. The lower the value of pFP, the greater the convenience for the genuine user.
A Type II error (or a false acceptance or false negative) happens when H0 is false but is accepted, i.e. the user U1 being tested is not the predetermined user UD, but the biometric authentication incorrectly concludes that the user U1 is the predetermined user UD. The probability, pFN, of a Type II error may therefore be expressed as:
(where t parameterizes this expression for pFN and represents the parameterized threshold θ).
As mentioned above, pFN relates to the security of the biometric authentication system/method—i.e. the user U1 being tested is not the predetermined user UD, but the biometric authentication incorrectly concludes that the user U1 is the predetermined user UD (and security may be considered compromised) with a probability of pFN. The lower the value of pFN, the greater the security.
One or both of pFP and pFN may be used to set the value of θ for performing the test of r(X)<θ. For example, with a focus on user convenience, a target value βFP>0 for pFP may be chosen to be as low as desired, with a value of θ being chosen accordingly so that
Likewise, with a focus on security, a target value βEN>0 for pFN may be chosen to be as low as desired, with a value of B being chosen accordingly so that
The so-called Equal Error Rate (EER) may be used to define the value of 0 and provides a measure of effectiveness of the biometric authentication. The value of B to achieve EER, namely θEER, is the value of t for which pFP(t|μg,σg2)=pFN(t|μi,σi2), and the EER is then defined as pFP(θEER|g,σg2). The lower the value of EER, the more secure and more user-convenient the biometric authentication is.
Example 1: As an example of the above method of biometric authentication, consider the following values for the parameters for the two Gaussian distributions: μg=2.73, σg2=0.44, μi=−1.86 and σi2=0.90.
An enhancement on the above-described method for performing biometric authentication shall now be described. This enhancement enables greater security and/or greater user-convenience, as shall become apparent. The same underlying models λg and λi are used in this enhanced method of biometric authentication.
With this enhanced method of biometric authentication, instead of using just a single threshold θ for performing the test of r(X)<θ, multiple thresholds are used, as set out below, with one or more tests being performed for a single biometric authentication. In particular, a first type of test (referred to herein as a first test) may be performed one or more times, up to a maximum number M times. A second type of test (referred to herein as a second test) may then be performed (if the first test has been performed M times and in dependence on the outcome of the Mth first test). Thus, at most N tests are performed, where N=M+1. In the following, the parameter j is used to indicate the current test being performed, with j ranging from 1 to N and j being initialized to 1. The value of M is predetermined and can be any positive integer; thus, likewise, the value of N is predetermined and can be any positive integer greater than 1. In practice, it may be that the value of N is set, as this represented the maximum number of tests (be they first or second tests) that the user will need to undergo for a given biometric authentication, with the value of M being set, or determined, based on N as M=N−1. Some embodiments may be implemented to use a value for M without explicitly using a value for N (e.g. as discussed later with reference to
In particular, the jth first test is performed in which r(X(j)) is calculated for an input X(j) for the jth first test and: (a) if r(X(j))<θL(j) for a first threshold θL(j) for the jth first test, then H1 is accepted and H0 is rejected (i.e. it is concluded that the user U1 is not the predetermined user UD); (b) if r(X(j))>θR(j) for a second threshold θR(j) for the jth first test with θR(j)>θL(j), then H0 is accepted (i.e. it is concluded that the user U1 is the predetermined user UD); (c) if, however, θL(j)<r(X(j))<θR(j), then the conclusion of the first test is either that: (i) another first test is to be performed if j<M (in which case j will be incremented by 1 so that the (j+1)th first test is then performed); or (ii) a second test is to be performed if j=M (in which case, the second test will be the Nth test that is performed)—thus, the first test is performed a maximum of M times and the second test is performed if the first test has been performed M times and θL(M)<r(X(M))<θR(M).
In some embodiments, H0 is rejected at the jth first test if r(X(j))=θL(j), whereas in other embodiments, a further first test or the second test is performed as appropriate if r(X(j))=θL(j): this is a design choice. Likewise, in some embodiments, H0 is accepted at the jth first test if r(X(j))=θR(j), whereas in other embodiments, a further first test or the second test is performed as appropriate if r(X(j))=θR(j): again, this is a design choice.
Each of the one or more first tests comprises obtaining a respective first input XU). The first input X(j) may be obtained in the same way as the input X is obtained, as discussed above. The input X(j) is of the same type as the input X (i.e. they both relate to the same biological or biometric characteristics and can be analyzed based on the same models λg and λi).
For the second test, r(X(N)) is calculated and: (a) if r(X(N))<θ(N) for a third threshold θ(N), then H1 is accepted and H0 is rejected (i.e. it is concluded that the user U1 is not the predetermined user UD); (b) if r(X(N))>θ(N) for the third threshold θ(N), then H0 is accepted (i.e. it is concluded that the user U1 is the predetermined user UD). In some embodiments, H0 is accepted if r(X(N))=θ(N) whereas in other embodiments, H0 is rejected if r(X(N))=θ(N): again, this is a design choice.
The second test comprises obtaining a second input X(N). The second input X(N) may be obtained in the same way as the input X is obtained, as discussed above. The second input X(N) is of the same type as the input X (i.e. they both relate to the same biological or biometric characteristics and can be analyzed based on the same models λg and λi).
The analysis below assumes that the following design choices are made: H0 is rejected at the jth first test if r(X(j))=θL(j) and at the second test if r(X(N))=θ(N), and a further first test or the second test is performed as appropriate if r(X(j))=θR(j). The analysis below also assumes that the random variables r(X(j)) (for j=1, 2, . . . , N−1) and r(X(N)) of each respective first input X(j) (for j=1, 2, . . . , N−1) and the second input X(N) respectively are independent, and that the random variables r(X(j)) (for j=1, 2, . . . , N−1) and r(X(N)) of each respective first input X(j) (for j=1, 2, . . . , N−1) and the second input X(N) that are associated with a single biometric authentication are identically distributed. In particular, as above, given random variables Xg and Xi that represent, respectively, test inputs X(j) (or X(N)) of a “genuine” user (i.e. the predetermined user UD) and of one or more “imposters” (i.e. users other than the predetermined user UD), the random variables r(Xg) and r(Xi) may be assumed to have respective Gaussian distributions, i.e. r(Xg)˜(μg,σg2) and r(Xi)˜(μi,σi2). As previously mentioned, examples of how the respective means μg and μi and the respective variances σg2 and σi2 may be estimated shall be described later.
Accordingly, in embodiments in which the maximum number of tests N equals 2 (i.e. M=1), the probability pFP(2) of a Type I error and the probability pFN(2) of a Type II error are given by:
pFN(2)(tL(1),tR(1),t(2)|μg,σg2)=Pr(r(Xg)>tL(1))+Pr(tL(1)<r(Xg)≤tR(1))·Pr(r(Xg)>t(2))
pFN(2)(tL(1),tR(1),t(2)|μi,σi2)=Pr(r(Xi)>tR(1))+Pr(tL(1)<r(Xi)≤tR(1))·Pr(r(Xi)>t(2))
(where tL(1), tR(1)), t(2) parameterize these expressions for pFP(2)) and pFN(2), and represent the parameterized thresholds θL(1), θR(1) and θ(2))
so that
One or both of pFP(2)) and pFN(2) may be used to set the values of θL(1), θR(1) and θ(2) for performing the above-mentioned first and second tests. For example, with a focus on user convenience, a target value βFP>0 for pFP(2) may be chosen to be as low as desired, with values of θL(1), θR(1) and θ(2) being chosen accordingly so that
βFP≥pFP(2)(θL(1),θR(1),θ(2)|μg,σg2)
There may be more than one triple of values (θL(1),θR(1),θ(2)) which satisfy this, any of which may be chosen. In some embodiments, the triple of values (θL(1),θR(1),θ(2)) is chosen so that value of pFP(2)(θL(1),θR(1),θ(2)|μi,σi2) is as small as possible.
Example 2: Considering the same values of the parameters μg, σg2, μi and σi2 as for Example 1 above, and setting βFP=0.0022, it can be determined that pFP(2)(θL(1),θR(1),θ(2)|μg,σg2≈0.0000081 for θL(1)≈0.68, θR(1)≈2.62 and θ(2)≈0.89. The probability of a false negative in this example is around 273 times smaller than the corresponding value in Example 1, which is a considerable improvement for security.
Likewise, with a focus on security, a target value βFN>0 for pFN(2) may be chosen to be as low as desired, with the values of θL(1), θR(1) and θ(2) being chosen accordingly so that
βFP≥pFP(2)(θL(1),θR(1),θ(2)|μi,σi2)
Again, there may be more than one triple of values (θL(1),θR(1),θ(2)) which satisfy this, any of which may be chosen. In some embodiments, the triple of values (θL(1),θR(1),θ(2)) is chosen so that the value of pFP(2)(θL(1),θR(1),θ(2)|μg,σg2) is as small as possible.
Example 3: Considering the same values of the parameters μg, σg2, μi and σi2 as for Example 1 above, and setting βFN=0.0022, it can be determined that pFP(2)(θL(1),θR(1),θ(2)|μg,σg2)≈0.00000078 for θL(1)≈−0.59, θR(1)≈1.20 and θ(2)≈0.14. The probability of a false positive in this example is around 2,850 times smaller than the corresponding value in Example 1, which is a considerable improvement for the convenience of the genuine user.
An adapted EER approach could be used to set the values of θL(1), θR(1) and θ(2). Let the function u be defined by:
u(tL(1),tR(1),t(2)|μg,σg2μi,σi2)=max{pFP(2)(tL(1),tR(1),t(2)|μg,σg2),pFP(2)(tL(1),tR(1),t(2)|μi,σi2)}
and let
then the values of θL(1), θR(1) and θ(2) could be set to be values for tL(1), tR(1) and t(2) for which ERminmax is reached. Again, there may be more than one triple of values (θL(1), θR(1), θ(2)) which satisfy this, any of which may be chosen.
Example 4: Considering the same values of the parameters μg, σg2, μi and σi2 as for Example 1 above, ERminmax may be determined numerically, with ERminmax≈0.00012 for θL(1)≈0.14, θR(1)≈1.97 and θ(2)≈0.57. The value of ERminmax in this example is around 18 times smaller than the EER in Example 1, which is a considerable improvement for both the convenience of the genuine user and security.
It will be appreciated that other methods may be used to set the values of θL(1), θR(1) and θ(2), depending, for example, on the desired balance between rates of false acceptances and false rejections.
In embodiments in which the maximum number of tests N equals 2 (i.e. M=1), the probability that exactly m tests are performed for m=1 and m=2, given that the first user U1 is the “genuine” user (i.e. the predetermined user UD) or that the first user U1 is an “imposter” (i.e. a user other than the predetermined user UD), are pgm tests and pim tests respectively, where:
Note that pim tests may be viewed as relating, in part, to the security of the biometric authentication since an attacker/imposter may be able to obtain further information relating to the biometric authentication via the second test. To help mitigate this, some embodiments of the invention may impose a maximum number of consecutive times that the biometric authentication process, resulting in the second test being conducted, may be performed.
Note also that pgm tests relates to the (in)convenience experienced by the genuine (predetermined) user UD. It may, for example, be desirable to ensure that pg2 tests is at most a threshold value α for some 0<α<1 and then set the values of θL(1), θR(1) and θ(2) to be values for tL(1), tR(1) and t(2) that achieve ERminmax(α), where:
i.e. ERminmax(α) is ERminmax but with the minimization constrained to triples (tL(1),tR(1),t(2)) for which pg2 tests (tL(1),tR(1),t(2)|μg,σg2)≤α.
Example 5: Considering the same values of the parameters μg, σg2, μi and σi2 as for Example 2 above, and the same values of θL(1), θR(1) and θ(2),
pg1 tests(θL(1),θR(1),θ(2)|μg,σg2)≈0.874
pg2 tests(θL(1),θR(1),θ(2)|μg,σg2)≈0.126
pi1 tests(θL(1),θR(1),θ(2)|μi,σi2)≈0.983
pi2 tests(θL(1),θR(1),θ(2)|μi,σi2)≈0.017
In particular, ERminmax(α)<0.00012 if 0.126<α<1. For 0<α<0.126, the value of ERminmax(α) may be determined numerically and is shown in
pg1 tests(θL(1),θR(1),θ(2)|μg,σg2)≈0.990
pg2 tests(θL(1),θR(1),θ(2)|μg,σg2)≈0.010
pi1 tests(θL(1),θR(1),θ(2)|μi,σi2)≈0.995
pi2 tests(θL(1),θR(1),θ(2)|μi,σi2)≈0.005
ERminmax(0.01) is around 3.5 times smaller than the EER in Example 1 above, so that good improvements in both the convenience of the genuine user and security are still achieved, even though the probability of the genuine user needing to perform a second test is bounded above by a small value of 0.01.
In the general scenario in which at most N tests are performed with N>1, the probability pFP(N)(tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N)|μg,σg2) of a Type I error can be calculated by setting
pFP(1)(t(1)|μg,σg2)=Pr(r(Xg)≤t(1))
and computing
pFP(j)(tL(1),tR(1), . . . ,tL(j−1),tR(j−1),t(j)|μg,σg2)=Pr(r(Xg)≤tL(1))+Pr(tL(1)<r(Xg)≤tR(1))·pFP(j−1)(tL(2),tR(2), . . . ,tL(j−1),tR(j−1),t(j)|μg,σg2)
for j=2,3, . . . , N.
Likewise, in the general scenario in which at most N tests are performed with N>1, the, probability pFP(N)(tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N)|μg,σg2) of a Type II error can also be calculated by setting
pFN(1)(t(1)|μi,σi2)=Pr(r(Xi)>t(1))
and computing
pFN(j)(tL(1),tR(1), . . . ,tL(j−1),tR(j−1),t(j)|μi,σi2)=Pr(r(Xi)>tR(1))+Pr(tL(1)<r(Xi)≤tR(1))·pFN(j−1)(tL(2),tR(2), . . . ,tL(j−1),tR(j−1),t(j)|μi,σi2)
for j=2, 3, . . . , N.
(where tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N) parameterize these expressions for pFP(N) and pFN(N), and represent the parameterized thresholds θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)).
One or both of pFP(N) and pFN(N) may be used to set the values of θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N) for performing the above-mentioned at most N tests. For example, with a focus on user convenience, a target value βFP>0 for pFP(N) may be chosen to be as low as desired, with values of θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N) being chosen accordingly so that
βFP≥pFP(N)(θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)|μg,σg2)
There may be more than one vector of values (θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)) which satisfy this, any of which may be chosen. In some embodiments, the vector of values (θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)) may be chosen so that value of pFN(N)(θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)|μi,σi2) is as small as possible.
Likewise, with a focus on security, a target value βFN>0 for pFN(N) may be chosen to be as low as desired, with the values of θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N) being chosen accordingly so that
βFN≥pFN(N)(θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)|μi,σi2)
Again, there may be more than one vector of values (θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)) which satisfy this, any of which may be chosen. In some embodiments, the vector of values (θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)) may be chosen so that value of pFP(N)(θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)|μg,σg2) is as small as possible.
A further adapted EER approach could be used to set the values of θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N). Let the function v be defined by:
v(tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N)|μg,σg2μi,σi2)=max{pFP(N)(tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N)|μg,σg2),pFN(2)(tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N)|μi,σi2)}
and let
then the values of θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N) could be set to be values for tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N) for which ERminmax(N) is reached. Again, there may be more than one vector of values (θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N)) which satisfy this, any of which may be chosen.
Again, it will be appreciated that other methods may be used to set the values of θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N), depending, for example, on the desired balance between rates of false acceptances and false rejections.
In the general scenario in which at most N tests are performed with N>1, the probability that exactly m tests are performed with 1≤m≤N, given that the first user U1 is the “genuine” user (i.e. the predetermined user UD) or that the first user U1 is an “imposter” (i.e. a user other than the predetermined user UD), are pgm tests and pim tests respectively, where:
Example 6: Considering the same values of the parameters μg, σg2, μi and σi2 as for Example 1 above, ERminmax(3) may be determined numerically, with ERminmax(3)≈0.000015 for
(θL(1),θR(1),θL(2),θR(2),θ(3)|μg,σg2)≈(−0.20,2.47,−0.07,1.72,0.43).
The value of ERminmax(3) in this example is around 147 times smaller than the EER in Example 1 and around 8 times smaller than the value of ERminmax in Example 4, which is again a considerable improvement for both the convenience of the genuine user and security. For this example,
pg1 tests(θL(1),θR(1),θL(2),θR(2),θ(3)|μg,σg2)≈0.652
pg2 tests(θL(1),θR(1),θL(2),θR(2),θ(3)|μg,σg2)≈0.325
pg3 tests(θL(1),θR(1),θL(2),θR(2),θ(3)|μg,σg2)≈0.022
pi1 tests(θL(1),θR(1),θL(2),θR(2),θ(3)|μi,σi2)≈0.960
pi2 tests(θL(1),θR(1),θL(2),θR(2),θ(3)|μi,σi2)≈0.039
pi3 tests(θL(1),θR(1),θL(2),θR(2),θ(3)|μi,σi2)≈0.001
Note that pim tests may be viewed as relating, in part, to the security of the biometric authentication since an attacker/imposter may be able to obtain further information relating to the biometric authentication via the more than one tests. To help mitigate this, some embodiments of the invention may impose a maximum number of consecutive times that the biometric authentication process, resulting in more than one test being conducted, may be performed.
Note also that pgm tests relates to the (in)convenience experienced by the genuine (predetermined) user UD. It may, for example, be desirable to ensure that pg1 tests is at least a threshold value 1−α for some 0<α<1 and then set the values of θL(1),θR(1), . . . ,θL(N−1),θR(N−1),θ(N) to be values for tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N) that achieve ERminmax(N) but with the minimization constrained to values for tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N) for which
pg1 tests(tL(1),tR(1), . . . ,tL(N−1),tR(N−1),t(N)|μg,σg2)>1−α.
As mentioned above, the input X (and, likewise, the first input(s) X(j) for j=1, 2, . . . , M and the second input X(N)) could be obtained or derived, for example, based on one or more of: one or more captured images of the user's face or eye(s); fingerprint data obtained from a fingerprint scanner; capture/recording of the user's voice (which may or may not be predetermined words spoken); etc. depending on which biometric characteristics are to be used by the authentication system. The following discussion uses the user's voice (e.g. in relation to spoken words) as an example of the generation and use of the models λg and λi, and the derivation of their respective means μg and μi and respective variances σg2 and σi2. However, it will be appreciated that analogous approaches may be used in respect of other types of characteristics of the users and for different types of input X.
The input X may be a captured/recorded speech segment of a single speaker. A feature vector representing the input X may be generated from X. For example, the input X may be divided into a number of frames of a certain length, for which there may or may not be a certain amount of overlap between adjacent frames. For example, a frame may be 10 ms long and the amount of overlap between adjacent frames may be 50%. In some embodiments, certain frames may be discarded if those frames do not meet certain predetermined criteria—for example, a frame that does not contain speech or a frame that is too silent (i.e. that does not meet the criteria of being sufficiently loud) may be discarded. Put another way, frames that meet certain predetermined criteria may be selected for further processing. Feature extraction may then be applied to the frames (or the frames selected based on the above-mentioned criteria). This may be achieved in a variety of ways, such as the well-known feature extraction based on Mel-Frequency Cepstral Coefficients (MFCCs)—see L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, Englewood Cliffs, N.J., PTR Prentice Hall, 1993, the entire disclosure of which is incorporated herein by reference. If these (selected/remaining) frames are numbered 1 to K and if n denotes the number of MFCCs used, then this method outputs a feature vector xj∈n for frame j with j=1, 2, . . . , K. It will be appreciated that other methods of obtaining a set {xj∈n: j=1, 2, . . . , K} of K feature vectors from an input X may be used and, as mentioned above, alternative methods may be used when the input X is not a speech segment but, instead, relates to different characteristics of the user.
There are various types of models λg and λi which could be used for embodiments of the invention, such as a Hidden Markov Model. A Gaussian Mixture Model (GMM) is a well-known type of model—see, for example, https://en.wikipedia.org/wiki/Mixture_model, as well as D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, Speaker verification using adapted Gaussian mixture models, Digital Signal Processing 10 (2000), pp. 19-41, the entire disclosures of which are incorporated herein by reference. To compute the likelihoods p(X|λg) and p(X|λi) using a GMM-based method, let gj: n→ be Gaussian density functions with mean vector
The Expectation-Maximization (EM) algorithm presented in A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society: Series B (Methodological), vol. 39, no. 1 (1977), pp. 1-38 (, the entire disclosure of which are incorporated herein by reference) is a well-known method to estimate the parameters of λ. This algorithm takes a set of feature vectors, the value of m, the number of iterations used in the EM algorithm, and initial values for one Gaussian density function as inputs, and outputs the estimated parameters of λ.
Thus, during a training phase, a first set of feature vectors may be extracted from training speech segments provided by the genuine speaker UD (as discussed above). This first set may be input to the EM algorithm to estimate the parameters of λg. Likewise, a second set of feature vectors may be extracted from training speech segments of one or more imposters (i.e. users other than the genuine speaker UD). This second set may be input to the EM algorithm to estimate the parameters of λi.
For a set of feature vectors {x1, x2, . . . , xT} (where xj∈n for j=1, 2, . . . , T) extracted from a speech segment X, the log-likelihoods log p(X|λg) and log p(X|λi) may be computed as:
i.e. the likelihoods p(X|λg) and p(X|λi) may be computed as:
so that r(X)=log p(X|λg)−log p(X|λi) may be calculated.
To estimate the mean μg and the variance σg2, a set of Mg speech segments {Xg(1), Xg(2), . . . , Xg(M
Similarly, to estimate the mean μi and the variance σi2, a set of Mi speech segments {Xi(1), Xi(2), . . . , Xi(M
Based on these estimated means μg, μi and the variances σg2, σi2, the thresholds used for a testing phase (i.e. when performing the biometric authentication) may be calculated, as discussed above.
During the testing phase, a set of feature vectors {x1, x2, . . . , xS} may be extracted from a test speech segment X. The log-likelihoods log p(X|λg) and log p(X|λi) and/or the likelihoods p(X|λg) and p(X|λi) may be computed as discussed above:
Thus, r(X)=log p(X|λg)−log p(X|λi) may be calculated during the testing phase.
It will be appreciated that, in the above example discussion of audio speech segments, the input X was viewed as a speech segment recorded/captured from a user and from which a set of feature vectors x∈n is extracted for subsequent processing. However, input X could equally be viewed as the actual feature vectors x∈n obtained from the user via an audio recording/capture and subsequent feature extraction. Other ways of viewing the input X may equally apply (e.g. the input X may be a filtered/processed version of an audio recording/capture).
As discussed above, other types of model may be used for the models λg and λi, and corresponding ways of initializing those models λg and λi during a training phase may be used accordingly. Indeed, the models λg and λi do not necessarily need to be of the same type. Since such modeling and training/initialization would be well-known to the skilled person, further detail on this shall not be given herein.
2—System Overview
The storage medium 104 may be any form of non-volatile data storage device such as one or more of a hard disk drive, a magnetic disc, a solid-state-storage device, an optical disc, a ROM, etc. The storage medium 104 may store an operating system for the processor 108 to execute in order for the computer 102 to function. The storage medium 104 may also store one or more computer programs (or software or instructions or code).
The memory 106 may be any random access memory (storage unit or volatile storage medium) suitable for storing data and/or computer programs (or software or instructions or code).
The processor 108 may be any data processing unit suitable for executing one or more computer programs (such as those stored on the storage medium 104 and/or in the memory 106), some of which may be computer programs according to embodiments of the invention or computer programs that, when executed by the processor 108, cause the processor 108 to carry out a method according to an embodiment of the invention and configure the system 100 to be a system according to an embodiment of the invention. The processor 108 may comprise a single data processing unit or multiple data processing units operating in parallel, separately or in cooperation with each other. The processor 108, in carrying out data processing operations for embodiments of the invention, may store data to and/or read data from the storage medium 104 and/or the memory 106.
The interface 110 may be any unit for providing an interface to a device 122 external to, or removable from, the computer 102. The device 122 may be a data storage device, for example, one or more of an optical disc, a magnetic disc, a solid-state-storage device, etc. The device 122 may have processing capabilities—for example, the device may be a smart card. The interface 110 may therefore access data from, or provide data to, or interface with, the device 122 in accordance with one or more commands that it receives from the processor 108.
The user input interface 114 is arranged to receive input from a user, or operator, of the system 100. The user may provide this input via one or more input devices of the system 100, such as a microphone 125, a mouse (or other pointing device) 126, a camera (e.g. a webcam or an integrated camera) 127, a fingerprint reader/detector 128, and/or a keyboard 124, that are connected to, or in communication with, the user input interface 114. However, it will be appreciated that the user may provide input to the computer 102 via one or more additional or alternative input devices (such as a touch screen). The computer 102 may store the input received from the input devices via the user input interface 114 in the memory 106 for the processor 108 to subsequently access and process, or may pass it straight to the processor 108, so that the processor 108 can respond to the user input accordingly.
The user output interface 112 is arranged to provide a graphical/visual and/or audio output to a user, or operator, of the system 100. As such, the processor 108 may be arranged to instruct the user output interface 112 to form an image/video signal representing a desired graphical output, and to provide this signal to a monitor (or screen or display unit) 120 of the system 100 that is connected to the user output interface 112. Additionally or alternatively, the processor 108 may be arranged to instruct the user output interface 112 to form an audio signal representing a desired audio output, and to provide this signal to one or more speakers 121 of the system 100 that is connected to the user output interface 112.
The network interface 116 provides functionality for the computer 102 to download data from and/or upload data to one or more data communication networks.
It will be appreciated that the architecture of the system 100 illustrated in
As discussed above, embodiments of the invention perform biometric authentication to determine, given an input X that is provided by (or that originated from) a first user U1 and that is based on (or represents) one or more biological/biometric characteristics of the first user U1, whether the first user U1 is a predetermined or specific user UD. The first user U1 may be a user of the computer system 100. In embodiments in which biological/biometric characteristics of the user U1 that are used to perform the biometric authentication relate to aspects/features of the user's face or eye(s), then the input X may be based on an image (of the face or eye(s) of the user U1) captured via the camera 127. In embodiments in which biological/biometric characteristics of the user U1 that are used to perform the biometric authentication relate to aspects/features of the user's voice, then the input X may be based on audio/sound (of the voice of the user U1) captured/recorded via the microphone 125. In embodiments in which biological/biometric characteristics of the user U1 that are used to perform the biometric authentication relate to aspects/features of the user's fingerprint(s), then the input X may be based on fingerprint data (from one or more fingers of the user U1) captured via the fingerprint reader/detector 128. It will be appreciated that embodiments of the invention may make use of additional or alternative mechanisms (e.g. via the user input interface 114 and/or via the interface 110 and a device 122) for obtaining an input X from the first user U1 that is based on one or more biological/biometric characteristics of the first user U1.
In some embodiments of the invention, the biometric authentication is carried out at the device of the first user U1, e.g. if the first user U1 is attempting to gain access to, or login to, a mobile telephone, laptop, personal computer, etc.
However, in other embodiments (which may equally apply to situations in which the first user U1 is attempting to gain access to, or login to, a mobile telephone, laptop, personal computer, etc.), at least some of the processing for the biometric authentication may be carried out separately from the device which initially received input from the user U1.
For example, with the system 200 shown in
It will be appreciated that other deployment scenarios are possible, with various stages of the processing for the biometric authentication being perform at different locations and/or by different entities.
3—Performance of Biometric Authentication
As discussed above, the method 500 may involve performing one or more first tests and, optionally (depending on the outcome of the one or more first tests) performing a second test. In the method 500, steps 502-510 are steps some or all of which may be performed for a first test, whilst steps 506, 510, 514 and 516 are steps some or all of which may be performed for the second test. It will be appreciated, however, that the first and second tests may be performed using a different set of steps and/or using these steps but in a different order. Either way, the outcome of the method 500 is either a determination at the step 506 that the first user U1 is not the predetermined user UD or a determination at the step 510 that the first user U1 is the predetermined user UD.
At the step 502, the authentication system obtains a respective first input X(j) for the (current) first test being performed. This first input X(j) is obtained based on one or more biological/biometric characteristics of the first user U1. The nature of the first input X(j), and methods and system components for obtaining the first input X(j), have been discussed above. The first user U1 may be prompted, for example by a message displayed on the monitor 120, to interact with the authentication system so as to provide/generate the first input X(j) (e.g. by posing for an image to be captured by the camera 127, or by speaking so that audio may be recorded by the microphone 125, or by placing one or more fingers on the fingerprint reader/detector 128 so that fingerprint data may be obtained).
At the step 504, the authentication system determines whether or not a respective first log-likelihood ratio (for the (current) first test being performed) between/for a first likelihood and a second likelihood exceeds a first respective threshold (for the (current) first test being performed), where the first likelihood is a likelihood of obtaining the respective first input X(j) based on a first model in which input is obtained from the predetermined user UD, and wherein the second likelihood is a likelihood of obtaining the respective first input X(j) based on a second model in which input is not obtained from the predetermined user UD (i.e. when input is obtained from one or more users other than the predetermined user UD).
Thus, in some embodiments, the first and second models are and λi respectively and the first and second likelihoods are p(X(j)|λg) and p(X(j)|λi). The step 504 may involve calculating the respective first log-likelihood ratio r(X(j))=log p(X(j)|λg)−log p(X(j)|λi) and identifying whether r(X(j))≤θL(j), for the first threshold θL(j). Alternatively, as
the step 504 may involve calculating
and identifying whether
which is equivalent to determining whether r(X(j))≤θL(j). It will be appreciated that there are other ways in which this respective log-likelihood ratio may be tested against the respective first threshold (e.g. calculating a metric/value based on a ratio between the first likelihood and the second likelihood and comparing the metric/value to a threshold based on the respective first threshold), and that some embodiments may not therefore actually involve directly calculating this log-likelihood ratio.
In some embodiments, two separate models λg and λi may be implemented and maintained, and p(X(j)|λg) and p(X(j)|λi) may be calculated separately (so that, for example, r(X(j))=log p(X(j)|λg)−log p(X(j)|λi) may then be calculated or
may be calculated). In other embodiments, log p(X(j)|λg)−log p(X(j)|λi), or the ratio
or other relationships between the first and second likelihoods p(X(j)|λg) and p(X(j)|Δi) may be calculated without calculating p(X(j)|λg) and/or p(X(j)|λi) explicitly, and the determination for the step 504 may be performed accordingly.
If the respective first log-likelihood ratio does not exceed the respective first threshold, then processing proceeds to the step 506 at which the authentication system determines that the first user U1 is not the predetermined user UD. Thus, when (or in response to) the respective first log-likelihood ratio does not exceed the respective first threshold, the authentication system determines that the first user U1 is not the predetermined user UD. Subsequent steps may then be performed according to the nature and purpose of the biometric authentication—for example, the first user U1 may be denied access to a device, data or services for which the biometric authentication was required.
Otherwise, processing continues at the step 508 at which the authentication system determines whether or not the respective first log-likelihood ratio exceeds a respective second threshold, wherein the second threshold is greater than the first threshold. Continuing the above example, the step 508 may identify whether r(X(j))>θR(j) for the second threshold θR(j). As with the step 504, there are other ways in which the step 508 may be implemented (e.g. calculating a metric/value based on a ratio between the first likelihood and the second likelihood and comparing the metric/value to a threshold based on the respective second threshold). For example, the step 508 may involve determining whether
As with the step 504, for the step 508, in some embodiments, two separate models λg and λi may be implemented and maintained, and p(X(j)|λg) and p(X(j)|λi) may be calculated separately (so that, for example, r(X(j))=log p(X(j)|λg)−log p(X(j)|λi) may then be calculated or
may be calculated). In other embodiments, log p(X(j)|λg)−log p(X(j)|λi), or the ratio
or other relationships between the first and second likelihoods p(X(j)|λg) and p(X(j)|λi) may be calculated without calculating p(X(j)|λg) and/or p(X(j)|λi) explicitly, and the determination for the step 508 may be performed accordingly.
If the respective first log-likelihood exceeds the respective second threshold, then processing proceeds to the step 510 at which the authentication system determines that the first user U1 is the predetermined user UD. Thus, when (or in response to) the respective first log-likelihood exceeds the respective second threshold, the authentication system determines that the first user U1 is the predetermined user UD. Subsequent steps may then be performed according to the nature and purpose of the biometric authentication—for example, the first user U1 may be permitted access to a device, data or services for which the biometric authentication was required.
Otherwise, processing continues at the step 512, at which the authentication system determines to perform another first test or determines to perform the second test. In particular, if the first test has been performed a predetermined maximum number of times, then processing continues at the step 514 in order to perform the second test; if, on the other hand, the first test has been performed fewer than the predetermined maximum number of times, then processing returns to the step 502 in order to perform a further first test. Thus, when the respective first log-likelihood ratio exceeds the respective first threshold and the respective first log-likelihood ratio does not exceed the respective second threshold, the step 512 determines to either (a) perform a further first test when a number of times that the first test has been performed is less than a predetermined maximum number of times or (b) perform the second test when the number of times that the first test has been performed equals the predetermined maximum number of times.
In some embodiments, the method 500 may involve using a counter j as an index for the current first test, i.e. to indicate the number of times that the first test has been performed. Thus, the method 500 may initialise, by initially setting the counter j to be 1. The step 512 may then involve testing whether j=M (where M is the above-mentioned predetermined maximum number of times for which the first test may be performed)—following the above-presented mathematical basis, in this embodiment M=N−1. If j=M then processing continues at the step 514; otherwise, the counter j may be increased by 1 and processing may return at the step 502. It will, however, be appreciated that there are many other ways of work out whether or not to perform an additional first test or whether to perform the second test—for example, the counter j could be initialised to the valve M (where M is the above-mentioned predetermined maximum number of times for which the first test may be performed)—following the above-presented mathematical basis, in this embodiment M=N−1. In this case, the step 512 may involve testing whether j=1: if j=1 then processing continues at the step 514; otherwise, the counter j may be decreased by 1 and processing may return at the step 502. Other mechanisms could be used likewise.
At the step 514 the authentication system obtains an input X(N) based on the one or more biological/biometric characteristics of the first user U1. This may be performed in the same way in which the first input(s) X(j) was/were obtained at the step 502.
At the step 516, the authentication system determines whether or not a second log-likelihood ratio between/for a third likelihood and a fourth likelihood exceeds a third threshold, where the third likelihood is a likelihood of obtaining the input X(N) based on the first model (in which input is obtained from the predetermined user UD), and where the fourth likelihood is a likelihood of obtaining the input X(N) based on the above second model (in which input is not obtained from the predetermined user UD). Continuing the above example, the third and fourth likelihoods may be p(X(N)|λg) and p(X(N)|λi) respectively, as have been discussed above. The step 516 may involve calculating the second log-likelihood ratio r(X(N))=log p(X(N)|λg)−log p(X(N)|λi) and identifying whether r(X(N))>θ(N), for the third threshold θ(N). Alternatively,
and so the step 516 may involve calculating
and identifying whether
which is equivalent to determining whether r(X(N))>θ(N). It will be appreciated that there are other ways in which this log-likelihood ratio may be tested against the third threshold (e.g. calculating a metric/value based on a ratio between the third likelihood and the fourth likelihood and comparing the metric/value to a threshold based on the third threshold) and that some embodiments may not therefore actually involve directly calculating this log-likelihood ratio.
As with the steps 504 and 508, in some embodiments, two separate models λg and λi may be implemented and maintained, and p(X(N)|λg) and p(X(N)|λi) may be calculated separately (so that, for example, r(X(N))=log p(X(N)|λg)−log p(X(N)|λi) may then be calculated
or may be calculated). In other embodiments, log p(X(N)|λg)−log p(X(N)|λi), or the ratio
or other relationships between the third and fourth likelihoods p(X(N)|λg) and p(X(N)|λi) may be calculated without calculating log p(X(N)|λg) and/or log p(X(N)|λi) explicitly, and the determination for the step 516 may be performed accordingly.
If the second log-likelihood exceeds the third threshold, then processing proceeds to the step 510 at which the authentication system determines that the first user U1 is the predetermined user UD. Thus, when (or in response to) the second log-likelihood exceeds the third threshold, the authentication system determines that the first user U1 is the predetermined user UD. As above, subsequent steps may then be performed according to the nature and purpose of the biometric authentication—for example, the first user U1 may be permitted access to a device, data or services for which the biometric authentication was required.
Otherwise, processing proceeds to the step 506 at which the authentication system determines that the first user U1 is not the predetermined user UD. Thus, when (or in response to) the second log-likelihood ratio does not exceed the third threshold, the authentication system determines that the first user U1 is not the predetermined user UD. Subsequent steps may then be performed according to the nature and purpose of the biometric authentication—for example, the first user U1 may be denied access to a device, data or services for which the biometric authentication was required.
As mentioned above, in some embodiments, processing with never return to the step 502, as such embodiments may be arranged to perform exactly one first test (i.e. embodiments in which M=1).
It will be appreciated that the step 504 may be performed after the step 508.
At a step 602, one or more of the models used by the authentication system may be trained (or initialized) and/or updated. As discussed, the biometric authentication may be based on the models λg and λi.
Details of such training/initialization have been set out above.
At a step 604, one or more of the above-mentioned thresholds for performing the biometric authentication may be determined. Techniques for determining the thresholds have been set out above. These thresholds may then be viewed as predetermined thresholds—i.e. they become predetermined once the model(s) have been trained (and potentially based on one or more targets for false acceptances and false rejections).
At a step 606, the authentication system may be configured. For example, the authentication system may be configured so as to use the thresholds determined at the step 604. Additionally, the authentication system may be configured to use any further updated parameters (e.g. if the predetermined maximum number M has been updated).
At a step 608, the authentication system may then perform biometric authentication, using the method 500 discussed above. As illustrated by a dashed line 610, the authentication system may perform multiple separate biometric authentications. As mentioned above, the number of consecutive times this may be performed when each separate biometric authentication involves more than one test may be limited to a predetermined maximum—should this maximum be reached, then one or more further measures may be taken (e.g. a system or device may become locked, thereby requiring a system administrator to unlock the system/device; the model λg may need to be retrained; etc.).
As illustrated by a dotted line 612, processing may return to the step 602 at which the model(s) may be updated and/or re-trained. For example, the model λg may be periodically updated or retrained based on additional biometric data obtained/collected from the predetermined user UD over time (e.g. based on the inputs X(j) and/or X(N) obtained from the user whenever the predetermined user undergoes the biometric authentication). In this way, the authentication system may adapt over time to the predetermined user, so that the model λg may become more accurate and so that false rejections occur less frequently.
4—Modifications
It will be appreciated that the methods described have been shown as individual steps carried out in a specific order. However, the skilled person will appreciate that these steps may be combined or carried out in a different order whilst still achieving the desired result.
It will be appreciated that different statistical tests (other than the using the log-likelihood ratio) could be used instead.
It will be appreciated that embodiments of the invention may be implemented using a variety of different information processing systems. In particular, although the figures and the discussion thereof provide an exemplary computing system and methods, these are presented merely to provide a useful reference in discussing various aspects of the invention. Embodiments of the invention may be carried out on any suitable data processing device, such as a personal computer, laptop, personal digital assistant, mobile telephone, set top box, television, server computer, etc. Of course, the description of the systems and methods has been simplified for purposes of discussion, and they are just one of many different types of system and method that may be used for embodiments of the invention. It will be appreciated that the boundaries between logic blocks are merely illustrative and that alternative embodiments may merge logic blocks or elements, or may impose an alternate decomposition of functionality upon various logic blocks or elements.
It will be appreciated that the above-mentioned functionality may be implemented as one or more corresponding modules as hardware and/or software. For example, the above-mentioned functionality may be implemented as one or more software components for execution by a processor of the system. Alternatively, the above-mentioned functionality may be implemented as hardware, such as on one or more field-programmable-gate-arrays (FPGAs), and/or one or more application-specific-integrated-circuits (ASICs), and/or one or more digital-signal-processors (DSPs), and/or one or more graphical processing units (GPUs), and/or other hardware arrangements. Method steps implemented in flowcharts contained herein, or as described above, may each be implemented by corresponding respective modules; multiple method steps implemented in flowcharts contained herein, or as described above, may be implemented together by a single module.
It will be appreciated that, insofar as embodiments of the invention are implemented by a computer program, then one or more storage media and/or one or more transmission media storing or carrying the computer program form aspects of the invention. The computer program may have one or more program instructions, or program code, which, when executed by one or more processors (or one or more computers), carries out an embodiment of the invention. The term “program” as used herein, may be a sequence of instructions designed for execution on a computer system, and may include a subroutine, a function, a procedure, a module, an object method, an object implementation, an executable application, an applet, a servlet, source code, object code, byte code, a shared library, a dynamic linked library, and/or other sequences of instructions designed for execution on a computer system. The storage medium may be a magnetic disc (such as a hard drive or a floppy disc), an optical disc (such as a CD-ROM, a DVD-ROM or a BluRay disc), or a memory (such as a ROM, a RAM, EEPROM, EPROM, Flash memory or a portable/removable memory device), etc. The transmission medium may be a communications signal, a data broadcast, a communications link between two or more computers, etc.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/064920 | 5/28/2020 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2021/239239 | 12/2/2021 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
10467394 | Asulin | Nov 2019 | B2 |
10764281 | Gaeta | Sep 2020 | B1 |
20170242995 | Bassenye-Mukasa | Aug 2017 | A1 |
20190295554 | Lesso | Sep 2019 | A1 |
20200228339 | Barham | Jul 2020 | A1 |
Number | Date | Country |
---|---|---|
WO-2016001657 | Jan 2016 | WO |
WO-2020144510 | Jul 2020 | WO |
Entry |
---|
Wikipedia, ‘Mixture Model’, pp. 1-14, last edited Apr. 13, 2020 retrieved from: https://en.wikipedia.org/w/index.php?title=Mixture_model&oldid=950749555. |
Douglas A. Reynolds, et al., “Speaker verification using adapted Gaussian mixture models,” Digital Signal Processing 10, pp. 19-41, Jan. 1, 2000, available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.117.338&rep=rep1&type=pdf. |
A. P. Dempster, et al., “Maximum likelihood from incomplete data via the EM algorithm,” Journal of the Royal Statistical Society. Series B (Methodological), vol. 39, No. 1. (Jan. 1, 1977), pp. 1-38. Available at: http://links.jstor.org/sici?sici=0035-9246%281977%2939%3A1%3C1%3AMLFIDV%3E2.0.CO%3B2-Z. |
L.R. Rabiner, et al., “Fundamentals of Speech Recognition”, Englewood Cliffs, N.J., PTR Prentice Hall, Jan. 1, 1993, pp. 1-386 (ch. 1-6). |
Number | Date | Country | |
---|---|---|---|
20220121732 A1 | Apr 2022 | US |