The invention disclosed herein relates generally to fraud detection, and more particularly to a fraud detection mechanism adapted for inconsistent data collection.
There are several ways that mail pieces can be marked to evidence payment of postage for delivery of the mail piece. A mail piece could include, for example, letters, magazines, postcards, packages, parcels, etc. For example, a stamp could be applied to the mail piece, or a mailing machine could be used to print a postage meter indicium on the mail piece or a label applied to the mail piece. With the proliferation of communications networks, e.g., the Internet, it is also possible to print an indicium, either directly on the mail piece or on a label that is affixed to the mail piece, that evidences payment of postage using a personal computer coupled to the Internet and a general purpose printer coupled to the computer.
Regardless of which method is utilized to evidence payment of postage, a verification system is provided to ensure that the payment evidence is both authentic, i.e., not a counterfeit, and original, i.e., not a duplicate. For example, stamps are cancelled by a postal mark, thereby preventing them from being reused. Postal meter indicia includes a two-dimensional (2D) barcode and certain human-readable information. Some of the data included in the barcode could include, for example, the meter manufacturer identification, meter model identification, meter serial number, values for the ascending and descending registers of the meter, postage amount, and date of mailing. In addition, a digital signature may be required to be created by the meter for each mail piece and placed in the digital signature field of the barcode. Verification of the signature provides authentication of an indicium, while other portions of the included data can help detect duplicate indicia.
In some forms of postage, fraud detection is performed utilizing a confirmation number applied to each mail piece, along with an indicium, that uniquely identifies each mail piece for which postage has been paid. Upon delivery of the mail piece, the letter carrier, i.e., delivery person or “postman,” that is delivering the mail piece is required to scan the confirmation number, and the data is stored in a central database. Thus, in theory, if a confirmation number is scanned more than once, it is an indication that the same confirmation number has been improperly utilized more than once, thereby attempting to defraud the delivery service of payment for the second mail piece.
Fraud detection mechanisms work well if the data collection methods are consistent enough to provide accurate data. For example, fraud detection mechanisms utilized for credit cards, phone cards, and cellular telephones rely on the accuracy of data to allow fraud detection decisions to be made based upon simple rules. For example, a large increase in the frequency of purchases or calls may indicate a stolen credit card or phone card. Similarly, transactions that occur within a short time period spread over large geographic distances may also indicate fraud. Such fraud detection mechanisms, however, assume the data is correct and base decisions upon that assumption. This is largely due to the fact that there is a closed loop between the payer and the service provider/biller. Thus, if a transaction was processed, e.g., purchase with a credit card, call made using a phone card or cellular telephone, the data with respect to that transaction is “hard” data, i.e., each transaction is typically unique and has actually occurred.
Unfortunately, the data collected from the scanning system for the mail piece delivery fraud detection system described above is inconsistent and therefore may not be completely accurate, thereby leading to erroneous decisions about fraudulent use of confirmation numbers for delivery of mail pieces. For example, failure by the letter carrier to scan the confirmation number will completely negate the fraud detection mechanism; therefore, it is imperative that the letter carrier scans the confirmation number upon delivery of the mail piece. To ensure this, most delivery services will discipline letter carriers if the confirmation numbers are not scanned. As a result, some letter carriers will scan the confirmation number on a mail piece multiple times to ensure that it has been scanned properly. These multiple scans may occur within a short period of time, e.g., in rapid succession when a mail piece is delivered, or over a longer period of time, e.g., prior to leaving a central facility to deliver the mail pieces and at the actual time of delivery. Thus, multiple scans may be recorded for the same mail piece. Another situation that can result in multiple scans for the same mail piece occurs if the letter carrier scans the mail piece and then can not actually deliver the mail piece, thereby requiring multiple delivery attempts. For example, if the letter carrier scans the mail piece upon approaching the intended recipient's door, and the intended recipient is not at home to accept the mail piece, the letter carrier must make a second delivery attempt. When the mail piece is delivered the next day, it may again be scanned, resulting in multiple scans of the same mail piece. This inconsistency of the data collection makes the data difficult to use for fraud detection. For example, two delivery scans of a confirmation number within a short period of time could indicate either (i) the label including the confirmation number and associated indicium has been copied, and two mail pieces have been sent using the same confirmation number and indicium (but only paid for once), or (ii) the letter carrier scanned the confirmation number on the same mail piece twice. Similarly, two scans of a confirmation number separated by some time period could indicate either (i) a copied confirmation number was fraudulently used, or (ii) more than one attempt was made to deliver the mail piece. Duplicate data may be, therefore, the result of either improper system operation or fraudulent activity.
Thus, if each time a confirmation number was scanned more than once resulted in a determination of possible fraudulent activity, a large number of unnecessary fraud investigations would occur. To reduce the number of fraud investigations based on multiple scans of the same confirmation number, current fraud detection mechanisms only make a decision of potentially fraudulent activity if a high number, such as, for example, five or more, of delivery scans occur for the same confirmation number. This solution, however, has inherent problems in that it may not detect actual fraudulent activity. For example, the confirmation numbers can be copied one, two, three or even four times and reused to send multiple mail pieces to different locations, while only paying for delivery of a single mail piece. As a result, the potential for large scale fraud to be committed without fear of being detected exists.
Thus, there exists a need for a fraud detection mechanism that is adapted for inconsistent data collection.
The present invention alleviates the problems associated with the prior art and provides a fraud detection mechanism that is adapted for inconsistent data collection.
According to embodiments of the present invention, the data from scanned confirmation numbers is collected and stored in a database. The data is analyzed to determine normal operational variations from ideal system behavior, e.g., the percentage of confirmation numbers that are scanned multiple times. Profiles are developed for each individual sender, e.g., the number of multiple scans performed per confirmation number generated by each sender, and for scanning activity that meets predetermined parameters, such as delivery areas, e.g., the number of multiple scans performed per letter carrier route. If the sender's profile differs significantly from the normal operational variations, there is an indication of potential fraudulent activity and an investigation can be initiated. For example, if a large percentage of a particular sender's confirmation numbers have multiple delivery scans, while only a small percentage of all confirmation numbers are scanned multiple times, there is an indication of possible fraudulent activity by that sender. Similarly, if the data for a specific confirmation number differs significantly from the profile for data in its delivery area, there is an indication of potential fraudulent activity and an investigation of that confirmation number can be initiated. For example, if multiple delivery scans occur for a single confirmation number on a letter carrier route where confirmation numbers are rarely or never scanned more than once, there is an indication of possible fraudulent activity. By analyzing a combination of sender and delivery scan data with system wide scan data, the effect of inconsistent data is minimized to significantly reduce the number of erroneous indications of fraudulent activity while still providing a high level of fraud detection.
Therefore, it should now be apparent that the invention substantially achieves all the above aspects and advantages. Additional aspects and advantages of the invention will be set forth in the description that follows, and in part will be obvious from the description, or may be learned by practice of the invention. Moreover, the aspects and advantages of the invention may be realized and obtained by means of the instrumentalities and combinations particularly pointed out in the appended claims.
The accompanying drawings illustrate a presently preferred embodiment of the invention, and together with the general description given above and the detailed description given below, serve to explain the principles of the invention. As shown throughout the drawings, like reference numerals designate like or corresponding parts.
In describing the present invention, reference is made to the drawings, wherein there is seen in
According to an embodiment of the invention, the inconsistencies in scanning practices can be mitigated by determining normal variations in scan data and identifying senders whose scan data varies significantly beyond the normal variations. Normal variations in scan data are determined based upon aggregate scan data. The aggregate scan data is utilized to create a scan profile table for use as the basis for determining normal variations.
At step 34, the contents of database 22 is again sorted, this time based upon the number of scans for each confirmation number 18 over the given period of time that meet a second predetermined parameter, where the second parameter is a specific subset of the first parameter used in step 30. For example, for a first parameter of class of service, a second parameter subset may be based on weight, zone based rate, time to deliver, etc. For a first parameter of a geographic region, a second parameter subset may be based on small geographic area subsets of the geographic area used in step 30. Each geographic area subset can correspond to a large geographic area or a small geographic area. For example, a large geographic area could be defined as the entire area having the same first three digits for the zip code, while a small geographic area could be defined as a single letter carrier's specific delivery route. It should be understood that any number of subsets may be used as desired. In step 36, a profile of scans table, including data similar to the profile created above in step 32, is created using the data sorted in step 34 for each geographic area subset. In step 38, a single Scan Profile Table for delivery scan data is created by combining the profile tables created in step 32 (for the first parameter, e.g., entire geographic area) and in step 36 (for the second parameter, e.g., each geographic area subset) into a single table. An example of such a table is illustrated in
In step 40, the database 22 is again sorted, this time based upon sender information. It should be noted that the confirmation numbers 18 need not provide the specific identify of the sender, but instead need only be linked to a specific sender. It may be necessary, therefore, to use other databases that relate the specific identity of the sender to each confirmation number 18. Such databases currently exist for Internet Postage Evidencing Systems approved by the USPS. In step 42, a profile of scans table, including data similar to the profile created above in step 32, is created using the data sorted in step 34 for each sender. In step 44, a single Scan Profile Table for sender scan data is created by combining the profile tables created in step 42 for each sender into a single table. An example of such a table is illustrated in
Referring now to
If it is determined in step 52 that there are a sufficient number of total scans for the selected sender, then in step 54 it is determined if the selected sender's multiple scan rate (from column three, Multiple Scan %, of the Scan Profile Table illustrated in
If in step 56 it is determined that an extended fraud detection check is required, then in step 60 a sender specific table of scan data by geographic area is created. This sender specific table enables the sender's data to be analyzed by each geographic area. As a result, a more accurate assessment of whether or not a sender is committing fraud can be performed.
If it is determined in step 66 that there are a sufficient number of total scans for the selected sender in the specified geographic area, then in step 68 it is determined if the selected sender's multiple scan rate in that area is greater than a threshold value for that geographic area (obtained from the Scan Profile Table for delivery scan data illustrated in
The advantages of performing the extended fraud detection check can be seen by examining the data in the two example tables illustrated in
Referring again back to
It should be noted that the fraud detection processing can be performed daily, weekly, monthly or any other time period as desired. Additionally, the processing can be performed either by the carrier, e.g., postal service, the party that operates the data center 14, or any other third party that has access to the database 22 as authorized by the postal service. It should be noted that while the above embodiments have been described with respect to multiple scans of delivery confirmation numbers, the invention is not so limited and could also be extended to other data. For example, the number or percentage of forwarded packages, number or percentage of packages with insufficient postage, etc. could also be used for fraud detection. In addition, while the above embodiments have been described with respect to postal delivery confirmation fraud detection, the invention is not so limited and can also be applied to other fraud detection systems, particularly systems where data collection is inconsistent or incomplete. For example, fraud detection systems were the data collected represents only a sample of the items passing through the system, such as the Information Based Indicia Program, can compare the sampled data with aggregate data (e.g., the total amount of postage sampled for a given user versus what is expected for that user given the sampling rate and their total postage purchased). It can also be extended to systems that process other items of value. For example, manufacturer coupon redemption rates for individual merchants could be analyzed to determine if a particular merchant's coupon redemption rates were significantly higher than expected. Each coupon includes a unique identification number (e.g., a fifty cents coupon for soap has a different identification number than a fifty cents coupon for deodorant) that is scanned upon redemption of the coupon. Higher than expected redemption rates might indicate that the merchant might be redeeming the same coupon or copies of the coupon multiple times and pocketing the money, rather than the merchant's customers redeeming the coupons.
Thus, according to embodiments of the present invention, a fraud detection mechanism that is adapted for inconsistent data collection is provided. The data is analyzed to determine normal operational variations from ideal system behavior. Profiles are developed for each individual sender, e.g., the number of multiple scans performed per confirmation number generated by each sender, and delivery areas, e.g., the number of multiple scans performed per specific geographic area. If the sender's profile differs significantly from the normal operational variations, there is an indication of potential fraudulent activity and an investigation can be initiated. By analyzing a combination of sender and delivery scan data with system wide scan data, the effect of inconsistent data is minimized to significantly reduce the number of erroneous indications of fraudulent activity while still providing a high level of fraud detection.
While preferred embodiments of the invention have been described and illustrated above, it should be understood that these are exemplary of the invention and are not to be considered as limiting. Additions, deletions, substitutions, and other modifications can be made without departing from the spirit or scope of the present invention. Accordingly, the invention is not to be considered as limited by the foregoing description.
This application claims the benefit of U.S. Provisional Application Ser. No. 60/591,822, filed on Jul. 28, 2004, the specification of which is hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
60591822 | Jul 2004 | US |