The present invention relates to the technical field of transaction detection, and in particular, to a fast anti-money laundering detection method based on a transaction graph.
For decades, money laundering has been a major criminal activity within financial computing systems. In today's technology-driven society, criminals are using every possible means at their disposal to launder the ill-gotten money from their illegal activities. With the development of technology, the dynamic nature of the information system has reduced the effectiveness of the existing money laundering detection mechanism, so the process of money laundering crime is more covert, and the money laundering methods are more complex, intelligent and organized, which brings new problems and challenges to conventional anti-money laundering supervision systems.
As the primary goal of anti-money laundering supervision, anomaly detection has become the focus of current research. At present, the main challenge for abnormal transaction detection is to build effective rules and models from massive and heterogeneous transaction data to make the anti-money laundering detection positioning more rapid and accurate, so as to achieve the purpose of effective financial market security supervision. Therefore, it is a trend to break the convention thinking of anti-money laundering supervision and build a data-based intelligent anti-money laundering supervision through artificial intelligence and big data analysis. Through reference search in the existing technologies, it is found that the detection of abnormal transactions in the context of anti-money laundering mostly focuses on the detection of single transaction or single node. For example, in the Chinese patent “Method and system for detecting money laundering transactions in complex associated transactions” (with the authorization number of CN112508705A), it is proposed to establish a recurrent neural network model to collect a plurality of transaction data that are not known whether it is money laundering, and the transaction data are input into a trained recurrent neural network model in turn to get the determination result of whether it is money laundering. In the Chinese patent “Intelligent suspicious transaction monitoring method based on semi-supervised graph neural network” (with the authorization number of CN110400220A), it is proposed to input individual transaction features of high-risk density capital transaction network and accounts into a semi-supervised graph neural network, and the semi-supervised graph neural network outputs the risk probability of capital transaction of the account, and determines the account with the risk probability of capital transaction higher than a first threshold as the account with high money laundering risk. The existing relevant research has made good progress in improving the accuracy of abnormal transaction detection, but there are still two major defects: first, since the real-time transaction graph is constantly dynamic, the existing anti-money laundering detection method based on graph neural network cannot accurately monitor the dynamic transaction graph in real time, and second, since the existing anti-money laundering scheme focuses on the analysis of single node or single transaction, it is difficult to fully evaluate the impact of the relationship, that is, the potential social network relationship in the transaction graph, on the risk.
The purpose of the present invention is to provide a fast anti-money laundering detection method based on a transaction graph to overcome the defects of the above existing technologies that cannot accurately monitor the dynamic transaction graph in real time and focus on analyzing a single node or a single transaction.
The purpose of the present invention can be realized through the following technical solutions:
A fast anti-money laundering detection method based on a transaction graph, including the following steps:
In the transaction graph, a node denotes a user or a merchant and an edge denotes a transaction.
The transaction graph uses an Elliptic data set as the standard for collecting transaction features.
Further, in each transaction of the transaction graph, 166 features are collected, of which 94 features are local information of a transaction account and the other 72 features are aggregated transaction data from forward/backward aggregate transaction information of a central node as aggregation features.
Further, the local information of the transaction account includes time step, transaction fee, input/output number, output volume and a plurality of pieces of sum data, and the transaction data corresponding to the aggregation feature includes maximum value, minimum value, and standard deviation.
The directed graph is denotes as G=((V, M), E), where V={vu1, vu2, vu3, . . . , vun} denotes a series of transaction users, M={vm1, vm2, vm3, . . . , vmn} denotes a series of merchants, and E={e1, e2, e3, . . . , e|E|} denotes a series of transactions (when a transaction occurs between a user and a merchant, an edge is created between two nodes).
The formula used for updating an edge feature in the graph neural network at step S5 is:
where e′ij denotes an edge feature after update, NN denotes a neural network including two fully connected layers with an activation function of ReLu, eij denotes an edge feature before update, vg denotes a feature vector corresponding to a directed graph G of the transaction graph, vi and mj denotes nodes in the directed graph, of which viϵV, mjϵM.
Further, the edge feature eij denotes the edge between the nodes i and j, and the corresponding feature vector includes transaction times and transaction location.
The formula used for updating a node feature in the graph neural network at step S5 is:
The formula used for updating a feature vector of the directed graph G in the graph neural network at step S5 is:
Where v′g denotes a feature vector of the directed graph G after update,
The graph neural network based on location information includes an attention mechanism based graph convolutional network, accepting χ∈N
Further, the graph convolutional network is provided with a temporal attention layer to better capture the pattern of transaction changing over time, specifically shown in the following formula:
Further, the graph neural network based on location information is provided with a prediction layer, an output of the prediction layer is a fraud probability of the transaction, and the prediction layer is provided with a loss function L, specifically shown in the following formula:
FIGURE is a schematic diagram of a flow chart the present invention
The present invention is described in detail below with reference to the accompanying drawings and specific embodiments. The present embodiment is implemented on the premise of the technical scheme of the present invention, and gives the detailed implementation method and specific operation process, but the protection scope of the present invention is not limited to the following embodiment.
As shown in FIGURE, a fast anti-money laundering detection method based on a transaction graph, including the following steps:
In transaction graph, a node denotes a user or a merchant and an edge denotes a transaction.
The transaction graph uses an Elliptic data set as the standard for collecting transaction features.
In each transaction of the transaction graph, 166 features are collected, of which 94 features are local information of a transaction account and the other 72 features are aggregated transaction data from forward/backward aggregate transaction information of a central node as aggregation features.
The local information of the transaction account includes time step, transaction fee, input/output number, output volume and a plurality of pieces of sum data, and the transaction data corresponding to the aggregation feature includes maximum value, minimum value, and standard deviation.
The directed graph is denotes as G=((V, M), E), where V={vu1, vu2, vu3, . . . , vun} denotes a series of transaction users, M={vm1, vm2, vm3, . . . , vmn} denotes a series of merchants, and E={e1, e2, e3, . . . , e|E|} denotes a series of transactions (when a transaction occurs between a user and a merchant, an edge is created between two nodes).
The formula used for updating an edge feature in the graph neural network step S5 is:
The edge feature eij denotes the edge between the nodes i and j, and the corresponding feature vector includes transaction times and transaction location.
The formula used for updating a node feature in the graph neural network step S5 is:
The formula used for updating a feature vector of the directed graph G in the graph neural network step S5 is:
Where v′g denotes a feature vector of the directed graph G after update,
The graph neural network based on location information includes an attention mechanism based graph convolutional network, accepting χ∈N
The graph convolutional network is provided with a temporal attention layer to better capture the pattern of transaction changing over time, specifically shown in the following formula:
The graph neural network based on location information is provided with a prediction layer, an output of the prediction layer is a fraud probability of the transaction, and the prediction layer is provided with a loss function L, specifically shown in the following formula:
In the present embodiment, there are two steps for the anti-money laundering detection: pre-access and post-monitoring. In pre-access, a blacklist and a money laundering rule are taken as a benchmark to determine whether a transaction order is fraud, and if the transaction is of high risk, then the transaction will be blocked directly. Post-monitoring refers to the use of the algorithmic model for predicting the probability of money laundering to predict the risk of transactions, and the transactions involving high probability of money laundering are handed over to experts for further determination.
During the specific implementation, the transaction data in the transaction graph will arrive in the form of distributed queues, and will be admitted in advance, and the GAT network will be used for prediction, and a memory database will be used to record these historical transaction data. If the prediction result shows that the order is a high-risk money laundering transaction, it shall be handed over to experts for determination, and the determination result will be returned to the historical database. The data in the historical database may play the role of offline update. On the one hand, the data may act on the GAT network, enabling the network to be updated in real time, and on the other hand, the data may help improve the rules of pre-access.
Compared with the prior art, the present invention has the following beneficial effects:
Further, it should be noted that the specific embodiments described in the present specification may be given different names, and the above content described in the present specification is only an example of the structure of the present invention. All equivalent alternatives or simple changes based on the structure, features and principles of the present invention are included in the scope of protection of the present invention. Those skilled in the art to which the present invention belongs may make various modifications or supplements to the specific examples described or adopt similar methods, and as long as they do not deviate from the structure of the present invention or go beyond the scope defined in the claims, they shall fall within the scope of protection of the present invention.
| Number | Date | Country | Kind |
|---|---|---|---|
| 202111528301.8 | Dec 2021 | CN | national |
This application is a 371 of international application of PCT application serial no. PCT/CN2022/106409, filed on Jul. 19, 2022, which claims the priority benefit of China application serial no. 202111528301.8, filed on Dec. 14, 2021. The entirety of each of the above-mentioned patent applications is hereby incorporated by reference herein and made a part of this specification.
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/CN2022/106409 | 7/19/2022 | WO |