This application claims priority to Chinese Patent Application No. 202210590822.4, filed on May 27, 2022, the contents of which are hereby incorporated by reference.
The application relates to the technical field of computers, and in particular to a multi-granularity perception integrated learning method, a device, computer equipment and a storage medium for the data analysis of users' online behaviors.
With the wide application of the Internet in many practical fields, such as information security, economic management, social governance and medical biology, more and more data are produced to record users' online behavior information. How to extract knowledge from users' online behavior data more effectively and accurately to meet actual needs still faces many challenges. However, there has been little applied research that combines granular computing and ensemble learning on users' online behavior data. Users' online behavior data belong to structured data, which are easy to query, modify and compute, and from which a higher level of data may usually be abstracted. This abstraction process is called granulation, and multi-granularity perception is a method that performs granulation conversion on the data repeatedly and to different degrees, thus generating abstract multi-granularity characteristics, so as to achieve the objective of multi-level and multi-perspective perception of the data. From the perspective of cognitive computing, multi-granularity perception is concept learning based on granular computing, which is beneficial to the formation of conceptual knowledge. At present, how to reasonably granulate users' online behavior data at multiple granularities and how to carry out efficient, accurate and interpretable integrated learning on multi-granularity structured data have rarely been studied systematically, so it is very valuable and necessary to carry out research on the multi-granularity perception integrated learning method for users' online behavior data.
Based on this, it is necessary to provide a multi-granularity perception integrated learning method, device, computer equipment and storage medium that may apply the granular computing theory to the analysis of users' online behavior.
The application relates to a multi-granularity perception integrated learning method, including the following steps:
In one embodiment, the method further includes:
In one embodiment, the method further includes:
In one embodiment, the method further includes:
In one embodiment, the method further includes:
In one embodiment, the method further includes:
repeatedly performing an iteration of the particle swarm algorithm according to the initial value until an end condition is met, and ending the iteration; and
In one embodiment, the method further includes that the base learner is a tree model.
A multi-granularity perception integrated learning device, including:
A computer device includes a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the following steps are realized:
A computer-readable storage medium has a computer program stored thereon, and the computer program may realize the following steps when executed by a processor:
The multi-granularity perception integrated learning method, device, computer equipment and storage medium preprocess the data set of users' online behavior; through the multi-granularity perception data derivation algorithm, the attribute characteristics are processed with the particle as the unit, and the data are then divided into granular layers according to the granularity characteristics to obtain multi-level derivative data sets; based on the base learning algorithm, a plurality of preset base learners are trained according to the derivative attribute values of the training data set in the derivative data sets and the particle label values of the corresponding granular layers, and the trained base learners are obtained; the training data set is input into the trained base learners, the self-prediction error is calculated, and the mean square error with the particle as the unit and the mean square error with the granular layer as the unit are counted; the weight information is determined according to the errors of particles and granular layers, where the smaller the error value of a particle or granular layer, the larger its weight value; the testing data set is input into the trained base learners to obtain the prediction results of the testing data set, and the prediction results are then weighted and integrated according to the weight information to output the multi-granularity perception integrated learning prediction results of the users' online behavior data. Based on the users' online behavior data, the application proposes to transform the data from the particle visual field and the granular layer perspective to derive a plurality of data sets with different visual fields, and divides the weights into two levels, granular layer and particle, through the weighted integration strategy, thus improving the interpretability of the users' online behavior analysis and the accuracy of the prediction results.
In order to make the objective, technical scheme and advantages of this application clearer, the application will be further described in detail with the attached drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the application, and are not used to limit the application.
In one embodiment, as shown in
The data in the preprocessed data set includes attribute characteristics, granularity characteristics and particle label values.
The theory of granular computing mainly involves three concepts of granular computing: particle, granularity and granular layer, and the formal descriptions are given below:
The concept of granularity in this application not only aggregates data hierarchically from bottom to top from the point of view of data storage, but also simulates the ability of human beings to recognize things abstractly. The first thing to do is to convert the data into a standard data format suitable for the application through data preprocessing, so that the data set has abstract multi-granularity characteristics and generates particle label values. The multi-granularity characteristics and the particle labels may be obtained by designing a data structure framework before collecting data. For example, when collecting online behavior records, the attributes of the account, department and company to which the online behavior data belong are set, and these attributes may be used as multi-granularity characteristics. In addition, the multi-granularity characteristics and particle label values may also be generated from the user's online behavior data set by hierarchical clustering.
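As a non-limiting sketch of the hierarchical clustering option mentioned above, multi-granularity characteristics and particle label values may be derived by cutting one dendrogram at two different depths; the function name, parameter names and toy data below are illustrative assumptions, not part of the application:

```python
# Sketch: deriving multi-granularity characteristics by hierarchical
# clustering when the data set carries no natural account/department/
# company attributes. All names and parameters here are hypothetical.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

def derive_granularity_labels(features, n_fine=6, n_coarse=2):
    """Cut one dendrogram at two depths to obtain a fine granular layer
    (M1-like) and a coarse granular layer (M2-like) of particle labels."""
    Z = linkage(features, method="ward")
    fine = fcluster(Z, t=n_fine, criterion="maxclust")      # finer particles
    coarse = fcluster(Z, t=n_coarse, criterion="maxclust")  # coarser particles
    return fine, coarse

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (20, 3)), rng.normal(8, 1, (20, 3))])
fine, coarse = derive_granularity_labels(X)
```

Each data row thereby receives one particle label per granular layer, which plays the role of the Mk columns described above.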
In this embodiment, the “user online behavior data set” is taken as experimental data, which comes from the competition of “Analysis of abnormal behaviors of users online based on UEBA” under the Datafountain platform. The data description is shown in the following table, in which “account” and “group” are taken as the granularity characteristics of this data set:
Table 2 shows the data set style obtained after preprocessing the data set of users' online behavior:
The serial number set {1, 2, 3, . . . , v, . . . , V} represents the serial numbers of the data set; xvq represents the q-th attribute characteristic value of the v-th data, xvk represents the k-th granularity characteristic value of the v-th data, T={T1, T2, . . . , Tq, . . . , TQ} is the set of attribute characteristics, and M={M1, M2, . . . , Mk, . . . , MK} is the set of granularity characteristics. For example, M1 stands for the granularity characteristics of the "Account" granular layer, M2 for the "Department" granular layer and M3 for the "Company" granular layer. The numbers under the T1 and T2 characteristics in the table indicate that they are numerical characteristics, and the symbols under the Tq and TQ characteristics indicate that they are symbolic characteristics. M1 indicates the finest granular layer abstracted from the data set, M2 indicates the granular layer with larger granularity than M1, and so on, until the maximum granular level required for solving the problem is reached, where 1≤k≤K and |Gi|≥1.
S104, inputting the preprocessed data set into a pre-designed multi-granularity perception data derivation algorithm, performing multi-granularity perception processing according to the characteristic categories of the attribute characteristics and the particle label values through the multi-granularity perception data derivation algorithm to obtain a multi-granularity perception data set, and dividing the multi-granularity perception data set into layers according to the granularity characteristics to obtain a multi-level derivative data set.
The derivative data sets are divided into training data sets and testing data sets; the data in the derivative data set include the derivative attribute values and the particle label values of the corresponding granular layers.
The Multi-granularity Perception Data Derivation Algorithm (MPDDA) essentially provides data diversity. It simulates the process of human cognition of the world and deeply cognizes the data from multi-granularity perspectives and different particle structure perspectives, so that the model of the application is interpretable on the data; the differentiated data processed and derived based on the granularity characteristics and particle structure of the data is beneficial to computer cognition and learning.
The data derived from the original data set include three categories: Q columns of attribute values, the particle labels Mi corresponding to the granular layers, and the result label values. The particle label Mi will be trained and learned by the model as an important characteristic together with the derivative attribute values, and it is the retention of the particle label values that makes the other derivative attribute values meaningful. In the data generated by practical problems, the values derived from some characteristics through multi-granularity perception are meaningless and unexplainable on their own; only when such characteristics appear in the training set together with the granularity characteristics may they be interpreted for training. The result label value of this embodiment represents the abnormal degree of online behavior, and the result label value is used as the optimization goal in supervised learning tasks.
S106, based on a base learning algorithm, training a plurality of preset base learners according to the derivative attribute values of the training data set and the particle label values of the corresponding granular layers, so that the trained base learners are obtained.
The number of base learners is the same as the layer number of derivative data sets.
The base learners in the application may be homogeneous or heterogeneous, and different base learners may be selected according to the actual situation in the application process. The input data of the base learners consist of K derivative data sets. In particular, when k=1, the attribute characteristics of this data set are the same as those of the preprocessed data set. Because the granularity characteristic M1 is merely the data number of the data set and may not promote better learning of the model, the granularity characteristic M1 is not added in the process of training on the first-layer derivative training set.
In the actual process of data processing, it is found that the eigenvalues of some characteristics generated by the multi-granularity perception data derivation algorithm deviate from the corresponding original characteristic connotations, which makes the characteristics difficult to understand; they form new connotations only when bound with the granularity characteristics from which they are derived. Based on this, it needs to be specified that the base learner may be a tree model, and the global normalization operation may be omitted when preprocessing the data for the tree model.
S108, inputting the training data set into the trained base learner, calculating a self-prediction error of the training data set predicted by the trained base learner, and counting a mean square error with the particle as the unit and a mean square error with the granular layer as the unit according to the self-prediction error.
The self-prediction error is calculated by the result label value of the training data set and the output result of the base learner.
The premise that the particle weights obtained from the training data set may be reused in the testing set is that the particle label set of each granular layer in the testing set is the complete set of particle labels of each granular layer in the data set of users' online behavior.
S110, obtaining the particle-level weight according to the mean square error with the particle as the unit, obtaining the granularity-level weight according to the mean square error with the granular layers as the unit, and determining the weight information according to the particle-level weight and the granularity-level weight.
The application provides a weighted integration strategy based on particle mean square error (MSE) optimization. Particle weighting mechanism is to optimize and adjust the prediction effect of each base learner by giving weights to particles in different granular layers, and the particle structure with good prediction effect will be given greater weight, otherwise it will be given less weight. The data objects in each particle share the weight, which may reduce the computational complexity and the possibility of over-fitting. Essentially, the weighted integration strategy of the application optimizes the model from the particle visual field and particle layer perspective.
S112, inputting the testing data set into the trained base learner to obtain the prediction results of the testing data set, and performing weighted integration on the prediction results according to the weight information to output the multi-granularity perception integrated learning prediction results of the user's online behavior data.
In the multi-granularity perception integrated learning method, a preprocessed data set including attribute characteristics, granularity characteristics and particle label values is obtained by preprocessing the data set of users' online behaviors; multi-granularity perception processing is performed according to the characteristic categories of the attribute characteristics and the particle label values through the multi-granularity perception data derivation algorithm, and the data is then divided into granular layers according to the granularity characteristics to obtain multi-level derivative data sets; based on the base learning algorithm, a plurality of preset base learners are trained according to the derivative attribute values of the training data set in the derivative data sets and the particle label values of the corresponding granular layers, and the trained base learners are obtained; the training data set is input into the trained base learners, the self-prediction error is calculated, and the mean square error with the particle as the unit and the mean square error with the granular layer as the unit are counted; the weight information is determined according to the errors of particles and granular layers, where the smaller the error value of a particle or granular layer, the larger its weight value; the testing data set is input into the trained base learners to obtain the prediction results of the testing data set, and the prediction results are then weighted and integrated according to the weight information to output the multi-granularity perception integrated learning prediction results of the users' online behavior data.
Based on the users' online behavior data, the application proposes to transform the data from the particle visual field and the granular layer perspective to derive a plurality of data sets with different visual fields, and divides the weights into two levels, granular layer and particle, through the weighted integration strategy, thus improving the interpretability of the users' online behavior analysis and the accuracy of the prediction results.
In one embodiment, the method further includes the following steps: obtaining the data set of the user's online behavior, and preprocessing the data set; generating the attribute characteristics, the granularity characteristics and the particle label values of data according to attributes in the data structure of the data set to obtain the preprocessed data set; the attributes in the data structure of the data set are an account, a department and a company to which the data belongs; or generating the attribute characteristics, the granularity characteristics and the particle label values of data according to the data set through a hierarchical clustering method to obtain the preprocessed data set.
In one embodiment, the method further includes: inputting the preprocessed data set into a pre-designed multi-granularity perception data derivation algorithm; taking the particle label value as one of the attribute characteristics, discriminating the attribute characteristics of the preprocessed data: if the attribute characteristics are numerical characteristics, the numerical characteristics are normalized within particles, and if the attribute characteristics are symbolic characteristics, the symbolic characteristics are recoded within particles, and a multi-granularity perception data set is obtained; dividing the multi-granularity perception data set into a multi-granularity training set and a multi-granularity testing set; dividing the multi-granularity training set and the multi-granularity testing set by granular layer according to the granularity characteristics to obtain the multi-level training data set and the multi-level testing data set respectively; the training data set and the testing data set constitute a derivative data set.
Specifically, the flow chart of the multi-granularity perception data derivation algorithm is shown in
Intra-granular normalization operation for numerical characteristics and intra-granular recoding for symbolic characteristics are the core algorithms of multi-granularity perception data derivation. The main functions are to realize multi-level perception of data sets through multi-granularity data derivation, and the essence is to normalize or recode the data sets in units of particles, which is equivalent to each particle forming its own system, so that computers may distinguish each data more accurately at each particle level. The subsequent data derivation process is equivalent to expanding the derivative data set corresponding to the granular layer based on the original data set, providing more data and perspectives for the next machine learning.
(1) Intra-granular normalization: the traditional normalization is only a dimensionless method of linear transformation of the data, which may accelerate the gradient descent speed of some machine learning algorithms, but intra-granular normalization is more than that. Intra-granular normalization frames the normalized data range within the particles in different granular layers, and the numerical characteristics in all particles under each granular layer are normalized separately, so as to achieve the data processing purpose of multi-granularity perception of numerical characteristics.
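The intra-granular normalization described above may be sketched as follows, assuming pandas-style data with hypothetical column names; min-max scaling is applied separately inside each particle of a granular layer rather than once over the whole data set:

```python
# Minimal sketch of intra-granular normalization: min-max scaling runs
# inside every particle (group) of a granular layer. Column names are
# illustrative, not taken from the application.
import pandas as pd

def intra_granular_normalize(df, value_col, granule_col):
    """Min-max normalize `value_col` within each particle of the
    granular layer given by `granule_col`."""
    def _minmax(s):
        span = s.max() - s.min()
        return (s - s.min()) / span if span > 0 else s * 0.0
    return df.groupby(granule_col)[value_col].transform(_minmax)

df = pd.DataFrame({
    "account": ["a", "a", "a", "b", "b", "b"],   # granularity characteristic
    "T1": [10.0, 20.0, 30.0, 100.0, 200.0, 300.0],
})
df["T1_m1"] = intra_granular_normalize(df, "T1", "account")
```

Note that the two accounts each map onto the full [0, 1] range, so each particle forms its own reference system, which is the stated purpose of the operation.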
(2) Intra-granular recoding: intra-granular recoding is aimed at the symbolic characteristics in the data set, and it is carried out inside each particle in the different granular layers of the universe. There are two common coding methods in data processing, One-hot Encoding and Label Encoding. One-hot encoding is suitable for non-tree models whose loss functions are sensitive to numerical changes, such as logistic regression and SVM. Label encoding is suitable for tree models whose loss functions are insensitive to numerical changes, such as RF, GBDT, etc. Therefore, it is necessary to judge the type of machine learning model before selecting the coding rule. The data processing objective of intra-granular recoding is to realize multi-granularity perception of symbolic characteristics.
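For the tree-model case, intra-granular recoding with label encoding may be sketched as follows; the same symbol can legitimately receive different codes in different particles, since each particle is encoded on its own. Column names and values are illustrative:

```python
# Sketch of intra-granular recoding for a symbolic characteristic:
# label codes are assigned inside each particle, suiting tree models
# whose loss is insensitive to numeric magnitude. Names are illustrative.
import pandas as pd

def intra_granular_label_encode(df, sym_col, granule_col):
    """Label-encode `sym_col` separately within each particle of the
    granular layer given by `granule_col`."""
    return (df.groupby(granule_col)[sym_col]
              .transform(lambda s: s.astype("category").cat.codes))

df = pd.DataFrame({
    "group": ["g1", "g1", "g1", "g2", "g2"],   # granularity characteristic
    "Tq": ["web", "mail", "web", "ftp", "web"],
})
df["Tq_m1"] = intra_granular_label_encode(df, "Tq", "group")
```

Here "web" is coded 1 inside particle g1 (after "mail") but 1 inside g2 (after "ftp") only by coincidence of alphabetical order; in general the codes are only meaningful together with the particle label, which is why the text insists on retaining the granularity characteristics.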
The detailed flow of the multi-granularity perception data derivation algorithm, i.e. pseudo code, is shown in Algorithm 1:
In one embodiment, the method further includes: taking the weight information as the initial value of the particle swarm algorithm; iterating repeatedly through particle swarm optimization according to the initial value until the end condition is met, and ending the iteration; obtaining the enhanced weight information; inputting a testing data set into a trained base learner to obtain the prediction results of the testing data set; and weighting and integrating the prediction results according to the enhanced weight information.
This embodiment provides an enhancement strategy based on particle swarm optimization. If the accuracy requirement is high but the training time requirement is not strict, the initial weighting strategy may be obtained by the method based on granular MSE optimization and used as the initial input value of particle swarm optimization to speed up the optimization process, and the enhanced weighted integration strategy is obtained after repeated iterations.
Specifically,
SEk,v=(ŷk,v−yv)2
S2 (particle error statistics): calculating the mean square error MSE in the unit of particles to measure the average prediction deviation of particles, where mk,v represents the particle label value of the v-th data in the k-th granular layer, ID(mk,v) represents the numbered set of the data in the k-th granular layer whose particle labels are the same as that of the v-th data, Gik denotes the i-th particle in the k-th granular layer, and |Gik| may be understood as the number of data in the particle, i.e. the granularity. The mean square error of the particle visual field is as follows:
S3 (granular layer error statistics): estimating the prediction deviation of the model from the perspective of granular layer, and also referring to the index of mean square error MSE. If the total data volume of each training set is V, the mean square error MSEk of granular layer perspective may be expressed as follows:
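The error statistics of S2 and S3 above may be sketched together in code: the squared self-prediction errors SE are averaged once per particle and once over the whole granular layer. The helper name and toy values are illustrative assumptions:

```python
# Sketch of S2/S3: mean square error counted per particle (over the
# data sharing a particle label) and per granular layer (over all V
# data), from the squared self-prediction errors SE_{k,v}.
import numpy as np

def particle_and_layer_mse(y_true, y_pred, particle_labels):
    se = (y_pred - y_true) ** 2          # SE_{k,v} for each data point
    layer_mse = se.mean()                # granular layer view (MSE_k)
    particle_mse = {}                    # particle view, one MSE per G_i^k
    for g in np.unique(particle_labels):
        particle_mse[g] = se[particle_labels == g].mean()
    return particle_mse, layer_mse

y = np.array([1.0, 2.0, 3.0, 4.0])
yhat = np.array([1.0, 2.0, 5.0, 4.0])    # one poor prediction in p2
labels = np.array(["p1", "p1", "p2", "p2"])
pmse, lmse = particle_and_layer_mse(y, yhat, labels)
```

The particle view isolates the poorly predicted particle (p2) while the layer view averages the deviation over all V data, matching the two statistics the method counts.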
S4 (MSE-based weight generation strategy): Obviously, the larger the values of MSEk,v and MSEk, the worse the prediction effect of the base learner in the range of particle v or granular layer k. Therefore, the particles and granular layers with large mean square error are given smaller weights, while the particles and granular layers with small values are given larger weights, so as to enhance the overall prediction effect of the model. It should be noted that the first layer is the original data set, and there is no abstract particle structure, so there is no need to calculate particle weights. The granular layer base learner with k≥2 is given the weight w2 as a cognitive whole, while the granular layer base learner with k=1 is given the weight w1 as a whole. Particle weight wk,v and granular layer weight wk are respectively expressed as follows:
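Since the concrete weight expressions of S4 appear in the drawings, the sketch below assumes a simple inverse-MSE normalization that satisfies the stated requirement (the smaller the mean square error, the larger the weight); it is an illustrative assumption, not the patented formula itself:

```python
# Assumed sketch of S4: weights decrease as MSE grows and sum to 1.
# The epsilon guards against division by zero for perfect particles.
import numpy as np

def inverse_mse_weights(mses, eps=1e-8):
    """Map a vector of particle or layer MSE values to normalized
    weights (assumed inverse-error form, not the patented formula)."""
    inv = 1.0 / (np.asarray(mses, dtype=float) + eps)
    return inv / inv.sum()

w = inverse_mse_weights([0.1, 0.2, 0.4])   # toy particle MSE values
```

Any monotone decreasing normalization would serve the same purpose; this one keeps the calculation fast and of low complexity, as the following paragraph notes.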
The weight generation strategy based on MSE has the advantages of fast calculation speed and low calculation complexity. Meanwhile, this embodiment gives another weight enhancement strategy based on particle swarm optimization (as shown in S5), which may improve the prediction effect again, but the calculation complexity increases, so it may be decided whether to adopt this enhancement strategy according to the actual problem, and if not, skip to S6 (weighted integration) directly.
S5 (weight enhancement strategy based on particle swarm optimization): obviously, the weight generation strategy based on MSE in the above S4 is mathematically provable and interpretable, but it may not be able to optimize the ensemble learning model to the most ideal state. Therefore, an optional weight enhancement step is given here, and the particle swarm algorithm is adopted to find the optimal weight distribution strategy of particles and granular layers. In the D-dimensional search space, assuming there are N particles and each particle represents a weight allocation strategy, Xid=(xi1, xi2, . . . , xiD) represents the position of the i-th particle, Vid=(vi1, vi2, . . . , viD) represents the velocity of the i-th particle, the individual optimal solution searched by the i-th particle is Pid,pbest=(pi1, pi2, . . . , piD), the group optimal solution is Pd,gbest=(p1,gbest, p2,gbest, . . . , pD,gbest), fp represents the individual historical optimal fitness value, and fg represents the group historical optimal fitness value.
The core calculation formulas in the whole particle swarm algorithm are the velocity update formula vids+1, the position update formula xids+1 and the fitness function f, which are respectively expressed as follows:
Where s represents the number of iterations, ω is the inertia weight, c1 is the individual learning factor, and c2 is the group learning factor; r1 and r2 are random numbers within [0,1].
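The update rules above may be sketched as a compact particle swarm loop, warm-started from an initial point as the text proposes; the fitness function here is a stand-in quadratic, and the function and parameter names are illustrative:

```python
# Sketch of S5: particle swarm optimization with the stated updates
# v <- w*v + c1*r1*(pbest - x) + c2*r2*(gbest - x), x <- x + v,
# warm-started near x0 (e.g. the MSE-based weights).
import numpy as np

def pso_minimize(f, x0, n_particles=20, iters=50,
                 omega=0.7, c1=1.5, c2=1.5, seed=0):
    rng = np.random.default_rng(seed)
    D = len(x0)
    X = x0 + 0.1 * rng.standard_normal((n_particles, D))  # warm start
    V = np.zeros((n_particles, D))
    P = X.copy()
    fp = np.array([f(x) for x in X])                      # personal bests
    g, fg = P[fp.argmin()].copy(), fp.min()               # group best
    for _ in range(iters):
        r1 = rng.random((n_particles, 1))
        r2 = rng.random((n_particles, 1))
        V = omega * V + c1 * r1 * (P - X) + c2 * r2 * (g - X)
        X = X + V
        fx = np.array([f(x) for x in X])
        better = fx < fp
        P[better], fp[better] = X[better], fx[better]
        if fp.min() < fg:
            g, fg = P[fp.argmin()].copy(), fp.min()
    return g, fg

# Stand-in fitness: distance of the weight vector from a target point.
best, val = pso_minimize(lambda w: ((w - 0.3) ** 2).sum(),
                         np.array([0.5, 0.5]))
```

In the method itself the fitness would instead score the weighted-integration prediction error on the training data, with the MSE-based weights of S4 as x0.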
S6 (weighted integration): after using the trained base learners to predict, the output results are combined with the particle weights to complete the final integration calculation, and the symbol ŷv is used to represent the multi-granularity perception integrated learning result:

ŷv=ŷ1,v·w1+(Σk=2K(ŷk,v·wk,v))·(1−w1).
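A minimal sketch of this integration step follows, assuming the combination described in S4: the layer-1 output carries weight w1 and the particle-weighted outputs of layers k≥2 collectively share the remaining 1−w1. Array shapes and values are illustrative:

```python
# Sketch of S6: weighted integration of K base learners' predictions.
import numpy as np

def weighted_integration(preds, particle_weights, w1):
    """preds: (K, V) predictions of K base learners for V test data;
    particle_weights: (K, V) particle weights w_{k,v} (row 0 unused,
    since granular layer 1 has no abstract particle structure)."""
    layers = (preds[1:] * particle_weights[1:]).sum(axis=0)
    return preds[0] * w1 + layers * (1.0 - w1)

preds = np.array([[1.0, 2.0],    # granular layer 1 (original data)
                  [3.0, 4.0],    # granular layer 2
                  [5.0, 6.0]])   # granular layer 3
pw = np.array([[0.0, 0.0],
               [0.5, 0.5],
               [0.5, 0.5]])
out = weighted_integration(preds, pw, w1=0.5)
```

Because each test datum v looks up the particle weight of the particle it falls into, the test particles must appear among the training particles, which is exactly the reuse premise stated earlier.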
It should be understood that although the steps in the flowchart of
In another embodiment, as shown in
In a specific embodiment, the “data set of users' online behavior” as shown in Table 1 above is used as experimental data.
Scoring rules are based on RMSE Score, and the higher the value, the better the prediction effect of the model:
The experiments run on an Intel i7 CPU with 32 GB of memory, and the programming language is Python 3.8. LightGBM, XGBoost and random forest are added into the Multi-Granularity Perceptual Ensemble Learning (GEL) framework for comparative experiments.
In the experiment, three base learners are used to train and predict six patterns:
The experimental results are shown in
First of all, it can be found that in all the experimental results, the prediction effect is XGBoost>LightGBM>Random Forest.
Secondly, in the drawings, single-layer data (granular layer 1) refers to the original data set, while single-layer data (granular layers 2 and 3) refers to the data sets generated by the multi-granularity perception derivation algorithm. By observing the prediction accuracy on these three data sets using the three kinds of base learners respectively, it can be found that the performance of the learners on single-layer data (granular layer 2) is better than that on single-layer data (granular layer 1), which shows the feasibility of using the multi-granularity perception derivation algorithm for data derivation. However, the performance on single-layer data (granular layer 3) is very poor, which shows that the data sets obtained by the multi-granularity perception derivation algorithm may not all achieve good results on the learners.
Finally, the prediction effects of different integration modes are compared: enhanced weighted GEL based on PSO > optimized weighted GEL based on MSE > data merging mode of each granular layer > original data K-Fold mode > average weighted mode of each granular layer.
On the whole, the effect of the particle-weighted integration strategy in GEL is better than the other integration methods, and the enhancement strategy based on PSO does give GEL a better prediction effect.
In one embodiment, as shown in
The preprocessing module 702 is also used for obtaining the data set of the user's online behavior, and preprocessing the data set; generating the attribute characteristics, the granularity characteristics and the particle label values of data according to attributes in the data structure of the data set to obtain the preprocessed data set; the attributes in the data structure of the data set are an account, a department and a company to which the data belongs; or generating the attribute characteristics, the granularity characteristics and the particle label values of data according to the data set through a hierarchical clustering method to obtain the preprocessed data set.
The data derivation module 704 is also used for inputting the preprocessed data set into a pre-designed multi-granularity perception data derivation algorithm; taking the particle label value as one of the attribute characteristics, discriminating the attribute characteristics of the preprocessed data: if the attribute characteristics are numerical characteristics, the numerical characteristics are normalized within particles, and if the attribute characteristics are symbolic characteristics, the symbolic characteristics are recoded within particles; and obtaining a multi-granularity perception data set.
The data derivation module 704 is also used to divide the multi-granularity perception data set into a multi-granularity training set and a multi-granularity testing set; according to the granularity characteristics, the multi-granularity training set and the multi-granularity testing set are divided according to the granular layer, and the multi-level training data set and the multi-level testing data set are obtained respectively; the training data set and the testing data set constitute a derivative data set.
The base learner training module 706 is also used for enhancing the weight information through the particle swarm algorithm to obtain the enhanced weight information; inputting a testing data set into a trained base learner to obtain prediction results of the testing data set; according to the enhanced weight information, the prediction results are weighted and integrated.
The base learner training module 706 is also used to take the weight information as the initial value of the particle swarm algorithm; iterate repeatedly through particle swarm optimization according to the initial value until the end condition is met, and end the iteration; and the enhanced weight information is obtained.
For the specific definition of the multi-granularity perception integrated learning device, please refer to the definition of the multi-granularity perception integrated learning method above, which is not repeated here. Each module in the multi-granularity perception integrated learning device may be realized in whole or in part by software, hardware and their combinations. The above modules may be embedded in or independent of the processor in the computer equipment in the form of hardware, and may also be stored in the memory in the computer equipment in the form of software, so that the processor may call and execute the operations corresponding to the above modules.
In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in
It can be understood by those skilled in the art that the structure shown in
In one embodiment, a computer device is provided, which includes a memory and a processor, wherein the memory stores a computer program, and when the processor executes the computer program, the steps in the above method embodiment are realized.
In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored, and the computer program, when executed by a processor, realizes the steps in the above method embodiment.
Those skilled in the art can understand that all or part of the processes in the method for realizing the above-mentioned embodiments may be completed by instructing related hardware through a computer program, which may be stored in a non-volatile computer-readable storage medium, and when executed, the computer program may include the processes of the above-mentioned embodiments. Any reference to memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. The non-volatile memory may include read-only memory (ROM), programmable ROM(PROM), electrically programmable ROM(EPROM), electrically erasable programmable ROM(EEPROM) or flash memory. The volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms, such as static RAM(SRAM), dynamic RAM(DRAM), synchronous DRAM(SDRAM), double data rate SDRAM(DDRSDRAM), enhanced SDRAM(ESDRAM), synchronous link DRAM (SLDRAM), rambus direct RAM(RDRAM), direct rambus dynamic RAM(DRDRAM), and rambus dynamic RAM(RDRAM).
The technical characteristics of the above embodiments may be combined at will. In order to make the description concise, not all possible combinations of the technical characteristics in the above embodiments are described. However, as long as there is no contradiction between the combinations of these technical characteristics, they should be considered as the scope recorded in this specification.
The above-mentioned embodiments only express several implementations of the present application, and their descriptions are more specific and detailed, but they cannot be understood as limiting the scope of application patents. It should be pointed out that for those skilled in the art, without departing from the concept of this application, several modifications and improvements may be made, which are within the protection scope of this application. Therefore, the scope of protection of the patent in this application shall be subject to the claims.
Number: 202210590822.4 | Date: May 2022 | Country: CN | Kind: national