This application claims the benefit of priority of European Patent Application No. 17382279.2 filed May 16, 2017, the contents of which are incorporated herein by reference in their entirety.
The present invention has its application within the telecommunication sector, more specifically, relates to the analysis of mobile user traffic.
More particularly, the present invention refers to a method for detecting from mobile traffic applications (apps) of mobile user terminals (smartphones, tablets, etc.).
Smartphones offer users the possibility to install on them whatever applications (apps) they decide to (apart from preinstalled apps). These apps belong to categories as entertainment, sports, productivity, travel . . . . Therefore, applications installed in a certain smartphone provide useful information about its user profile, to be understood as the set of habits and preferences of a person.
Those apps require Internet connection for tasks as content update or access authorization. The set of queries or requests sent to retrieve data from Internet is here defined as traffic. Being a protocol a set of predefined rules that defines the way of transferring information, requests information may vary depending on the used protocol. Examples of information appearing in requests are: source IP, request date and time, domain or user agent. The latter concepts, domain and user agent, are defined as follows:
The smartphones have recently experimented an exponential growth in terms of number of users and hours spent with them. In this context, knowing applications used by a customer will allow to precisely define its profile. A correct user profile is the key to success in multiple use-cases like recommender systems, protection against possible security threats (malicious apps) or statistical analysis, as defined as follows:
Mobile Network Operators (MNOs) can obtain information required to define users profile from mobile traffic. A request is generated each time a mobile user interacts with an app on its smartphone. The request passes through the MNO infrastructure, which both stores it in a database as sends it to the Internet. Data stored in the MNO database is simplified information of HTTP and DNS requests. Hypertext Transfer Protocol (HTTP) is a protocol for transferring hypermedia files. Domain Name System (DNS) is a naming system for clients or services connected to the Internet or to a private network. DNS associates a domain name with an internet protocol address (IP). The information stored in the database is the domain and the date and time of the request, i.e. the complete URL is not consulted in any case. In addition, all stored data are anonymized.
There are approaches for analyzing mobile traffic based on domain information. However, relation between domains and applications is not bijective. Unique domains, i.e. domains exclusively accessed by an app, are the less frequent. Instead, there are some domains accessed by many apps. In the latter case, the knowledge of the domain does not univocally define the application.
There are also approaches for analyzing mobile traffic based on user agent. User agent presents two major drawbacks: not all HTTP petitions have user agent value, and applications developers decide the value of user agent field, so they can use another apps' user agent instead of setting their own.
Finally, there is a great variability in the requests of a concrete application. It is due, inter alia, to the different operating systems or mobile user devices (smartphones, tablets). Even different executions of the same application on the same terminal do not maintain the same request order. Some issues related to the requests variability are, to name but a few: request may be cached, latencies between requests vary depending on the mobile use, list of domains consulted by an app may vary between devices, or dynamic content include noise in executions.
Therefore, it is highly desirable to develop a method of apps detection from the mobile traffic which allows the MNOs to get a more precise user profile.
The present invention solves the aforementioned problems and overcomes previously explained state-of-art work limitations by providing a method for detecting applications (apps) downloaded and/or used by a user in his/her mobile terminal (e.g., a smartphone, tablet, etc.). The apps detection is based on the analysis of domain information collected from the mobile user's traffic. More particularly, the method of apps detection uses an analysis of domains by words considering their frequency and so the method is device and request order independent.
An aspect of the present invention refers to a method for detecting applications of mobile users, the applications generating a train of files associated with one or more mobile user terminals and each mobile user terminal engaged in the generation of mobile user traffic, which comprises the following steps:
The present invention has a number of advantages with respect to prior art, which can be summarized as follows:
These and other advantages will be apparent in the light of the detailed description of the invention.
For the purpose of aiding the understanding of the characteristics of the invention, according to a preferred practical embodiment thereof and in order to complement this description, the following FIGURES are attached as an integral part thereof, having an illustrative and non-limiting character:
The matters defined in this detailed description are provided to assist in a comprehensive understanding of the invention. Accordingly, those of ordinary skill in the art will recognize that variation changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, description of well-known functions and elements are omitted for clarity and conciseness.
Of course, the embodiments of the invention can be implemented in a variety of architectural platforms, operating and server systems, devices, systems, or applications. Any particular architectural layout or implementation presented herein is provided for purposes of illustration and comprehension only and is not intended to limit aspects of the invention.
Real time solution pre-calculates the first two stages. For example, in a real-time use case where a user wants to know which apps can be detected from his smartphone, a possible implementation can be described as follows:
Note that in this text, the term “comprises” and its derivations (such as “comprising”, etc.) should not be understood in an excluding sense, that is, these terms should not be interpreted as excluding the possibility that what is described and defined may include further elements, steps, etc.
Number | Date | Country | Kind |
---|---|---|---|
17382279 | May 2017 | EP | regional |
Number | Name | Date | Kind |
---|---|---|---|
9633081 | Procopio | Apr 2017 | B1 |
20090138446 | Kalavade | May 2009 | A1 |
20090282027 | Subotin et al. | Nov 2009 | A1 |
20110029657 | Gueta et al. | Feb 2011 | A1 |
20120147769 | Kalavade | Jun 2012 | A1 |
20150058488 | Backholm | Feb 2015 | A1 |
Entry |
---|
European Search Report and the European Search Opinion dated Oct. 24, 2017 From the European Patent Office Re. Application No. 17382279.2. (8 Pages). |
Li et al. “Distance Weighted Cosine Similarity Measure for Text Classification”, Proceedings of the 14th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2013, Hefei, China, Oct. 20-22, 2013, LNCS 8206: 611-618, Oct. 22, 2013. |
Powers “Evaluation: From Precision, Recall and F-Measure to Roc, Informedness, Markedness & Correlation”, Journal of Machine Learning Technologies, 2(1): 37-63, Available Online Dec. 15, 2011. |
Number | Date | Country | |
---|---|---|---|
20180338010 A1 | Nov 2018 | US |