This application claims priority to India Patent Application No. 4300/CHE/2013, filed Sep. 23, 2013, the disclosure of which is hereby incorporated by reference in its entirety.
The present invention relates to detection of online privacy violation. More particularly, the present invention relates to a system and a method to detect online privacy violation in third party websites.
Privacy problem has escalated in new challenging environment of cloud and big data. Widespread use of social networking sites has increased the opportunity of privacy exposure. In the online world, data has become equivalent to currency of the real world. Search engines, e-commerce sites, online social networks, advertisers, fraudsters, spammers etc. are in thirst of data of users, more specifically Personally Identifiable Information (PII), which can be used for genuine as well as malicious purposes. With the outburst of social apps, mobile apps and cloud based frameworks, assuring privacy on the modern web is a challenging task.
In most cases, sharing of user's data by a website to its partners is subjected to legal terms and conditions of the site. Once data moves from a user's browser to the internet, there is no mechanism to track the data or detect possible privacy violation. To an extent, some applications contribute towards protection of privacy by preventing third party cookies from following users on the web or by providing means to clean public databases via their API's (Application programing interface). However, these techniques do not assist in detecting how user's data has been leaked to the public or which site violated their privacy agreement.
In today's digital era, online presence has become a commonplace. Almost all activities of the real world such as collaboration, shopping, discussions, banking etc., have moved online and many of them require personal information of end users. With privacy being a clear threat, it is only recently that companies started focusing on privacy preserving applications. Since online privacy failures can occur at several places right from visible IP address, unencrypted traffic, insecure applications, online social networks etc., there are technologies which attempt to protect privacy in each of these specific areas, which are different from the present disclosure.
A service called BurnNote™ allows users to send self-destructing data to other users online, so that sensitive information is not stored in emails or leaked to the web. Tor browser bundle allows users to browse the web anonymously by encrypting network traffic and routing through complex network nodes. There are (virtual private network) VPN clients which provide anonymous browsing capabilities, suitable for connecting to unsecured Wi-Fi hotspots.
There are several applications for mobile devices which analyze the permissions required by each of the installed applications and report if there is any escalation of permissions in each case.
There are certain browser extensions which route information through proxy servers so that third party cookies (which track users) can be blocked. There are browser extensions designed to help users in understanding and taking control of the data they share on specific sites such as Facebook™, Twitter™, and Gmail™ etc. Also, there are tools which help users in understanding who can see their profiles on social networks like Facebook™ and what data will be visible to the public. Though not a privacy protection feature, web browsers store a history of sites visited by users, sometimes along with form data, and this may be used for manual inspection of visited sites.
The drawbacks of the above mentioned prior art is that there are no systems or methods to track information submitted on webpages and check against it later to detect privacy violations. Existing technologies may have feature to store submitted pages but data field's storage cannot be selective or personalized. Existing methods of page information storing even retains the submitted data. Storage of this information or sharing it to third party could lead to privacy violations.
In the present era of web based services, users provide personal information to many websites. It's practically challenging to keep track of these sites and data fields submitted to them manually.
The browser's native history maintenance technique resembles the functionality of privacy tracker database of the present disclosure to some extent. However, it is designed only to assist users in navigation and not as a privacy tracking or privacy violation detecting mechanism. Even otherwise, it has shortcomings such as: The “Clear History” option in browsers completely erases all browsing history of users. Reinstalling browsers will erase browsing history information. Since history data is stored locally in the machine, it is not available when users change their machines or it cannot be segregated when multiple users use the same machine.
Some existing applications contribute towards privacy protection by preventing third party cookies from following users on the web or by providing means to clean public databases via their APIs. Techniques such as self-destructing emails, anonymous browsing, data encryption, analysis of privileges in mobile devices etc. contribute towards privacy preservation. However, these techniques neither assist in detecting how user's data leaked to the public nor inform which site violated their privacy agreement.
The existing methods or products are designed to work specific to each context. Some products target removal of third party cookies which keep tracking users on the web. Some products track privacy breaches specific to Facebook™ while some other products target data queried only by search engines. Most of these solutions are tightly coupled with the configurations specific only to a certain set of popular websites or channels. They do not answer important questions such as how the data got leaked to the public or which party has violated user's privacy by sharing data with third parties.
The present disclosure addresses the problem of detection of privacy violation on the internet. It alerts the user when such a violation takes place so that the user can take suitable actions.
The present system does not have any tightly coupled configuration with any websites.
Also restriction to specific sites with respect to detecting privacy violations is not provided in the present system. Since the privacy information database is based on cloud, it is highly scalable and does not have any limit on the amount of data that can be processed. With respect to all these factors, the present system improves on existing techniques.
Thus there is a need to provide a system and a method that tracks privacy related information and browsing history of the user, while assisting the user in detecting possible privacy violation. The method of creating user privacy profile, collecting information for submitting data to website and storing in specific format on the cloud and feature of checking the possible privacy violation by submitting data field and matching it with browsing history is found in the present disclosure. Privacy profile update, privacy violations detection and browsing history update is hosted as cloud service in the present system and the user can access this service without dependency on specific browser or machine or location. Thus the present system benefits the user in legal process wherever privacy laws are applicable. User can technically establish who could be the potential privacy law violators.
Therefore the present system helps in creating privacy fingerprint for user by collecting details of web based activity where personal information is shared with third parties. There are possibilities that data collecting agencies or enterprise may share user data to third party for their business benefit, without taking users consensus. If user notice, such information is used by third party and represented on its website, user can identify data collector who might have involved in privacy violation.
Thus it will increase user confidence in services offered over web and help in businesses which collect information as part of their business process to offer more user friendly and trustworthy services.
The present disclosure provides flexible implementation of the system. Privacy tracking can be provided as a service on Cloud, where user can access the service from any browser, machine, location. All details related to browsing history, privacy profile are stored in cloud environment.
In present disclosure, actual data values are not stored or shared with cloud service provider. Only the data field name along with some other browsing details is stored in browsing history database.
According to one of the aspect of the present invention there is provided a method to detect online privacy violation. The method comprising steps of embedding a tracker into a web browser to open at least one data consumer website or at least one third party website wherein a user submits at least one data value into their corresponding data field in a data consumer website.
Generating one or more profile using the tracker, wherein the profile assists the user to select one or more data fields as per the user preferences; capturing the user selected one or more data fields and their corresponding plurality of browsing history using the tracker; storing the profile and the plurality of browsing history into at least one database through the tracker; triggering of the tracker for detecting online privacy violation in a third party website; and submitting at least one data field by the user into at least one input field as provided through the tracker to detect online privacy violation for the submitted data field.
The triggering of at least one processor using the tracker to compare the submitted data field with the data field as stored in the database and matching of the stored data field with its corresponding browsing history to indicate one or more websites with their related timestamps that have assisted in violating privacy of the user by leaking the submitted data field to the third party website.
According to another aspect of the present invention there is provided a system to detect online privacy violation. The system comprising a browser to open at least one data consumer website or at least one third party website; wherein in the consumer data website the user submits at least one data value into their corresponding data field; a tracker embedded into a browser, the tracker assists to generate one or more profiles for the user, the profile enables user to select one or more data fields as established on user preferences; wherein the tracker captures the user selected data field and their corresponding plurality of browsing history; and a privacy system operatively connected with the tracker, the privacy system comprises at least one processor and at least one database.
The database stores the profile and the plurality of browsing history; wherein the tracker is triggered through the user for detecting online privacy violation in a third party website, wherein the tracker enables the user to submit at least one data field into at least one input field as provided through the tracker to detect online privacy violation for the submitted data field; wherein the processor being triggered by the tracker to compare the submitted data field with the stored data field in the database and matching of the stored data field with its corresponding browsing history to indicate one or more websites with their related timestamps that have assisted in violating privacy of the user by leaking the submitted data field to the third party.
The present disclosure proposes to track and detect privacy violation on the web. An advisory system is developed which assists users of the web to maintain their own record of data they share with each website. The architecture of the present system is explained in the adjoined
The browser 100 receives URL as input and access the information resource available on web. The end users access certain webpage through browser and submit their data or perform various activities, which could lead to generation of user related private data. This data is submitted to web sites of information collecting entity to meet users or business interest. Privacy tracker service plugin 200 is one of the components which is embedded as browser component/plugin and gets activated when user opens any webpage to submit data to data consumer website (DCWeb), 400 or perform some activity on the website. Privacy tracker service plugin 200 provides option to user to create his personalized privacy profile.
Data consumer web site 400 is owned by business or enterprise or organization or individuals who provide the option for user to submit their details. These details are submitted as forms and used by enterprise to process this information for business or user interest.
User is able to define personal data fields which are sensitive and private. User is also able to define type of data and activity on website that should be logged in proposed system when users submit the data or perform activity. This customized information is captured by PTSP 200 and stored in privacy tracker database (PTD, 320). Whenever user submits any form on website, PTSP 200 retrieves the personalized profile and it identifies user defined personal data fields for which details are being submitted. PTSP 200 stores this information along with website details, time stamp in PTD 320 as privacy data history.
Further the PTSP 200 has another role when users access some website and identifies or suspects some sensitive personal data pertaining to him. The user then submits the identified data field to PTSP 200, which checks user's privacy profile history stored in PTD 320. It alerts for privacy violation if website reflecting the data value is never been provided with specific information. During a browsing session, if the system encounters user's data which is not in its database, the privacy tracker service plugin 200 alerts the user about the possible sites which have violated user's privacy and suggests suitable actions.
Privacy tracker database 320 stores user privacy profile and privacy data history in is database 320.
In privacy check processor, 310 whenever user queries about some suspected data field, PCP 310 retrieves the user profile and also checks websites where sensitive data was submitted. Based on search result, PCP 310 provides result to user through PTSP 200.
Online privacy detection module 202 enables user to verify any potential privacy violation, while accessing third party website. In case user related personal information is displayed on third party webpage, user can probe the privacy information system to check from users browsing history stored in privacy tracker database 320. This module also raises alert after checking and display information about potential website that might have compromised user's privacy details. This module can be manual or automated based on implementation.
Connectivity module 203 enables connectivity to various other components to send and retrieve the information.
The privacy profile management module 204 enables user to create his/her personal profile/preference/actions related to privacy. Users can add, modify, and delete the profile as per their requirement. Users can define data fields which are sensitive and contain private data.
The privacy tracker service plugin generates profiles to assist the user to select data fields as per his preferences. The privacy tracker service plugin captures only the user selected data fields and the corresponding browsing history of the user selected data fields at 404. These user selected data fields and their corresponding browsing history is stored in privacy tracker database at 405.
Thus when a website presents a form, user fills it with his details at 403. The privacy tracker service plugin keeps track of the user selected data fields and their corresponding browsing history at 404 and saves it into privacy tracker database at 405, which is a sub-system of our cloud based privacy information system. The schema of our privacy tracker database is outlined in the table in
When the user browses another site i.e. the third party site at 501 which happens to display the information entered by the user in one of his previous sessions at 502 and the user suspects for some privacy violation in the website, then the privacy tracker service plugin is triggered for the detection of online privacy violation in the third party website at 503.
This triggering of the privacy tracker service plugin allows the user to enter data field's name which the user wants to have a violation check for in the given input field at 504.
Then the privacy tracker service plugin sends the page to the privacy information processor of the privacy information system i.e. the privacy information processor is triggered by the privacy tracker service plugin to check for online privacy violation of the user entered data field in the given input field at 504.
The processor compares the submitted data field with the data field as stored in the database and then the processor matches the stored data field of the data base with its corresponding browsing history to indicate one or more websites with their related timestamps that have assisted in violating privacy of the user by leaking the submitted data field to the third party website at 505.
The PCP returns privacy violation detection result to PTSP at 506.
The privacy violation check result with one or more websites that have violated privacy of the user is displayed to the user by the browser though the PTSP . . . . The user is alerted which site in its database violated privacy of the user by leaking the information to a third party. In this way, the system assists users in tracking their information and thereby detecting privacy violation and exposure of their data on the web at 507.
In
It is assumed that user has defined “SSN Number” as sensitive field in already existing privacy profile. When user submits the form data field, the privacy tracker service plugin (PTSP) intercepts the submitted web form, checks with the privacy profile and captures information about only those sensitive data fields from form, which are already defined in profile. Therefore, in this case SSN number field name is captured along with details like user id, URL where data is being submitted and time stamp of submission. This information is stored in the database by PTSP and retrieved later when privacy violation check need to be performed. Some data collecting website may provide the privacy term and agreements before data submission. PTSP captures the agreement and stores in users privacy history database.
The approach of storing data filed name and other related information at tracking service provider that is hosted in cloud ensures that user's sensitive data does not get compromised, as data field values which are sensitive are never captured and shared with hosting services.
Further concept of personalizing privacy profile ensures that only sensitive information defined by user is captured during data submission. Provision of keeping only privacy profile related information in database increase the performance of query operation during privacy violation check and reduce the storage size also.
In this example, user when visit the sample1.com website and find his SSN number (XYZ). Suspecting possible privacy violation, he can enter “SSN Number” using privacy check feature and perform a search to find the list of various web sites, where he had provided SSN number. User may also see agreements (optional) from database if it was captured bt PTSP during the web form submission.
Number | Date | Country | Kind |
---|---|---|---|
4300/CHE/2013 | Sep 2013 | IN | national |
Number | Name | Date | Kind |
---|---|---|---|
8261362 | Goodwin et al. | Sep 2012 | B2 |
9495714 | Bush | Nov 2016 | B2 |
9703988 | Sudbury | Jul 2017 | B1 |
9720555 | Sorden | Aug 2017 | B2 |
20090018850 | Abhyanker | Jan 2009 | A1 |
20090287706 | Bourges-Waldegg | Nov 2009 | A1 |
20110087753 | Yu | Apr 2011 | A1 |
20110276396 | Rathod | Nov 2011 | A1 |
20120291144 | Chen et al. | Nov 2012 | A1 |
20120323794 | Livshits | Dec 2012 | A1 |
20130091452 | Sorden | Apr 2013 | A1 |
20130297346 | Kulkarni | Nov 2013 | A1 |
20140129779 | Frachtenberg | May 2014 | A1 |
20150127628 | Rathod | May 2015 | A1 |
20170207969 | Lehtovirta | Jul 2017 | A1 |
Number | Date | Country |
---|---|---|
2010011449 | Jan 2010 | WO |
2011112492 | Sep 2011 | WO |
2013055564 | Apr 2013 | WO |
Entry |
---|
Wills et al, “Understanding What They Do with What They Know (Short Paper)”, Oct. 15, 2012, WPES'12, AMC, p. 13-18. |
Jang et al, “An Empirical Study of Privacy-Violating Information Flows in JavaScript Web Applications”, Oct. 2010, CCS'10, ACM, p. 270-283. |
Bapat et al, “Restraining Add-On's Behavior in Private Browsing”, Apr. 2012, SecurIT'12, ACM, p. 194-199. |
Hofinger et al, “When Browsing Leaves Footprints”, Apr. 10, 2012, MPM'12, ACM, p. 1-6. |
Sanket Baviskar et al, “Protection of Web User's Privacy by Securing Browser from Web Privacy Attacks” , Department of Computer Science and Engineering, National Institute of Technology, IJCTA, Jul.-Aug. 2011. |
Miguel A. Monjas et al.“Privacy Delegate: a browser-based tool for privacy self-management in social network”, CDTI, Ministry of Science and Innovation of Spain. |
Number | Date | Country | |
---|---|---|---|
20150088777 A1 | Mar 2015 | US |