The invention relates to a data visualisation system and method.
The low cost of data storage hardware has led to the collection of large volumes of data. Broadcasters who operate mass media for example television, radio and newspapers generate and collect large volumes of data during the course of their business. To compete effectively, it is necessary for such broadcasters to be able to identify and use information hidden in the collected data. It would be helpful for a broadcaster to know its audience better to enable enhanced placement of advertisements. Better knowledge of the viewing trends and viewer profiles would enable the broadcaster to charge increased fees for advertisements targeted at certain audiences. The task of identifying this hidden information has proved very difficult for these broadcasters.
Traditionally, analysis of data has been achieved by running a query on a set of data records stored in a database. The merchant or other party first creates a hypothesis, converts this hypothesis to a query, runs the query on the database, and interprets the results obtained with resect to the original hypothesis.
One disadvantage of this verification driven hypothesis approach is that the merchant must form the desired hypothesis in advance. This is merely confirming what the merchant already suspects and does not provide the merchant with information which may be unexpected. Another disadvantage is that the merchant needs to have available the technical knowledge to formulate the appropriate queries.
Our patent specification WO 00/77682 to Compudigm International Limited entitled “Data Visualisation System and Method” describes a system and method of presenting visualisations in an intuitive manner for various types of merchants. It would be particularly desirable to include data such as the time of various transactions and interactions to enhance the effectiveness and visual appeal of these visualisations.
In broad terms in one form the invention comprises a data visualisation system comprising a data value memory in which is maintained a finite set of data values, each data value having a corresponding time value; a display arranged to display a representation of each data value centered on respective data points, the position of each data point based on the time value of each data point; and a contour generator arranged to generate and display a contoured representation around each data point such that each data point is displayed as a local maximum.
In another form in broad terms the invention comprises a data visualisation system comprising a data value memory in which is maintained a finite set of data values, each data value having a corresponding time value; a display arranged to display a representation of each data value centered on respective data points, the position of each data point based on the time value of each data point; and a contour generator arranged to generate and display one or more contour lines around each data point, each contour line representing data values which are less than the data value of the data point around which the contour line is displayed.
In another preferred form the invention comprises a method of data visualisation comprising the steps of maintaining a data value memory a finite set of data values, each data value having a corresponding time value; displaying a representation of each data value centered on respective data points, the position of each data point based on the time value of each data point; and generating and displaying a contoured representation around each data point such that each data point is displayed as a local maximum.
In another preferred form the invention comprises a method of data visualisation comprising the steps of maintaining in a data value memory a finite set of data values, each data value having a corresponding time value; displaying a representation of each data value centered on respective data points, the position of each data point based on the time value of each data point; and generating and displaying one or more contour lines around each data point, each contour line representing data values which are less than the data value of the data point around which the contour line is displayed.
In a further preferred form the invention comprises a data visualisation computer program which enables maintaining in a data value memory a finite set of data values, each data value having a corresponding time value; displaying a representation of each data value centered on respective data points, the position of each data point based on the time value of each data point, and generating and displaying a contoured representation around each data point such that each data point is displayed as a local maximum.
In yet another preferred form the invention comprises a data visualisation computer program which enables maintaining in a data value memory a finite set of data values, each data value having a corresponding time value; displaying a representation of each data value centered on respective data points, the position of each data point based on the time value of each data point; and generating and displaying one or more contour lines around each data point, each contour line representing data values which are less than the data value of the data point around which the contour line is displayed.
Preferred forms of the data visualisation system and method will now be described with reference to the accompanying figures in which:
Clients 20A and 20B, for example, are connected to a network 22, such as a local area network or LAN. The network 22 could be connected to a suitable network server 24 and communicate with the workstation as shown. Client 20C is shown connected directly to the workstation using for example a serial connection or WAP connection. Clients 20D, 20E and 20F are shown connected to the workstation through the Internet 26. Client 20D is shown as connected to the Internet 26 with a dial-up connection and clients 20E and 20F are shown connected to a network 28 such as a local area network or LAN, with the network 28 connected to a suitable network server 30.
The preferred system 10 further comprises a data repository 40, for example a data warehouse maintained in a memory. It is envisaged that the data repository may alternatively comprise a single database, a collection of databases, or a datamart. The preferred data repository 40 includes data from a variety of sources. The data repository may include, for example, interaction data 42 representing interactions between customers and merchants, as will be more particularly described below. The data repository may also include data from other sources for example census data 44, data from merchant customer databases 46, promotion data 48 held by a merchant or other organisation, and/or survey data 50 resulting from customer surveys.
The personal computer or workstation 12 preferably operates under the control of appropriate operating and application software, and has a data memory 60 connected to a server 62. The invention includes a retrieval device 64 which in one form comprises a software implemented query enabling retrieval of data from the data repository 40. The data retrieved using the retrieval device 64 is processed with the server 62. A contour generator 66 which in one form comprises a computer-implemented software program generates a contoured representation or series of contour lines in order to display a representation of the data on a client workstation 20, as will be described below.
Typically a merchant will transmit programmes, information and/or data to one or more customers over mass media, for example television, radio or newspapers. The merchant, for example, may operate a private television network in which customers pay the merchant to receive a transmission. In one example, the merchant may provide the customer with identification apparatus for example a decoder which enables a particular customer to receive a data transmission. The apparatus records the customer identifier and the channel and time during which transmission is requested.
As the customer interacts with the merchant in this way, the interaction generates interaction data which is then migrated to the data repository 40. The interaction data could be stored in a number of records in a relational database. Each record may include a merchant identifier used to identify a particular merchant and a date and/or time at which the interaction between the customer and merchant took place.
The record could also include a customer identifier. The decoder preferably has stored on it a customer identifier and may have stored other data, for example residential address and family size of the customer. Such data is typically stored in a customer database 46 and could be migrated to the data repository 40.
The merchant may also provide selected customers with apparatus which records viewing habits. The viewing habits of this sample group of customers could be recorded and stored in a database, together with appropriate demographic data. This sample customer database could be made available to other merchants in order to estimate population viewing habits.
The interaction data is migrated to the data repository 40, generally by way of daily updates or in real time. It is advantageous to cleanse, catalogue and validate the interaction data during migration of the data to the data repository, and this task could be performed by either the merchant or by a third party. Once stored in the data repository 40, the data could be linked to other sources of data for subsequent retrieval, for example the census data 44, data from the merchant customer database 46, promotion data 48 and/or survey data 50 held by the merchant.
The data repository 40 could be maintained by a merchant or alternatively could be maintained by a third party. Updates to the data repository could be carried out by the merchant directly, or alternatively the merchant could provide batched data to a third party for updating the data. Alternatively, a third party could be entrusted with the task of collecting the interaction data and migrating the data to the data repository.
Preferred forms of the invention are arranged to display various key performance indicators or KPIs for a merchant. The KPI, for example the number of viewers, is preferably graphically represented adjacent or near to a representation of the channel and time segment viewed by the customer. There are a finite number of time slots and channels in mass media and the individual audiences for each time slot represent a finite set of data values. These data values are preferably graphically illustrated as data points on the representation.
The preferred representation is coloured and the value of revenue of each machine is illustrated by representing the corresponding data points in the appropriate colour to represent the correct KPI value for each time segment.
The areas of the representation around each data point are shown as a series of contour lines. The nature of the contours for each data point are preferably represented to gradually drop off or fall away from each data point. Each data point could be represented by x and y co-ordinates indicating the relevant position of each data point in the representation. Each data point could also have a z value representing the height or magnitude of the data point. This z value could indicate, for example, the revenue from viewers, number of viewers or other data value in a particular data point. The contour lines represent z data values which are less than the data value of the data point around which the contour lines are displayed. In this way, each data value is centered on a data point. Each data point is displayed as a local maximum as surrounding values drop off or fall away around each point.
This contoured method of representing data values is more particularly described in our patent specification WO 00/77862 to Compudigm International Limited filed on 14 Jun. 2000 entitled “Data visualisation system and method” which is incorporated by reference. The data value of each data point represents the apex of a bell-shaped curve. As x and y values in the representation 100 are increased or decreased, the z value at the new position in the representation will change.
As shown at 102, a set of data values is constructed from the retrieved data. This set of data values could include, for example, the number of viewers watching one channel during a one hour time segment. It is envisaged that the set of data values could be stored in data value memory 60 to increase efficiency of the system as indicated at 104. The data value memory could comprise volatile main RAM or non-volatile mass storage of the workstation on which the invention 12 is implemented.
Referring to step 106, the set of data values are retrieved from the volatile or non-volatile data memory and as shown at 108, a set of data points is constructed to represent the data values. Appropriate x and y values are generated for each data point to space the data points over a generated representation. Each data point has a corresponding time value and the x and y values are based on this time value. Z values for each data point are also calculated based on individual data values.
Referring to step 110, a contoured representation of the data values is generated by the contour generator 66 and displayed on a client workstation 20. The individual drop-off for each data point is calculated and displayed in the appropriate colour and shading corresponding to the z value at each point.
It is envisaged that the invention generate individual displays of contoured representations. It is also envisaged that the invention generate animated sequences of representations by generating two or more “still” representations at various time intervals and superimposing successive representations over earlier representations to generate an animated sequence m which a succession of data points appear to move on a display.
As indicated at 112, where such an animated sequence is required, further representations will be needed and if the necessary data is obtainable from the data memory as indicated at 114, it is retrieved from the data memory as indicated at 106, otherwise further data is retrieved from the data repository at 100.
Referring to
The representation could include a series of time segments, one of which is indicated at 206. These time segments could represent, for example, fifteen-minute time segments of data transmission. Each segment is preferably colour-coded and the same colour could be used for similar categories which appear at different times during the day, for example breakfast news and prime time news.
The representation 200 preferably also includes one or more contoured representations, for example the contoured representation indicated at 208. Each contoured representation comprises a series of data points, each data point centered on a time segment. Data values, for example the number of customers watching the channel during that time segment, are shown as a contoured representation around the time segment.
The representation of
Where the representation is animated, the segments over which representations are contoured could be shifted left or right to indicate progressive or historic time segments. In the case of
It is assumed that the time period of interest commences at point tp and ends at point t0. Each point travels from a leftmost minimum value Xmin to a rightmost or maximum value Xmax.
The display could calculate the position of each data point as a function of time t as follows:
Where an animated sequence is presented to a user, the data points could be repositioned in a circular arrangement around the circle to represent changing time periods, with representations centred on the successive data points. This would have the effect of the contoured representation proceeding around the clock to represent the passage of time. The size and colour of the contours would represent the characteristics or KPIs demonstrated by the system.
It is assumed that the time period of interest commences at tp and ends at t0, and that t0−tp<T. Each circle 302 comprises a series of data points around a “clock face” where one revolution represents time T. The current time t moves around the clock representing an angle calculated by the following function:
The x and y values for each data point are calculated based on the angle calculated from equation (2). If t0−tp>T, or for a continuously changing system, previous values are overwritten during each revolution of the circle 302.
The areas defined by the circles and radial lines could be shaded a different colour to represent for example different numbers of viewers or different types of viewer profile, for example men aged 25 to 40.
In one form the rings could represent the current date and time, the same time one week ago and the same time two weeks ago. Alternatively, the rings could represent different types of programmes, for examples movies, sport and news. As a further example, each ring could represent different viewer profiles for example viewers aged 50 years and over, viewers aged 25 to 29 and viewers aged less than 25.
The value r0 is the maximum radius of the spiral. The position of a data point at time t in polar coordinates is:
p(t)=(r(t),theta(t)) (3)
The radius for a time period in a previous time period rT could be for example rT=0.8 r0. The position of the data point for time t in polar coordinates, is found by equation (3) above.
Theta (t) is calculated by the following equation:
The radius at time t, r(t) is calculated by the following equation:
It is envisaged that, once the data points are generated, these data points can be contoured as described above.
Referring to
The invention displays data during a time period from tp to t0, where t0−tp>T. The invention enables a time t to be compared with t−T, t−2T, t−3T and so on. The value r0 is the maximum radius of the spiral and rmin is the minimum viewable radius of the spiral, approximately
The position of the data point for time t in polar coordinates, is p(t) calculated from equation (3) above and theta (t) is calculated from equation (4) above.
The radius at time t, r(t) is calculated by the following equation:
The invention further provides a method of identifying similar periods of activity and either tightening or loosening time periods represented in a 360° rotation of the spiral so that these are aligned. The spiral of
It is envisaged that the spiral of
It is envisaged that the flexible displays of the invention based on the time values for individual data points could be applied to many types of interactions. A merchant for example may operate commercial premises or stores from which a customer purchases goods or services. The merchant could operate a petrol station in one or more geographic locations. The merchant could alternatively operate a wagering or betting service, or operate a casino or other gaming facility in which several gaming machines and stations are positioned in one or more common rooms at a common venue. The merchant could also operate a warehouse facility, manufacturing facility, car parking premises, telecommunications network or website. The merchant could also offer a range of financial or insurance services. The merchant could operate a warehouse or could provide telecommunications services.
The foregoing describes the invention including preferred forms thereof. Alterations and modifications as will be obvious to those skilled in the art are intended to be incorporated within the scope hereof as defined by the accompanying claims.
Number | Date | Country | Kind |
---|---|---|---|
505591 | Jul 2000 | NZ | national |
511734 | May 2001 | NZ | national |
Number | Date | Country | |
---|---|---|---|
Parent | 10328718 | Dec 2002 | US |
Child | 10637800 | Aug 2003 | US |
Parent | PCT/NZ01/00135 | Jul 2001 | US |
Child | 10328718 | Dec 2002 | US |