The present disclosure relates to the analysis and visualization of data.
In order to provide answers to customer inquiries in an efficient manner, organizations employ knowledge bases to provide self-help to customers. Additionally, organizations employ internal knowledge bases to assist their representatives in answering customer questions. Depending on the size of the customer base, scope of the organization's products, and the size of the organization, the necessary knowledge bases may grow to be very large in size and scope.
Depending on the size of an organization, there may be limited resources available to generate informational items to include in a knowledge base. For example, an organization's subject matter experts may be extremely busy developing new products or enhancing current products, leaving little time to populate a knowledge base. Understanding what specific content is most needed in a knowledge base allows an organization to prioritize their limited resources used to create reusable knowledge.
Overview
Data indicating a plurality of informational items is received at a processor. The data further comprises at least one useful informational item. Also received at the processor is data indicating a plurality of questions, including at least one unanswered question. The processor analyzes the data to determine a number of unanswered questions and a number of useful informational items. The processor generates a visualization comparing the number useful informational items to the number of unanswered questions. The visualization is displayed on a display.
Depicted in
Turning to the contents of table 100, question data 110 is maintained in columns 112, 114 and 116, and informational item data 120 is maintained in columns 122, 124 and 125. Specifically, column 112 contains an indication of the content of a question, such as the full question, or an abbreviation thereof. Alternatively, if the question was originally posted to an internet message board, column 112 may be populated with the title of the posting. Column 114 includes a unique identifier for the question, while column 116 provides an identifier of the topic to which the question relates. For example, the entries contained in rows 130, 131 and 133 are all directed to network address translation, or NAT. Accordingly, the data items in rows 130, 131 and 133 all have the same topic identifier of “1.” While data table 100 only includes a numerical indication of the topic, table 100 may be linked to another data table to retrieve the topic name. Other example implementations may store the topic name with the question data. Though not depicted, additional question data may include subtopic information, or information to identify the source of the question.
According to the example of
Informational item data 120 includes an informational item name 122 as well as a unique identifier 124. Informational item data 120 may also include an ItemLoc value 125 indicating a location for where the informational item may be found. While
By including information item data 120 in the same data entry as question data 110, a link between the question and the informational item is created. For example, because the “Configuring NAT” informational item is included in row 130 along with the “How do I configure NAT” question, this is an indication that the “Configuring NAT” informational item has been linked to the “How do I configure NAT” question. Remaining with the message board example, the “How do I configure NAT” item may represent a question posed on the message board. A NAT subject matter expert may come across the “How do I configure NAT” question while perusing the message board. Knowing where to find the answer to the question, the subject matter expert posts a response which includes a link to the “Configuring NAT” informational item. This link created by the subject matter expert may be automatically created by the message board, or an administrator may record the link as part of their administrative duties.
With reference now made to
Also contained in table 220 are columns analogous to those presented in table 100 of
Data fields like columns 226 and 227 may be considered indications of “usefulness.” As used herein, an indication of usefulness is a property of an informational item that indicates that the item is useful in resolving questions and/or issues directed to the subject matter contained in the informational item beyond the simple existence of the informational item. For example, the “Resolving NAT Issues” informational item located at row 244, while known to exist in the example data, would not be considered as having any indications of usefulness. On the other hand, each of the other entries in table 220 have at least two types of indications of usefulness, as the entries in rows 240-243 have data in both the LinkedQuestion column 226 and the Kudos column 227. Other indications of usefulness may include the number of times the question or informational item has been “viewed” or “read”, or the number of times the question or informational item has been referred to in another document.
Turning now to
By surrounding inner areas 320a-c, which are based on the number of useful informational items, with annular areas 330a-c, which are based on the number of unanswered questions, and comparing the relative sizes of the two areas, a determination can be made whether or not the informational items are adequately addressing the questions being asked for a particular topic. In other words, it can be determined whether or not there is a knowledge gap for a particular category of questions.
Similarly, if there are no informational items that have been indicated as being useful, or no unanswered questions, either because all questions have been answered or no questions have been asked, the absence of areas 320a-c or 330a-c may also be informative to a user. For example, if areas 330a-c are large, and comprise full circles, it can be determined that there are a large number of unanswered questions that are not addressed by any informational items. Similarly, areas 320a-c which are not surrounded by any unanswered questions may lead to further investigation to determine whether all questions have been answered, or if there are informational items unnecessarily created to address questions which are not being asked.
When linked informational items are used to determine the usefulness of an informational item and whether or not a question has been answered, visualizations 300a-c allow for an easy three-way comparison between the total number of questions related to a particular topic, the number of questions related to the particular topic that have been linked to (or answered by) at least one informational item, and the number of questions related to a particular topic which have not been linked to at least one informational item.
According to the example of
Through visualizations 300a-c an evaluation of gaps in a knowledge base can be quickly determined. For example, given the large area 300a relative to the area of visualizations 300b and 300c, it can be easily determined that there is a significantly greater number of questions related to NAT than there are for Crashes and URL Filtering. Furthermore, it can easily be determined that there are significant knowledge gaps in the currently available knowledge base for NAT, while significantly smaller gaps are present for the URL Filtering and Crashes topics. For example, areas 320a and 320b are the same size. Accordingly, if just these questions are considered, an administrator may come away with an erroneous view that the information available for the NAT and Crashes categories are providing similar coverage for the questions posed in their respective categories. It may also be the case that an administrator would view the total number of questions represented by areas 310b and 310c and come to the erroneous conclusion that the URL Filtering topic represented by area 310c is being better served than the Crash topic represented by area 310b due to the smaller number of URL Filtering questions. Yet, by including all three areas 310a-c, 320a-c and 330a-c, for all three topics, an accurate understanding of the relative knowledge gaps can be accurately determined.
By showing all three areas, 310a-c, 320a-c and 330a-c, an administrator can make a quick and accurate determination of the actual gaps in the knowledge base. For example, it can quickly be determined that even though the number of questions linked to an informational item is the same for the NAT and Crash categories, given the greater number of questions related to NAT, and therefore an area 330a greater than 330b, it is obvious that there are more gaps and a greater need to increase the amount of useful content in the NAT category of the knowledge base. Furthermore, even though there are a greater number of questions being directed to the Crashes category than there are questions directed to the URL Filtering category, because the relative sizes of areas 310b, 320b and 330b can be quickly and accurately compared to the relative sizes of 310c, 320c and 330c, it can be quickly and accurately determined that the Crash topic is being serviced as well if not better than the URL Filtering topic.
In other words, by visually displaying to the user both the frequency of a discussion topic (indicated by the size of areas 310a-c), as well as the usefulness of the existing content that matches that topic (indicated by the size of areas 320a-c), the user can understand what specific content is needed in the knowledge base. Specifically, visualizations such as those provided by visualization 300a-c provide a view of what technical subjects are in most need of useful content across different categories. Resources can then be re-focused to target creating content in one category over another, based on need. Additionally, by viewing visualizations 300a-c together, a view into the overall “health” of a knowledge base, and the relative “health” of the different categories can be easily ascertained.
Accordingly, visualizations 300a-c provide an easy way for organizations to understand where gaps exist in their self-help knowledge management systems. The visualizations display existing knowledge linked to customer questions/problems (showing where the existing knowledge articles meet the customer needs), and areas where customer questions have little or no associated knowledge—and thus areas where content must be developed in order to pro-actively answer those customer questions.
Furthermore, the visualizations 300a-c may provide further information through information box 340. Specifically, if the user selects a specific visualization, in this case visualization 300a, information box 340 may be displayed. The selection of visualization 300a may be accomplished by selecting visualization 300a with a predetermined number of mouse clicks, hovering over visualization 300a with cursor 350, or other methods. Once displayed, information box 340 may display detailed information for visualization 300a. For example, information box 340 may display the topic to which the questions represented by visualization 300a are directed, the number of questions related to the topic, the number of questions represented by areas 310a, 320a and 330a, the subtopics covered under the main topic, as well as other information stored about the questions represented by visualization 300a.
According to examples, the visualizations 300a-c are interactive, allowing users to “drill down” to view the details of the data visualized by the different areas 310a-330c. As one example,
Specifically, each of the visualizations 405-420 corresponds to a NAT informational item. Furthermore, characteristics of visualization 405-420 can be used to show additional information concerning each informational item. Take for example visualization 405 which has a significantly larger size than the other visualizations 410-420. The size or area of the visualizations 405-420 may be made in proportion to the number of times the corresponding informational item has been linked to a question. If visualization 405 corresponds to the informational item contained in row 240 of table 220 of
Also included in visual representation 400 are smaller visualizations, such as visualization 425. These smaller visualizations may be displayed to indicate informational items that exist within a knowledge base, but that have not yet been linked to any questions. According to the present example, as these informational items have not been linked to any questions they do not contribute to the size of area 320 in
Turning to visualizations 420a-c, it can be seen that these visualization share a common size, but have a different border thickness. Specifically, border 430b is thicker than border 430a, and border 430c is, in turn, thicker than border 430b. The borders as used in
Additionally, even though the size of visualizations 405 and 415a differ from those of 420a-c, it can be determined that informational items represented by these visualizations have received one kudo a piece, as the thickness of borders 430d and 430e are thicker than border 430a, but thinner than border 430b.
Furthermore, it can be determined that a visualization, such as visualization 425, has received no kudos, and is not linked to any documents. Accordingly, a determination can be made to shift resources from the topic covered by the informational item represented by visualization 425 to other areas in which there are present gaps in the knowledge base.
Of course, depending on user preferences, the thickness of the borders of visualizations 405-420 may be used to indicate the number of times the informational items represented by visualizations 405-420 have been linked to a question. Similarly, the area of the visualizations 405-420 may be used to indicate the number of “kudos” that have been received by each informational item represented by visualizations 405-420.
Additional information can also be expressed through visualizations 405-425. As depicted in
Furthermore, the visualizations 405-425 may provide further information through information box 440. Specifically, if the user selects a specific visualization, in this case visualization 405, information box 440 may be displayed. The selection of visualization 405 may be accomplished by selecting visualization 405 with a predetermined number of mouse clicks, hovering over visualization 405 with cursor 450, or other methods. Once displayed, information box 440 may display detailed information for the informational item associated with visualization 405. For example, information box 440 may display the title of the informational item, the number of questions to which the informational item has been linked, the title and/or text of the questions to which the informational items has been linked, the number of “kudos” the informational has received, and other information, such as additional indications of usefulness and additional subtopics to which the informational item is directed.
Turning now to
According to user preferences, visualization 500 may contain visualizations only for questions that have not been answered, such as through a link an informational item, or as depicted in
In
In visualization 500, the visualizations representing answered and unanswered questions have been grouped together by subtopic. According to other examples, the visualizations may be grouped to mimic their locations in visualizations 300a-c of
The visualizations 510a-e may provide further information through information box 540. Specifically, if the user selects a specific visualization, in this case visualization 510a, information box 540 may be displayed. The selection of visualization 510a may be accomplished by selecting visualization 510a with a predetermined number of mouse clicks, hovering over visualization 510a with cursor 550, or other methods. Once displayed, information box 540 may display detailed information for the question associated with visualization 510a. For example, information box 540 may display the title of the question, the subtopic of the questions, the title or text of the informational item to which the question has been linked, if any, the text of the question, and other information such as additional subtopics to which the question is directed.
Turning to
Each of the links 610 may be selected to show additional information about the links through information box 640. Specifically, selection of link 610b may cause information box 640 to be displayed. The selection of link 610b may be accomplished with a predetermined number of mouse clicks, hovering over link 610b with cursor 650, or other methods. Once displayed, information box 640 may display detailed information for link 610b. For example, information box 640 may display the title of the question, and the title of the informational item for link 610b, as well as other information such as the subtopic to which the question and informational item are directed.
Additionally, in order to better view the information contained in visualization 600, the view may be rotated, scaled, zoomed, or otherwise manipulated so that each visualization and link may be viewed by the user.
With reference now made to
In step 730, the processor analyzes a number of unanswered questions and a number of useful informational items. Step 730 may comprise analyzing a number of the plurality of questions which are not linked to at least one of the plurality of informational items. Because the processor knows the total number of questions, determining the number of unlinked questions may also comprise determining the number of linked questions. In fact, determining the number of unlinked questions may first comprise determining the number of linked questions, and using the difference between the total number of questions and the number of linked questions to determine the number of unlinked questions.
In step 740, the processor generates a visualization of the number useful informational items compared to the number of unanswered questions. Step 740 may comprise generating a visualization of the number of questions not linked to at least one informational item compared to both the total number of questions and the number of questions linked to at least one informational item. This visualization may take the form of one or more of visualizations 300a-c from
Finally, in step 750, the visualization is displayed on a display.
With reference now made to
Memory 830 may comprise read only memory (ROM), random access memory (RAM), magnetic disk storage media devices, optical storage media devices, flash memory devices, electrical, optical, or other physical/tangible (e.g., non-transitory) memory storage devices. The processor 810 is, for example, a microprocessor or microcontroller that executes instructions for the proxy device logic. Thus, in general, the memory 830 may comprise one or more tangible (non-transitory) computer readable storage media (e.g., a memory device) encoded with software comprising computer executable instructions and when the software is executed (by the processor 810), device 800 is operable to perform the operations described herein in connection with
The above description is intended by way of example only.