Patient data mining, presentation, exploration, and verification

Information

  • Patent Grant
  • 8214225
  • Patent Number
    8,214,225
  • Date Filed
    Monday, November 4, 2002
  • Date Issued
    Tuesday, July 3, 2012
Abstract
The present invention provides a graphical user interface for presentation, exploration and verification of patient information. In various embodiments, a method is provided for browsing mined patient information. The method includes selecting patient information to view, at least some of the patient information being probabilistic, presenting the selected patient information on a screen, the selected patient information including links to related information. The selected patient information may include elements, factoids, and/or conclusions. The selected patient information may include an element linked to unstructured information. For example, an element linked to a note with highlighted information may be presented. Additionally, the unstructured information may include medical images and waveform information.
Description
FIELD OF THE INVENTION

The present invention relates to organization and review of data, and, more particularly to a graphical user interface for presentation, exploration and verification of patient information.


BACKGROUND OF THE INVENTION

The information environment faced by physicians has undergone significant changes. There is much more information available, in more formats than ever before, competing for the limited time of physicians. Although the information age is slowly transforming this landscape, it has not yet delivered tools that can alleviate the information overload faced by physicians.


Currently, many health care organizations have started to migrate toward environments where most aspects of patient care management are automated. However, health care organizations with such information management systems have tended to maintain information in a myriad of unstructured and structured data sources. It may still be necessary to access numerous different data sources, each with its own peculiar format.


In view of the above, it would be desirable and highly advantageous to provide new graphical tools for presentation, exploration and verification of patient information.


SUMMARY OF THE INVENTION

The present invention provides a graphical user interface for presentation, exploration and verification of patient information.


In various embodiments of the present invention, a method is provided for browsing mined patient information. The method includes selecting patient information to view, at least some of the patient information being probabilistic, presenting the selected patient information on a screen, the selected patient information including links to related information. The selected patient information may include raw information extracted from various data sources for the patient (hereinafter referred to as ‘elements’) or conclusions drawn therefrom. This information may be derived from various data sources.


The selected patient information may include an element linked to unstructured information. For example, an element linked to a note with highlighted information may be presented. The highlighted information may refer to information used to derive the element. Additionally, the unstructured information may include medical images and waveform information.


The selected patient information may also be derived from structured data sources, such as a database table.


The selected patient information may include a document with links to elements associated with the document.


The selected patient information may include patient summary information.


The patient information presented to a particular user may depend on the identity or role of the user. For instance, a physician may be interested only in a high-level view of the disease (at least initially) and be presented with the most relevant conclusions drawn from the entire patient record.


Another option is to display all the patient information (every element and derived conclusion) but to sort this list in order of decreasing relevance to the disease.


These and other aspects, features and advantages of the present invention will become apparent from the following detailed description of preferred embodiments, which is to be read in connection with the accompanying drawings.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows an exemplary data mining framework for mining structured clinical information;



FIG. 2 shows an exemplary main browser screen;



FIG. 3 shows an exemplary options screen;



FIG. 4 shows an exemplary summary frame screen;



FIGS. 5 and 6 show exemplary verification screens;



FIGS. 7 and 8 show exemplary exploration screens;



FIGS. 9 and 10 show exemplary results of extraction from a structured data source; and



FIGS. 11 to 13 show exemplary presentation of patient summary information.





DESCRIPTION OF PREFERRED EMBODIMENTS


FIG. 1 illustrates an exemplary data mining framework as disclosed in “Patient Data Mining,” by Rao et al., copending U.S. patent application Ser. No. 10/287,055, published as 2003-012045, filed herewith, which is incorporated by reference herein in its entirety.


Detailed knowledge regarding the domain of interest, such as a disease of interest, is used. This domain knowledge base can come in two forms: it can be encoded as an input to the system, or as programs that produce information that can be understood by the system. The part of the domain knowledge base that is input to the present form of the system may also be learned from data.


Domain-specific knowledge for mining the data sources may include institution-specific domain knowledge. For example, this may include information about the data available at a particular hospital, document structures at a hospital, policies of a hospital, guidelines of a hospital, and any variations of a hospital.


The domain-specific knowledge may also include disease-specific domain knowledge. For example, the disease-specific domain knowledge may include various factors that influence risk of a disease, disease progression information, complications information, outcomes and variables related to a disease, measurements related to a disease, and policies and guidelines established by medical bodies.


An extraction component takes information from a computerized patient record (CPR) to produce probabilistic assertions (elements) about the patient that are relevant to an instant in time or time period. This process is carried out with the guidance of the domain knowledge that is contained in the domain knowledge base. The domain knowledge required for extraction is generally specific to each source.


Extraction from a text source may be carried out by phrase spotting, which requires a list of rules that specify the phrases of interest and the inferences that can be drawn therefrom. For example, if there is a statement in a doctor's note with the words “There is evidence of metastatic cancer in the liver,” then, in order to infer from this sentence that the patient has cancer, a rule is needed that directs the system to look for the phrase “metastatic cancer,” and, if it is found, to assert that the patient has cancer with a high degree of confidence (which, in the present embodiment, translates to generating an element with name “Cancer”, value “True”, and confidence 0.9).
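By way of illustration only, the phrase-spotting rule described above can be sketched as follows. The `Element` and `PhraseRule` classes, the `RULES` table, and the `spot_phrases` function are hypothetical names introduced for this sketch; only the phrase “metastatic cancer” and the resulting element (name “Cancer”, value “True”, confidence 0.9) come from the description.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Element:
    """A probabilistic assertion about the patient (hypothetical structure)."""
    name: str
    value: str
    confidence: float


@dataclass(frozen=True)
class PhraseRule:
    """Maps a phrase of interest to the element asserted when it is found."""
    phrase: str
    element: Element


# Hypothetical rule table; the single rule mirrors the example in the text.
RULES = [PhraseRule("metastatic cancer", Element("Cancer", "True", 0.9))]


def spot_phrases(text: str, rules: List[PhraseRule] = RULES) -> List[Element]:
    """Return the element of every rule whose phrase occurs in the text."""
    lowered = text.lower()
    return [rule.element for rule in rules if rule.phrase in lowered]


elements = spot_phrases("There is evidence of metastatic cancer in the liver.")
```

This sketch performs plain substring matching and does not handle negation (for example, “no evidence of metastatic cancer”); a practical rule set would need to account for such context.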


The data sources include structured and unstructured information. Structured information may be converted into standardized units, where appropriate. Unstructured information may include ASCII text strings, image information in DICOM (Digital Imaging and Communication in Medicine) format, and text documents partitioned based on domain knowledge. Information that is likely to be incorrect or missing may be noted, so that action may be taken. For example, the mined information may include corrected information, including corrected ICD-9 diagnosis codes.


Extraction from a database source may be carried out by querying a table in the source, in which case, the domain knowledge needs to encode what information is present in which fields in the database. On the other hand, the extraction process may involve computing a complicated function of the information contained in the database, in which case, the domain knowledge may be provided in the form of a program that performs this computation whose output may be fed to the rest of the system.
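As a minimal sketch of the table-query case, assuming a hypothetical `lab_results` table and a hypothetical `FIELD_MAP` that plays the role of the domain knowledge about which field holds which variable, extraction from a database source might look like the following. The schema, the variable name `BloodGlucose`, and the confidence of 1.0 assigned to database values are assumptions made for illustration and are not specified in the description.

```python
import sqlite3

# Hypothetical domain knowledge: variable name -> (table, column).
FIELD_MAP = {"BloodGlucose": ("lab_results", "glucose_mg_dl")}


def extract_from_table(conn, patient_id, variable):
    """Query the mapped field and emit one element per row, ordered by time."""
    table, column = FIELD_MAP[variable]
    # Table and column names come from the trusted FIELD_MAP, never from
    # user input; the patient identifier is passed as a bound parameter.
    rows = conn.execute(
        f"SELECT taken_at, {column} FROM {table} "
        "WHERE patient_id = ? ORDER BY taken_at",
        (patient_id,),
    ).fetchall()
    # Assumption: values read directly from a database get confidence 1.0.
    return [
        {"name": variable, "value": value, "time": taken_at, "confidence": 1.0}
        for taken_at, value in rows
    ]


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE lab_results (patient_id TEXT, taken_at TEXT, glucose_mg_dl REAL)"
)
conn.executemany(
    "INSERT INTO lab_results VALUES (?, ?, ?)",
    [("p1", "2002-01-05", 140.0), ("p1", "2002-01-02", 180.0), ("p2", "2002-01-03", 95.0)],
)
elements = extract_from_table(conn, "p1", "BloodGlucose")
```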


Extraction from images, waveforms, etc., may be carried out by image processing or feature extraction programs that are provided to the system.


Combination includes the process of producing a unified view of each variable at a given point in time from potentially conflicting assertions from the same or different sources. In various embodiments of the present invention, this is performed using domain knowledge regarding the statistics of the variables represented by the elements (“prior probabilities”).
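The description does not state the formula used for combination. One common approach, shown here purely as an assumption, treats each assertion as independent evidence and pools it with the prior probability in odds form (a naive Bayes style update). The function name `combine` and the example numbers are hypothetical.

```python
def combine(prior, assertions):
    """Pool probabilities that a binary variable is True into one probability.

    prior      -- prior probability that the variable is True (0 < prior < 1)
    assertions -- per-element probabilities that the variable is True

    Assumes the elements are conditionally independent given the variable;
    this is an illustrative choice, not a rule taken from the description.
    """
    for p in [prior, *assertions]:
        if not 0.0 < p < 1.0:
            raise ValueError("probabilities must lie strictly between 0 and 1")
    prior_odds = prior / (1.0 - prior)
    odds = prior_odds
    for q in assertions:
        # Each element contributes its likelihood ratio relative to the prior.
        odds *= (q / (1.0 - q)) / prior_odds
    return odds / (1.0 + odds)


# Two conflicting elements for one variable: "true, 0.8" and "false, 0.7"
# (the latter is a probability of 0.3 that the variable is True).
unified = combine(0.5, [0.8, 0.3])
```

With a prior of 0.5 the two conflicting assertions partly cancel, leaving a unified probability of 12/19 (about 0.63) that the variable is true.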


Inference is the process of taking all the factoids that are available about a patient and producing a composite view of the patient's progress through disease states, treatment protocols, laboratory tests, etc. Essentially, a patient's current state can be influenced by a previous state and any new composite observations.
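The dependence of the current state on the previous state and on new observations can be illustrated with a simple Markov-style filtering step. This is a sketch under stated assumptions: the two states, the transition probabilities, and the function names `step` and `state_sequence` are hypothetical, and the description does not commit to this particular model.

```python
STATES = ("stable", "progressing")

# Hypothetical transition probabilities: TRANSITION[previous][current].
TRANSITION = {
    "stable": {"stable": 0.9, "progressing": 0.1},
    "progressing": {"stable": 0.2, "progressing": 0.8},
}


def step(belief, likelihood):
    """Advance the belief one time step, then weight it by new observations.

    belief     -- probability of each state at the previous time
    likelihood -- how well each state explains the new observations
    """
    predicted = {
        s: sum(belief[p] * TRANSITION[p][s] for p in STATES) for s in STATES
    }
    unnormalized = {s: predicted[s] * likelihood[s] for s in STATES}
    total = sum(unnormalized.values())
    return {s: v / total for s, v in unnormalized.items()}


def state_sequence(initial, likelihoods):
    """Return the belief over states after each successive observation."""
    beliefs, belief = [], initial
    for likelihood in likelihoods:
        belief = step(belief, likelihood)
        beliefs.append(belief)
    return beliefs


sequence = state_sequence(
    {"stable": 1.0, "progressing": 0.0},
    [{"stable": 0.5, "progressing": 0.5}, {"stable": 0.1, "progressing": 0.9}],
)
```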


As illustrated in FIG. 1, an exemplary data mining framework for mining high-quality structured clinical information includes a data miner 150 that mines information from a computerized patient record (CPR) 110 using domain-specific knowledge contained in a knowledge base 130. The data miner 150 includes components for extracting information from the CPR 152, combining all available evidence in a principled fashion over time 154, and drawing inferences from this combination process 156. The mined information may be stored in a structured CPR 180.


The extraction component 152 deals with gleaning small pieces of information from each data source regarding a patient, which are represented as probabilistic assertions about the patient at a particular time. These probabilistic assertions are called elements. The combination component 154 combines all the elements that refer to the same variable at the same time period to form one unified probabilistic assertion regarding that variable. These unified probabilistic assertions are called factoids. The inference component 156 deals with the combination of these factoids, at the same point in time and/or at different points in time, to produce a coherent and concise picture of the progression of the patient's state over time. This progression of the patient's state is called a state sequence.



FIG. 2 illustrates an exemplary main browser screen 200 for browsing mined patient information. The exemplary main browser screen 200 includes a run state selector 202, a patient selector 204, and an enter button 206.


In operation, a user interacting with the main browser screen 200 enters a patient identifier using the patient selector 204 and a data mining run state using the run state selector 202. The user then clicks on the enter button 206 to cause the selected patient identifier and run state to be input.


The data mining run state can include a particular run cycle (e.g., run date and time) at which patient medical records were mined. When information is retrieved, it can include only information current as of that point.
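One way to read “current as of that point” is that, for each variable, the most recent value mined at or before the selected run is returned. The following sketch assumes that reading; the record layout and the `snapshot` function are hypothetical.

```python
from datetime import datetime

# Hypothetical mined records: (run time, variable, value).
RECORDS = [
    (datetime(2002, 1, 15), "Cancer", "False"),
    (datetime(2002, 3, 15), "Cancer", "True"),
    (datetime(2002, 3, 15), "Smoker", "True"),
]


def snapshot(records, run_state):
    """Return the latest value of each variable as of the selected run."""
    latest = {}
    for run_time, variable, value in sorted(records):
        if run_time <= run_state:
            latest[variable] = value  # later runs overwrite earlier ones
    return latest


february_view = snapshot(RECORDS, datetime(2002, 2, 1))
april_view = snapshot(RECORDS, datetime(2002, 4, 1))
```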


Referring to FIG. 3, an exemplary options screen 300 is illustrated. The options screen 300 may include a plurality of input buttons, each input button for displaying a level of information. For example, the user may click on an input button to select summary information. FIG. 4 illustrates the result of selecting summary information from the options screen 300. As shown in FIG. 4, a summary of a particular patient's information is presented. This summary includes all elements, documents, and tests for the patient relating to glycemic control, which is the view of the patient record presented to the particular user.


Advantageously, the patient information presented to a particular user may depend on the identity or role of the user. For example, a cardiologist may be presented with a different view of the data than an oncologist. Similarly, a physician may be presented with information different from that of a nurse or administrative employee. By presenting different views of the patient information, the user can more effectively make use of information that he or she is interested in.
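A role-dependent view can be sketched as a filter over categorized items. The role names follow the examples in the text, while the category labels, the `VIEWS` table, and the `view_for` function are assumptions introduced for illustration.

```python
# Hypothetical mapping from user role to the item categories that role sees.
VIEWS = {
    "cardiologist": {"cardiac", "labs"},
    "oncologist": {"oncology", "labs"},
    "administrative": {"billing"},
}


def view_for(role, items):
    """Return only the items whose category is visible to the given role."""
    allowed = VIEWS.get(role, set())  # unknown roles see nothing
    return [item for item in items if item["category"] in allowed]


ITEMS = [
    {"name": "STTAbn", "category": "cardiac"},
    {"name": "Cancer", "category": "oncology"},
    {"name": "BloodGlucose", "category": "labs"},
    {"name": "InvoiceTotal", "category": "billing"},
]

cardiology_view = view_for("cardiologist", ITEMS)
```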


Another option is to display all the patient information (every element and derived conclusion) but to sort this list in order of decreasing relevance to the disease. For instance, one patient's most relevant item may be his abnormal test results, while another patient whose test results are normal may have his family history of cancer be the most relevant item.
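The relevance ordering described above can be sketched with a scoring function that depends on both the kind of item and the patient's own values. The weights, the `abnormal` flag, and the function names are hypothetical; the description does not say how relevance is computed.

```python
# Hypothetical base relevance of each item type to the disease of interest.
BASE_WEIGHT = {"TestResult": 0.6, "FamilyHistoryCancer": 0.8}


def relevance(item):
    """Score an item; abnormal findings count double (an assumed rule)."""
    base = BASE_WEIGHT.get(item["name"], 0.0)
    return base * (2.0 if item.get("abnormal") else 1.0)


def sort_by_relevance(items):
    """Return all items, most relevant first."""
    return sorted(items, key=relevance, reverse=True)


patient_a = [
    {"name": "FamilyHistoryCancer", "abnormal": False},
    {"name": "TestResult", "abnormal": True},
]
patient_b = [
    {"name": "TestResult", "abnormal": False},
    {"name": "FamilyHistoryCancer", "abnormal": False},
]

top_a = sort_by_relevance(patient_a)[0]["name"]
top_b = sort_by_relevance(patient_b)[0]["name"]
```

Under these assumed weights, the abnormal test result ranks first for the first patient, while the family history ranks first for the second patient, whose test results are normal.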


Referring to FIG. 5, an exemplary verification screen is illustrated. This screen allows a user to drill down from an element to its underlying source. In this case, the element “STTAbn; Value: true, 0.8” has been selected, causing a physician note to be displayed in the right-hand portion of the screen. The highlighted portion of the physician note indicates the data from which the element was derived. In this case, it was concluded that there is an 80% probability that the patient's ECG showed ST-T wave abnormalities. FIG. 6 illustrates drilling down from another element, “STTabn; Value: false, 0.7”, that contradicts the element shown in FIG. 5. In this case, it was concluded that there is a 70% probability that the patient's ECG did not show ST-T wave abnormalities. A user may use the verification screen to verify the conclusions inferred from the underlying data sources.
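The link from an element to the highlighted portion of its source can be sketched by storing, with each element, the document identifier and the character span from which it was derived. The `LinkedElement` class, the `render_highlight` function, the bracket markers, and the sample note text are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinkedElement:
    """An element together with a pointer to its supporting text span."""
    name: str
    value: str
    confidence: float
    doc_id: str
    start: int  # offset of the first supporting character
    end: int    # offset one past the last supporting character


def render_highlight(element, documents, open_mark="[[", close_mark="]]"):
    """Return the source note with the supporting span wrapped in markers."""
    text = documents[element.doc_id]
    return (
        text[: element.start]
        + open_mark
        + text[element.start : element.end]
        + close_mark
        + text[element.end :]
    )


# Hypothetical note; the span is located with str.index for the example.
documents = {"note-1": "ECG shows ST-T wave abnormalities."}
phrase = "ST-T wave abnormalities"
start = documents["note-1"].index(phrase)
element = LinkedElement("STTAbn", "true", 0.8, "note-1", start, start + len(phrase))

rendered = render_highlight(element, documents)
```

A graphical interface would replace the bracket markers with visual highlighting; the stored span is what allows the user to verify the element against its source.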


Although FIGS. 5 and 6 show that the underlying data sources are physician notes, it should be appreciated that the data sources could take other forms. For example, the elements may be derived from (and linked to) medical images, waveforms, and structured information (e.g., information contained in a database).


Referring to FIG. 7, documents may be displayed to the user. In this case, the user selected a physician note written by Emergency Room (ER) personnel. Two separate elements were derived from information contained in this document. FIG. 8 shows another document displayed on the exploration screen. As illustrated, this document includes fourteen elements in six categories.



FIGS. 9 and 10 illustrate patient information extracted from structured data sources. In particular, FIG. 9 shows lab results for a particular patient. As depicted, the lab results include a date, time, test name, and measurement value. FIG. 10 shows various medications administered to the patient. This information includes a drug name, date, dosage, and price information. The information obtained from structured data sources may have been converted into standardized units, where appropriate.
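Conversion into standardized units can be illustrated with blood glucose, which laboratories report in either mg/dL or mmol/L. The choice of mg/dL as the standard unit and the names below are assumptions for this sketch; the factor of about 18.016 follows from the molar mass of glucose (about 180.16 g/mol).

```python
# Multipliers that convert a glucose value into mg/dL (the assumed standard).
TO_MG_DL = {"mg/dL": 1.0, "mmol/L": 18.016}


def standardize_glucose(value, unit):
    """Convert a blood glucose measurement to mg/dL."""
    try:
        return value * TO_MG_DL[unit]
    except KeyError:
        raise ValueError(f"unsupported glucose unit: {unit}") from None


converted = standardize_glucose(5.5, "mmol/L")  # about 99.1 mg/dL
unchanged = standardize_glucose(140.0, "mg/dL")
```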



FIGS. 11 to 13 illustrate exemplary patient summary screens. FIG. 11 shows summary results for ‘BGLUT’ (blood glucose level). As shown, various summary information is presented to the user. Likewise, FIG. 12 shows summary results for “TCPL”. As shown in FIG. 13, patient summary information related to various facets of glycemic control is presented.


While the exemplary screens use several selection menus and buttons, it should be appreciated that the selection of various parameters such as the patient identifier, miner run state, documents, elements, categories, etc., can be accommodated using a variety of devices, such as a number of graphical user interface selection widgets, check boxes, buttons, list boxes, pop-up or drop-down marks, text entry boxes and the like, or any known or later developed interfaces that an operator can access. It should be appreciated that the various exemplary screens illustrated herein can also, or alternatively, include any device capable of presentation, exploration, and verification of mined patient information.


Although illustrative embodiments of the present invention have been described herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to those precise embodiments, and that various other changes and modifications may be effected therein by one skilled in the art without departing from the scope or spirit of the invention.

Claims
  • 1. A method for browsing patient information, comprising the steps of: mining, by a processor, for patient information, the mining extracting values for respective variables as at least some of the patient information, each of the variables assigned one value, each of the values for each of the multiple variables being derived from multiple pieces extracted from a data source, each of the pieces referring to a same variable being assigned first probabilities indicating likelihoods of the respective variables being the corresponding values, at least one of the first probabilities being less than 100%, each of the values determined with a unified probability determined by combination of the first probabilities from multiple of the pieces referring to the same variable such that the multiple variables are assigned respective values and respective unified probabilities, the pieces and values representing a patient at a particular time; selecting at least one of the variables to view the patient information related to the selected one of the variables, the selected patient information related to the selected one of the variables including a subset portion of the data source, the subset portion including the piece extracted for the value of the selected one of the variables and additional information associated with the piece from the data source and not including other information from the data source for the patient, the piece included in the subset portion associated with the first probability used to determine the unified probability of the value for the selected one of the variables; presenting the selected patient information on a screen.
  • 2. The method of claim 1, wherein the selected patient information includes one or more of raw information extracted from the data source for the patient and conclusions drawn therefrom.
  • 3. The method of claim 1, wherein the one of the values is derived from the piece extracted for the one of the values from unstructured data of the data source.
  • 4. The method of claim 1, wherein the selected patient information includes an element linked to unstructured information, the unstructured information including the subset portion.
  • 5. The method of claim 1, wherein the selected patient information includes an element linked to a note with highlighted information.
  • 6. The method of claim 4, wherein the highlighted information refers to information used to derive the element.
  • 7. The method of claim 3, wherein the unstructured information includes one of free text, medical image information, and waveform information.
  • 8. The method of claim 1, wherein the one of the values is inferred from pieces from structured data sources.
  • 9. The method of claim 8, wherein the structured data source includes a database.
  • 10. The method of claim 1, wherein the selected patient information is a document; further comprising providing a link to related information associated with the selected patient information, the links to related information referring to the pieces from the document.
  • 11. The method of claim 1, wherein the selected patient information includes summary information.
  • 12. The method of claim 1, wherein selectable patient information is presented based on a view.
  • 13. The method of claim 12, wherein the view is based on a user identifier, a role, or a combination thereof.
  • 14. The method of claim 12, wherein the selected patient information includes summary information.
  • 15. The method of claim 12, wherein the selected patient information is sorted based upon relevance to a disease, a user, or a combination thereof.
  • 16. A program storage device readable by a machine, tangibly embodying a program of instructions executable on the machine to perform method steps for browsing mined patient information, the method steps comprising: selecting at least one of multiple variables, the selecting indicating the patient information to view as a subset of the mined patient information, at least some of the mined patient information used to derive a value for the selected one of the variables, the value extracted from a piece of a data source and having a first probability indicating a likelihood of the variable being the value, the first probability determined from a second probability of the piece indicating a likelihood of the variable being the value; presenting the selected patient information on a screen, the selected patient information including a portion of the data source, the portion including the piece for the value and additional information associated with the piece from the data source, the piece included in the portion having the second probability used to determine the first probability for the value.
  • 17. The program storage device of claim 16 wherein the step of presenting comprises presenting the selected patient information with a link to related information, the related information comprising the data source.
  • 18. The method of claim 1 wherein the selected patient information includes links to the data source.
  • 19. The method of claim 1 wherein presenting the selected patient information comprises displaying a list of information as a function of relevance to a disease.
  • 20. The method of claim 1 wherein presenting the selected patient information comprises presenting corrected information relative to the patient information.
  • 21. The method of claim 1 wherein mining comprises mining as a function of domain-specific knowledge; further comprising: presenting on the screen a list of the values indicative of the domain-specific knowledge.
  • 22. The method of claim 1 wherein presenting comprises presenting an inferred conclusion.
  • 23. The method of claim 1 wherein selecting the patient information comprises selecting at least one of the values from a medical image.
  • 24. The method of claim 1 wherein presenting comprises presenting to a physician.
  • 25. The method of claim 1 wherein presenting comprises presenting to a nurse.
  • 26. The method of claim 1 wherein presenting comprises presenting to an administrative employee.
  • 27. A system for browsing mined patient information, the system comprising: a data miner configured to mine patient information, the mining extracting values for multiple variables as at least some of the patient information, each of the values being derived from multiple pieces extracted from a data source, the pieces and values representing a patient at a particular time, the pieces assigned first probabilities indicating likelihoods of the variables being the corresponding values, each of the values determined with unified probabilities, each of the unified probabilities being determined by combination of the first probabilities from multiple of the pieces referring to the same variable; a user input for selecting at least one of the variables for viewing the patient information supporting the value for the selected one of the variables, the patient information including a subset portion of the data source, the subset portion including the piece extracted for the one of the values of one of the variables and additional information associated with the piece from the data source, the piece included in the subset portion having the first probability used to determine the unified probability for the value of the selected one of the variables; and a screen operable to present the patient information.
  • 28. The method of claim 1 wherein mining comprises inferring a state of the variable from a combination of data corresponding to different pieces, the state comprising one of the values; further comprising storing the state as patient information in a structured database; wherein selecting comprises selecting the variable from the structured database where a corresponding probability is associated with the inferring.
  • 29. The method of claim 1 wherein selecting comprises verifying contradictory data.
  • 30. A method for browsing patient information, the method comprising: mining for patient information from data in data sources of a computerized patient record, the patient information represented by a probability of a variable having a value for each of a plurality of pieces of the patient information, a final value of the variable determined by combining the probabilities of the pieces of the patient information for the variable, the mining including determining final values for respective multiple variables for a patient; presenting the patient information on a screen; receiving user selection of the variable; and displaying, on the screen, the data associated with the selected variable and supporting the probability, the data supporting the probability including the pieces, one of the pieces being from a document in one of the data sources, and the data also including a context of the piece and not including other information from the data source for the piece.
CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application Serial No. 60/335,542, filed on Nov. 2, 2001, which is incorporated by reference herein in its entirety.

US Referenced Citations (127)
Number Name Date Kind
4946679 Thys-Jacobs Aug 1990 A
5172418 Ito et al. Dec 1992 A
5307262 Ertel Apr 1994 A
5359509 Little et al. Oct 1994 A
5365425 Torma et al. Nov 1994 A
5508912 Schneiderman Apr 1996 A
5544044 Leatherman Aug 1996 A
5557514 Seare et al. Sep 1996 A
5619991 Sloane Apr 1997 A
5652842 Siegrist, Jr. et al. Jul 1997 A
5657255 Fink et al. Aug 1997 A
5664109 Johnson et al. Sep 1997 A
5669877 Blomquist Sep 1997 A
5706441 Lockwood Jan 1998 A
5724379 Perkins et al. Mar 1998 A
5724573 Agrawal et al. Mar 1998 A
5737539 Edelson et al. Apr 1998 A
5738102 Lemelson Apr 1998 A
5811437 Singh et al. Sep 1998 A
5832450 Myers et al. Nov 1998 A
5835897 Dang Nov 1998 A
5845253 Rensimer et al. Dec 1998 A
5899998 McGauley et al. May 1999 A
5903889 de la Huerga et al. May 1999 A
5908383 Brynjestad Jun 1999 A
5924073 Tyuluman et al. Jul 1999 A
5924074 Evans Jul 1999 A
5935060 Iliff Aug 1999 A
5939528 Clardy et al. Aug 1999 A
5991731 Colon et al. Nov 1999 A
6039688 Douglas et al. Mar 2000 A
6067466 Selker et al. May 2000 A
6076088 Paik et al. Jun 2000 A
6078894 Clawson et al. Jun 2000 A
6081786 Barry et al. Jun 2000 A
6083693 Nandabalan et al. Jul 2000 A
6108635 Herren et al. Aug 2000 A
6125194 Yeh et al. Sep 2000 A
6128620 Pissanos et al. Oct 2000 A
6139494 Cairnes Oct 2000 A
6151581 Kraftson et al. Nov 2000 A
6173280 Ramkumar et al. Jan 2001 B1
6196970 Brown Mar 2001 B1
6212519 Segal Apr 2001 B1
6212526 Chaudhuri et al. Apr 2001 B1
6253186 Pendleton, Jr. Jun 2001 B1
6259890 Driscoll et al. Jul 2001 B1
6266645 Simpson Jul 2001 B1
6272472 Danneels et al. Aug 2001 B1
6322502 Schoenberg et al. Nov 2001 B1
6322504 Kirshner Nov 2001 B1
6338042 Paizis Jan 2002 B1
6381576 Gilbert Apr 2002 B1
6468210 Iliff Oct 2002 B1
6478737 Bardy Nov 2002 B2
6484144 Martin et al. Nov 2002 B2
6523019 Borthwick Feb 2003 B1
6529876 Dart Mar 2003 B1
6551243 Bocionek et al. Apr 2003 B2
6551266 Davis, Jr. Apr 2003 B1
6587829 Camarda et al. Jul 2003 B1
6611825 Billheimer et al. Aug 2003 B1
6611846 Stoodley Aug 2003 B1
6641532 Iliff Nov 2003 B2
6645959 Bakker-Arkema et al. Nov 2003 B1
6678669 Lapointe et al. Jan 2004 B2
6754655 Segal Jun 2004 B1
6802810 Ciarniello et al. Oct 2004 B2
6804656 Rosenfeld et al. Oct 2004 B1
6826536 Forman Nov 2004 B1
6839678 Schmidt et al. Jan 2005 B1
6903194 Sato et al. Jun 2005 B1
6915254 Heinze et al. Jul 2005 B1
6915266 Saeed et al. Jul 2005 B1
6941271 Soong Sep 2005 B1
6961687 Myers, Jr. et al. Nov 2005 B1
6988075 Hacker Jan 2006 B1
7058658 Mentzer Jun 2006 B2
7130457 Kaufman et al. Oct 2006 B2
7249006 Lombardo et al. Jul 2007 B2
7307543 Rosenfeld et al. Dec 2007 B2
7353238 Gliklich Apr 2008 B1
7630908 Amrien et al. Dec 2009 B1
20010011243 Dembo et al. Aug 2001 A1
20010023419 LaPointe et al. Sep 2001 A1
20010032195 Graichen et al. Oct 2001 A1
20010041991 Segal et al. Nov 2001 A1
20010051882 Murphy et al. Dec 2001 A1
20020002474 Michelson et al. Jan 2002 A1
20020010597 Mayer et al. Jan 2002 A1
20020026332 Snowden et al. Feb 2002 A1
20020032581 Reitberg Mar 2002 A1
20020035316 Drazen Mar 2002 A1
20020077853 Boru et al. Jun 2002 A1
20020082480 Riff et al. Jun 2002 A1
20020087361 Benigno et al. Jul 2002 A1
20020099570 Knight Jul 2002 A1
20020123905 Goodroe et al. Sep 2002 A1
20020138492 Kil Sep 2002 A1
20020138524 Ingle et al. Sep 2002 A1
20020143577 Shiffman et al. Oct 2002 A1
20020165736 Tolle et al. Nov 2002 A1
20020173990 Marasco Nov 2002 A1
20020177759 Schoenberg et al. Nov 2002 A1
20030028401 Kaufman et al. Feb 2003 A1
20030046114 Davies et al. Mar 2003 A1
20030050794 Keck Mar 2003 A1
20030108938 Pickar et al. Jun 2003 A1
20030120133 Rao et al. Jun 2003 A1
20030120134 Rao et al. Jun 2003 A1
20030120458 Rao et al. Jun 2003 A1
20030125984 Rao et al. Jul 2003 A1
20030125985 Rao et al. Jul 2003 A1
20030125988 Rao et al. Jul 2003 A1
20030126101 Rao et al. Jul 2003 A1
20030130871 Rao et al. Jul 2003 A1
20030135391 Edmundson et al. Jul 2003 A1
20030208382 Westfall Nov 2003 A1
20040067547 Harbron et al. Apr 2004 A1
20040078216 Togo Apr 2004 A1
20040184644 Leichter et al. Sep 2004 A1
20040243586 Byers Dec 2004 A1
20050187794 Kimak Aug 2005 A1
20050191716 Surwit et al. Sep 2005 A1
20060064415 Guyon et al. Mar 2006 A1
20060122864 Gottesman et al. Jun 2006 A1
20060136259 Weiner et al. Jun 2006 A1
Foreign Referenced Citations (14)
Number Date Country
198 20 276 Nov 1999 DE
0 596 247 Sep 1993 EP
0 641 863 Mar 1995 EP
0 917 078 Oct 1997 EP
2 332 544 Jun 1999 GB
11328073 Nov 1999 JP
WO 9829790 Jul 1998 WO
9839720 Sep 1998 WO
WO 0051054 Aug 2000 WO
WO 0069331 Nov 2000 WO
WO 0166007 Sep 2001 WO
2001297157 Oct 2001 WO
WO 0178005 Oct 2001 WO
0182173 Nov 2001 WO
Related Publications (1)
Number Date Country
20030120514 A1 Jun 2003 US
Provisional Applications (1)
Number Date Country
60335542 Nov 2001 US