The present invention relates to medical information processing systems, and, more particularly to a computerized system and method for providing automated performance measurement information for health care organizations.
Health care organizations need to generate various types of performance measurement information to determine how well they are progressing over time. Health care organizations typically use this information to determine areas of excellence within their organizations as well as those areas that need improvement. Performance measurement information provides an objective basis for planning and making budgeting decisions. In addition, performance measurement information can be used to demonstrate accountability to the public and to back up claims of quality. Frequently, performance measurement information is provided to accreditation organizations for compliance purposes.
The Joint Commission on Accreditation of Healthcare Organizations (JCAHO), an organization that accredits more than 4,700 hospitals nationwide, requires that participating hospitals provide certain types of performance measurement information. For example, JCAHO requires that participating hospitals provide information regarding patients treated for acute myocardial infarction (AMI). As one example of the type of information that must be provided, hospitals are required to indicate whether an AMI patient without aspirin contraindication received aspirin within 24 hours before or after hospital arrival. Because it is believed that early treatment with aspirin markedly reduces mortality for AMI, JCAHO requires hospitals to report this information.
Currently, performance measurement information must be collected from a myriad of structured and unstructured data sources to comply with accreditation requests. For example, it may be necessary to access numerous different databases, each with its own peculiar format. Worse, physician notes may have to be consulted. These notes usually are nothing more than free text dictations, and it may be very difficult to sift through the notes to gather the necessary information. As a result, the effort taken to collect this information is usually time consuming, expensive, and error prone. Furthermore, usually only a small sample of patient data can be supplied.
Given the importance of collecting accurate performance measurement information, it would be desirable and highly advantageous to provide new techniques for automatically generating performance measurement information for health care organizations.
The present invention provides a technique for automatically generating performance measurement information for health care organizations.
In various embodiments of the present invention, a method is provided that includes formulating a query based on a specified performance measurement category. This query is then executed to obtain performance measurement information. At least some of the obtained performance measurement information may be derived from unstructured data sources, such as free text physician notes.
The performance measurement information can be outputted. The performance measurement information may be sent to a health care accreditation organization. An example of a health care accreditation organization is the Joint Commission on Accreditation of Health Care Organizations (JCAHO).
Performance measurement information can include patient information from a health care provider being evaluated. For example, a health care accreditation organization may evaluate a hospital for its quality of care in treating heart attack patients. This patient information may include clinical information, financial information, and demographic information.
The obtained performance measurement information may be sampled from a patient population. Alternatively, it may be obtained for an entire patient population.
Performance measurement information may be generated by a health care provider, third party service provider, or an accreditation organization. The performance measurement information may be made available using a network, such as, for example, the Internet.
In various embodiments, an evaluation score of a health care provider may be calculated using the obtained performance measurement information. This evaluation score may be outputted for evaluating health care providers. Health care consumers may have the opportunity to view or download evaluation information via the Internet. Health care providers may be ranked according to the evaluation scores. Such rankings may be done for various performance measurement categories.
These and other aspects, features and advantages of the present invention will become apparent from the following detailed description of preferred embodiments, which is to be read in connection with the accompanying drawings.
To facilitate a clear understanding of the present invention, illustrative examples are provided herein which describe certain aspects of the invention. However, it is to be appreciated that these illustrations are not meant to limit the scope of the invention, and are provided herein to illustrate certain concepts associated with the invention.
It is also to be understood that the present invention may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. Preferably, the present invention is implemented in software as a program tangibly embodied on a program storage device. The program may be uploaded to, and executed by, a machine comprising any suitable architecture. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units (CPU), a random access memory (RAM), and input/output (I/O) interface(s). The computer platform also includes an operating system and microinstruction code. The various processes and functions described herein may either be part of the microinstruction code or part of the program (or combination thereof) which is executed via the operating system. In addition, various other peripheral devices may be connected to the computer platform such as an additional data storage device and a printing device.
It is to be understood that, because some of the constituent system components and method steps depicted in the accompanying figures are preferably implemented in software, the actual connections between the system components (or the process steps) may differ depending upon the manner in which the present invention is programmed.
The computer system 100 may be a standalone system or be linked to a network via the network interface 112. The network interface 112 may be a hard-wired interface. However, in various exemplary embodiments, the network interface 112 can include any device suitable to transmit information to and from another device, such as a universal asynchronous receiver/transmitter (UART), a parallel digital interface, a software interface or any combination of known or later developed software and hardware. The network interface may be linked to various types of networks, including a local area network (LAN), a wide area network (WAN), an intranet, a virtual private network (VPN), and the Internet.
The external storage 114 may be implemented using a database management system (DBMS) managed by the processor 102 and residing on a memory such as a hard disk. However, it should be appreciated that the external storage 114 may be implemented on one or more additional computer systems. For example, the external storage 114 may include a data warehouse system residing on a separate computer system.
Those skilled in the art will appreciate that other alternative computing environments may be used without departing from the spirit and scope of the present invention.
Referring to
Preferably, the structured CPR 214 is populated with patient information using data mining techniques described in “Patient Data Mining,” by Rao et al., copending U.S. Published Patent Application No. 2003/0126101, filed herewith, which is incorporated by reference herein in its entirety.
That disclosure teaches a data mining framework for mining high-quality structured clinical information. The data mining framework includes a data miner that mines medical information from a computerized patient record based on domain-specific knowledge contained in a knowledge base. The data miner includes components for extracting information from the computerized patient record, combining all available evidence in a principled fashion over time, and drawing inferences from this combination process. The mined medical information is stored in a structured computerized patient record.
The extraction component deals with gleaning small pieces of information from each data source regarding a patient, which are represented as probabilistic assertions about the patient at a particular time. These probabilistic assertions are called elements. The combination component combines all the elements that refer to the same variable at the same time period to form one unified probabilistic assertion regarding that variable. These unified probabilistic assertions are called factoids. The inference component deals with the combination of these factoids, at the same point in time and/or at different points in time, to produce a coherent and concise picture of the progression of the patient's state over time. This progression of the patient's state is called a state sequence.
An individual model of the state of a patient may be built. The patient state is simply a collection of variables that one may care about relating to the patient. The information of interest may include a state sequence, i.e., the value of the patient state at different points in time during the patient's treatment.
Each of the above components uses detailed knowledge regarding the domain of interest, such as, for example, a disease of interest. This domain knowledge base can come in two forms. It can be encoded as an input to the system, or as programs that produce information that can be understood by the system. The part of the domain knowledge base that is input to the present form of the system may also be learned from data.
Domain-specific knowledge for mining the data sources may include institution-specific domain knowledge. For example, this may include information about the data available at a particular hospital, document structures at a hospital, policies of a hospital, guidelines of a hospital, and any variations of a hospital.
The domain-specific knowledge may also include disease-specific domain knowledge. For example, the disease-specific domain knowledge may include various factors that influence risk of a disease, disease progression information, complications information, outcomes and variables related to a disease, measurements related to a disease, and policies and guidelines established by medical bodies.
As mentioned, the extraction component takes information from the CPR to produce probabilistic assertions (elements) about the patient that are relevant to an instant in time or time period. This process is carried out with the guidance of the domain knowledge that is contained in the domain knowledge base. The domain knowledge required for extraction is generally specific to each source.
Extraction from a text source may be carried out by phrase spotting, which requires a list of rules that specify the phrases of interest and the inferences that can be drawn therefrom. For example, if there is a statement in a doctor's note with the words “There is evidence of metastatic cancer in the liver,” then, in order to infer from this sentence that the patient has cancer, a rule is needed that directs the system to look for the phrase “metastatic cancer,” and, if it is found, to assert that the patient has cancer with a high degree of confidence (which, in the present embodiment, translates to generate an element with name “Cancer”, value “True” and confidence 0.9).
The data sources include structured and unstructured information. Structured information may be converted into standardized units, where appropriate. Unstructured information may include ASCII text strings, image information in DICOM (Digital Imaging and Communication in Medicine) format, and text documents partitioned based on domain knowledge. Information that is likely to be incorrect or missing may be noted, so that action may be taken. For example, the mined information may include corrected information, including corrected ICD-9 diagnosis codes.
Extraction from a database source may be carried out by querying a table in the source, in which case, the domain knowledge needs to encode what information is present in which fields in the database. On the other hand, the extraction process may involve computing a complicated function of the information contained in the database, in which case, the domain knowledge may be provided in the form of a program that performs this computation whose output may be fed to the rest of the system.
Extraction from images, waveforms, etc., may be carried out by image processing or feature extraction programs that are provided to the system.
Combination includes the process of producing a unified view of each variable at a given point in time from potentially conflicting assertions from the same/different sources. In various embodiments of the present invention, this is performed using domain knowledge regarding the statistics of the variables represented by the elements (“prior probabilities”).
Inference is the process of taking all the factoids that are available about a patient and producing a composite view of the patient's progress through disease states, treatment protocols, laboratory tests, etc. Essentially, a patient's current state can be influenced by a previous state and any new composite observations.
The domain knowledge required for this process may be a statistical model that describes the general pattern of the evolution of the disease of interest across the entire patient population and the relationships between the patient's disease and the variables that may be observed (lab test results, doctor's notes, etc.). A summary of the patient may be produced that is believed to be the most consistent with the information contained in the factoids, and the domain knowledge.
For instance, if observations seem to state that a cancer patient is receiving chemotherapy while he or she does not have cancerous growth, whereas the domain knowledge states that chemotherapy is given only when the patient has cancer, then the system may decide either: (1) the patient does not have cancer and is not receiving chemotherapy (that is, the observation is probably incorrect), or (2) the patient has cancer and is receiving chemotherapy (the initial inference—that the patient does not have cancer—is incorrect); depending on which of these propositions is more likely given all the other information. Actually, both (1) and (2) may be concluded, but with different probabilities.
As another example, consider the situation where a statement such as “The patient has metastatic cancer” is found in a doctor's note, and it is concluded from that statement that <cancer=True (probability=0.9)>. (Note that this is equivalent to asserting that <cancer=True (probability=0.9), cancer=unknown (probability=0.1)>).
Now, further assume that there is a base probability of cancer <cancer=True (probability=0.35), cancer=False (probability=0.65)> (e.g., 35% of patients have cancer). Then, we could combine this assertion with the base probability of cancer to obtain, for example, the assertion <cancer=True (probability=0.93), cancer=False (probability=0.07)>.
Similarly, assume conflicting evidence indicated the following:
1. <cancer=True (probability=0.9), cancer=unknown probability=0.1)>
2. <cancer=False (probability=0.7), cancer=unknown (probability=0.3)>
3. <cancer=True (probability=0.1), cancer unknown (probability=0.9)> and
4. <cancer=False (probability=0.4), cancer unknown (probability=0.6)>.
In this case, we might combine these elements with the base probability of cancer <cancer=True (probability=0.35), cancer=False (probability=0.65)> to conclude, for example, that <cancer=True (prob=0.67), cancer=False (prob=0.33)>.
Referring again to
The query is then executed to obtain performance measurement information. At least some of the obtained performance measurement information may be derived from unstructured data sources, such as, for example, free text, medical images and waveforms.
An exemplary query is shown in
It should be appreciated that the query shown in
As mentioned previously, the performance measurement information can be sent to a health care accreditation organization such as JCAHO. The obtained performance measurement information may be sampled or obtained for an entire patient population.
Performance measurement information may be generated by a health care provider, third party service provider, or an accreditation organization. The performance measurement information may be made available using any suitable network.
In order to empower health care consumers, an evaluation score of a health care provider may be determined using the obtained performance measurement information. Consumers may view or download this evaluation information via the Internet, for example. Health care providers may be ranked according to the evaluation scores. Such rankings may be done for various performance measurement categories. For example, hospitals in a particular geographic area may be ranked according to quality of care in treating prostate cancer. There may be another list that ranks hospitals nationwide for quality of care in treating infectious diseases, etc.
Referring to
In step 402, a query is formulated based on the selected performance measurement category. (This may involve formulating a query such as the one shown in
In step 402, the query is executed to obtain performance measurement information. At least some of the obtained performance measurement information may have been derived from unstructured information. Preferably, this information resides in a structured data repository that is populated using mined unstructured patient information, as described in “Patient Data Mining,” by Rao et al., copending U.S. Published Patent Application No. 2003/0126101.
In step 404, a compliance report is formatted. While this step involves creating a report, it should be appreciated that there are many other ways to output performance measurement information. For instance, the performance measurement information may be output to a magnetic or optical disc, electronically transmitted, or displayed upon a screen.
In step 405, a determination is made as to whether any more reports are to be generated. If there are, then control returns back to step 401; otherwise, control continues to step 406 where the operation stops.
As shown in
Although illustrative embodiments of the present invention have been described herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to those precise embodiments, and that various other changes and modifications may be affected therein by one skilled in the art without departing from the scope or spirit of the invention.
This application claims the benefit of U.S. Provisional Application Ser. No. 60/335,542, filed on Nov. 2, 2001, which is incorporated by reference herein in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
4946679 | Thys-Jacobs | Aug 1990 | A |
5307262 | Ertel | Apr 1994 | A |
5359509 | Little et al. | Oct 1994 | A |
5365425 | Torma et al. | Nov 1994 | A |
5508912 | Schneiderman | Apr 1996 | A |
5544044 | Leatherman | Aug 1996 | A |
5557514 | Seare et al. | Sep 1996 | A |
5619991 | Sloane | Apr 1997 | A |
5652842 | Siegrist et al. | Jul 1997 | A |
5657255 | Fink et al. | Aug 1997 | A |
5664109 | Johnson et al. | Sep 1997 | A |
5669877 | Blomquist | Sep 1997 | A |
5706441 | Lockwood | Jan 1998 | A |
5724379 | Perkins et al. | Mar 1998 | A |
5737539 | Edelson et al. | Apr 1998 | A |
5811437 | Singh et al. | Sep 1998 | A |
5832450 | Myers et al. | Nov 1998 | A |
5835897 | Dang | Nov 1998 | A |
5845253 | Rensimer et al. | Dec 1998 | A |
5899998 | McGauley et al. | May 1999 | A |
5924073 | Tyuluman et al. | Jul 1999 | A |
5935060 | Iliff | Aug 1999 | A |
5939528 | Clardy et al. | Aug 1999 | A |
6076088 | Paik et al. | Jun 2000 | A |
6078894 | Clawson et al. | Jun 2000 | A |
6081786 | Barry et al. | Jun 2000 | A |
6083693 | Nandabalan et al. | Jul 2000 | A |
6108635 | Herren et al. | Aug 2000 | A |
6128620 | Pissanos et al. | Oct 2000 | A |
6151581 | Kraftson et al. | Nov 2000 | A |
6196970 | Brown | Mar 2001 | B1 |
6253186 | Pendleton, Jr. | Jun 2001 | B1 |
6259890 | Driscoll et al. | Jul 2001 | B1 |
6266645 | Simpson | Jul 2001 | B1 |
6272472 | Danneels et al. | Aug 2001 | B1 |
6322502 | Schoenberg et al. | Nov 2001 | B1 |
6338042 | Paizis | Jan 2002 | B1 |
6381576 | Gilbert | Apr 2002 | B1 |
6468210 | Iliff | Oct 2002 | B1 |
6484144 | Martin et al. | Nov 2002 | B2 |
6529876 | Dart et al. | Mar 2003 | B1 |
6551243 | Bocionek et al. | Apr 2003 | B2 |
6551266 | Davis, Jr. | Apr 2003 | B1 |
6611846 | Stoodley | Aug 2003 | B1 |
6641532 | Iliff | Nov 2003 | B2 |
6645959 | Bakker-Arkema et al. | Nov 2003 | B1 |
6678669 | Lapointe et al. | Jan 2004 | B2 |
8754855 | Segal | Jun 2004 | |
6804656 | Rosenfeld et al. | Oct 2004 | B1 |
8802810 | Clamiello et al. | Oct 2004 | |
6826536 | Forman | Nov 2004 | B1 |
6839678 | Schmidt et al. | Jan 2005 | B1 |
6903194 | Sato et al. | Jun 2005 | B1 |
6915254 | Heinze et al. | Jul 2005 | B1 |
6915266 | Saeed et al. | Jul 2005 | B1 |
6941271 | Soong | Sep 2005 | B1 |
6988075 | Hacker | Jan 2006 | B1 |
7058658 | Mentzer | Jun 2006 | B2 |
7130457 | Kaufman et al. | Oct 2006 | B2 |
7307543 | Rosenfeld et al. | Dec 2007 | B2 |
20010011243 | Dembo et al. | Aug 2001 | A1 |
20010032195 | Graichen et al. | Oct 2001 | A1 |
20010041991 | Segal et al. | Nov 2001 | A1 |
20010051882 | Murphy et al. | Dec 2001 | A1 |
20020002474 | Michelson et al. | Jan 2002 | A1 |
20020010597 | Mayer et al. | Jan 2002 | A1 |
20020026332 | Snowden et al. | Feb 2002 | A1 |
20020032581 | Reitberg | Mar 2002 | A1 |
20020035316 | Drazen | Mar 2002 | A1 |
20020077853 | Boru et al. | Jun 2002 | A1 |
20020082480 | Riff et al. | Jun 2002 | A1 |
20020087361 | Benigno et al. | Jul 2002 | A1 |
20020099570 | Knight | Jul 2002 | A1 |
20020123905 | Goodroe et al. | Sep 2002 | A1 |
20020138492 | Kil | Sep 2002 | A1 |
20020138524 | Ingle et al. | Sep 2002 | A1 |
20020143577 | Shiffman et al. | Oct 2002 | A1 |
20020165736 | Tolle et al. | Nov 2002 | A1 |
20020173990 | Marasco | Nov 2002 | A1 |
20020177759 | Schoenberg et al. | Nov 2002 | A1 |
20030028401 | Kaufman et al. | Feb 2003 | A1 |
20030046114 | Davies et al. | Mar 2003 | A1 |
20030050794 | Keck | Mar 2003 | A1 |
20030108938 | Pickar et al. | Jun 2003 | A1 |
20030120133 | Rao et al. | Jun 2003 | A1 |
20030120134 | Rao et al. | Jun 2003 | A1 |
20030120458 | Rao et al. | Jun 2003 | A1 |
20030120514 | Rao et al. | Jun 2003 | A1 |
20030125985 | Rao et al. | Jul 2003 | A1 |
20030125988 | Rao et al. | Jul 2003 | A1 |
20030126101 | Rao et al. | Jul 2003 | A1 |
20030130871 | Rao et al. | Jul 2003 | A1 |
20030208382 | Westfall | Nov 2003 | A1 |
20040078216 | Toto | Apr 2004 | A1 |
20050187794 | Kimak | Aug 2005 | A1 |
20060064415 | Guyon et al. | Mar 2006 | A1 |
Number | Date | Country |
---|---|---|
0 917 078 | Oct 1997 | EP |
11328073 | Nov 1999 | JP |
2001297157 | Oct 2001 | JP |
9839720 | Sep 1998 | WO |
0182173 | Nov 2001 | WO |
Number | Date | Country | |
---|---|---|---|
20030125984 A1 | Jul 2003 | US |
Number | Date | Country | |
---|---|---|---|
60335542 | Nov 2001 | US |