To diagnose a patient, traditionally a physician examines the patient and then uses his or her own expert professional knowledge and judgment to produce and document a diagnosis for the patient. More recently, various Computer Aided Diagnostic (CAD) systems have been created to automate the generation of a clinical diagnosis based on current and past clinical and social information relating to the patient. In particular, deep learning systems are increasingly used to automatically analyze a radiology image (e.g., an ultrasound, CT, or MRI image) and to automatically detect abnormalities in the image, and even to derive a full clinical diagnosis from the image. Existing CAD systems can perform at or above the level of humans in certain narrow areas of use, such as reading sensitivity (recall) and specificity (precision). In most other ways, however, existing CAD systems do not perform as well as human experts. Furthermore, providing a system that meets or exceeds human accuracy in all settings is a difficult and long process.
A computer system automatically generates a first diagnosis of a patient based on input such as one or more medical images. The computer system receives input representing diagnostic intent from a human user. The system determines whether to provide the first diagnosis to the human user based on the first diagnosis and the diagnostic intent, such as by determining whether the first diagnosis and the diagnostic intent agree with each other, and only providing the first diagnosis to the human user if the first diagnosis disagrees with the diagnostic intent of the human user.
Other features and advantages of various aspects and embodiments of the present invention will become apparent from the following description and from the claims.
There is emerging evidence that even if a Computer Aided Diagnostic (CAD) system does not perform as well as a human in terms of accuracy, a combination of the CAD system and a human expert may exceed the individual accuracy of either. This general effect is known in the machine learning community as “boosting,” which refers to a technique in which an ensemble of multiple weak classifiers is combined to make a strong classifier. In the case of clinical diagnosis, however, the physician is responsible for the final diagnosis and cannot, therefore, be treated merely as a “weak classifier” that can be overridden by other classifiers (such as the CAD system). Instead, all input from other weaker classifiers, such as CAD systems, must be evaluated and consolidated by the physician into a final diagnosis, which can require a significant amount of valuable physician time. As a result of this aspect of clinical diagnosis, simple existing boosting techniques cannot be applied to the problem of clinical diagnosis. One problem posed by the prior art, therefore, is how to leverage the benefits of CAD systems even when they are not perfectly accurate, and in light of the requirement that the physician approve of the final diagnosis, while minimizing the amount of physician time and effort required to produce the final diagnosis. From a technical perspective, one problem posed by the prior art is how to develop a computer-implemented system that can use a CAD system to automatically generate a diagnosis of a patient, and which can automatically tailor the output of the CAD system to based on an automated comparison between a diagnosis generated automatically by the CAD system and an indication of diagnostic intent received from the patient's physician.
In a simplified example, assume that in a certain use case the physician's accuracy is 95% and that the CAD system's accuracy is 93%. (Although the description herein refers solely to accuracy for ease of explanation, in practice such accuracy may be divided into and measured by reference to both specificity and sensitivity.) If the physician's and CAD system's errors are perfectly correlated (i.e., if the physician's errors are a subset of the CAD system's errors), then the CAD system would have no value in improving the diagnostic accuracy of the physician. In practice, however, it is not usually the case that the physician's and CAD system's errors are perfectly correlated.
Therefore, solely for purposes of example and without limitation, assume instead that:
As described above, existing CAD systems typically always display the CAD-generated diagnosis to the physician for review. Naively displaying all of the CAD-generated diagnoses to the physician in the scenario described above would result in the following:
As the above example illustrates, one problem with existing systems is that, by providing the CAD system's output to the physician in all cases, they make highly inefficient use of the physician's time by providing the physician with the CAD system's output to review even in cases in which such review at best provides no benefit, and at worst is affirmatively disadvantageous. In the particular example above, in which only 2% of cases involve CAD output which is useful for the physician to review, the amount of physician time wasted is very significant.
As the above example further illustrates, another problem with existing systems is that, by providing the CAD system's output to the physician in all cases, they involve the CAD system generating and providing output in cases in which doing so is not likely to result in any improvement in accuracy or efficiency of the CAD system. As a result, such CAD systems perform generate and provide output which not only is unlikely to improvement the accuracy or efficiency of the CAD system, but which may actually result in a decrease in accuracy of the CAD system (if, for example, the physician modifies the CAD system's initial output to make it less accurate) and a decrease in efficiency of the CAD system (by, for example, causing the CAD system to generate and provide output which does not result in any increase in accuracy of the results produced by the CAD system).
Embodiments of the present invention address these and other problems of prior art systems by automatically and selectively providing CAD system output to a physician only in cases in which embodiments of the present invention determine that it would be valuable to provide such output to the physician. In other cases, embodiments of the present invention suppress or otherwise do not provide such output to the physician. As a result, the efficiency of the overall system (including both the CAD system and the physician) at generating accurate diagnoses is improved.
More specifically, the accuracy and efficiency of the CAD system itself is improved by embodiments of the present invention, relative to the prior art. For example, embodiments of the present invention solve the above-mentioned problem of sub-optimal efficiency of the CAD system, by not requiring the CAD system to generate and provide output to the physician in cases in which doing so is not likely to improve accuracy. Such embodiments of the present invention increase the efficiency of the CAD system itself by reducing the number of computations that the CAD system performs, namely by eliminating (relative to the prior art) computations involved in generating and providing output to the physician. Such embodiments of the present invention, therefore, reduce the number of computations required to be performed by the computer processor in each case, thereby resulting in a more efficient use of that processor.
Various Computer Aided Physician Documentation (CAPD) systems exist for understanding the context of a medical study and the content of a partially written report (also referred to herein as a “clinical note” or “note”) as it is being authored, and to annotate such a report (e.g., with measurements, findings, and diagnoses). Embodiments of the present invention may include a CAPD system which has been modified to evaluate the output of a CAD system in the context of the current note as it is being authored by the physician. For example, and as described in more detail below, embodiments of the present invention may determine whether the findings of the physician agree with or contradict the findings represented by the CAD output, and then determine whether to provide the CAD output to the physician based on the determination.
Referring to
In the particular embodiment illustrated in
Although not shown in
The audio capture component 106 may be or include any of a variety of well-known audio capture components, such as microphones, which may be standalone or integrated within or otherwise connected to another device (such as a smartphone, tablet computer, laptop computer, or desktop computer).
In the particular embodiment illustrated in
The ASR/NLU component may be implemented in any of a variety of ways, such as in one or more software programs installed and executing on one or more computers. Although the ASR/NLU component 110 is shown as a single component in
Although in the particular embodiment illustrated in
The structured note 112, whether created based on speech input 104, non-speech input, or a combination thereof, may take any of a variety of forms, such as any one or more of the following, in any combination: a text document (e.g., word processing document), a structured document (e.g., an XML document), and a database record (e.g., a record in an Electronic Medical Record (EMR) system). Although the structured note 112 is shown as a single element in
The system 100 also includes an Computer Aided Diagnostic (CAD) component 132, which receives CAD input 130 as input and processes the CAD input 130 to produce CAD output 134 (
The system 100 may also include a Computer Aided Physician Documentation (CAPD) component 118, which may include any of a variety of existing CAPD technologies, as well as being capable of performing the functions now described. The healthcare provider 102 may provide a diagnostic intent input 124 to the CAPD component 118, which may receive the diagnostic intent input 124 as input (
The diagnostic intent input 124 may include any of a variety of data representing a diagnostic intent of the healthcare provider in connection with the patient. Such input 124 may, but need not, represent a diagnosis of the patient by the healthcare provider 102. The input 124 may, for example, include data representing a diagnostic intent of the healthcare provider in connection with the patient but which does not represent a diagnosis of the patient by the healthcare provider. As an example of the latter, the diagnostic intent input 124 may include a description of the healthcare provider 102's observations (findings) of the patient and also include a description of the healthcare provider 102's impressions of their observations. Such findings and impressions may indicate a diagnostic intent of the healthcare provider but not represent a diagnosis.
Although the healthcare provider 102 may generate and provide the diagnostic intent input 124 after the structured note 112 has been generated in its entirety, this is not a limitation of the present invention. The healthcare provider 102 may, for example, generate some or all of the diagnostic intent input 124 while the structured note 112 is being generated and before the entire structured note 112 has been generated, e.g., while any of one or more of the following is occurring:
For example, the healthcare provider 118 may provide the diagnostic intent input 124 after inputting one section of the structured note 108 (e.g., the Findings section) and before inputting another section of the structured note 108 (e.g., the Impressions section). As a result, the CAPD component 118 may receive the provider diagnosis 124 while the structured note 108 is being generated (i.e., after some, but not all, of the structured note 108 has been generated).
The CAPD component 118 may also receive the CAD output 134 (which may include data representing a diagnosis generated by the CAD component 132) as input (
The CAPD component 118 may, after receiving both the provider diagnostic intent input 124 and the CAD output 134, determine whether to provide output 120 representing some or all of the CAD output 134 to the healthcare provider 102, based on any one or more of the following, in any combination (
If the CAPD component 118 determines that the CAD output 134 should be provided to the provider 102, then the CAPD component 118 provides the output 120 (representing some or all of the CAPD output 134) to the provider 102 (
The CAPD component 118 may use any of a variety of techniques to determine whether to provide the CAD output 134 to the healthcare provider 102. In general, if the CAD output 134 agrees with the diagnostic intent input 124, then the system 100 provides the CAD output 134 to the healthcare provider 102; otherwise the system 100 does not provide the CAD output 134 to the healthcare provider 102. A refinement of this general approach is that the system 100 may not provide the CAD output 134 to the healthcare provider 102 in response to determining that the diagnostic intent input 124 includes a finding that the CAD output 134 does not include, but otherwise provide the CAD output 134 to the healthcare provider 102.
More specifically, for example, the CAPD component 118 may determine whether the provider diagnostic intent 124 is the same as or otherwise consistent with (e.g., contains findings that are consistent with) the diagnosis represented by the CAD output 134. Then the CAPD component 118 may act as follows:
Embodiments of the present invention have a variety of advantages. For example, the system 100 and method 200 reduce the amount of time required by the healthcare provider 102 to review CAD-generated diagnoses, by only providing those diagnoses as output to the healthcare provider 102 in cases in which providing such diagnoses is likely to improve the accuracy of the healthcare provider 102's diagnosis. In typical uses cases this can reduce the amount of unnecessary effort required by the healthcare provider 102 by a significant amount, without any reduction in diagnosis accuracy, and possibly with an increase in diagnosis accuracy as a result of increasing the healthcare provider 102's confidence in the system 100 and reducing the healthcare provider 102's workload, thereby enabling the healthcare provider 102 to focus more carefully on reviewing the relatively small number of CAD diagnoses that are likely to be helpful in improving the accuracy of the healthcare provider 102's diagnosis.
It is to be understood that although the invention has been described above in terms of particular embodiments, the foregoing embodiments are provided as illustrative only, and do not limit or define the scope of the invention. Various other embodiments, including but not limited to the following, are also within the scope of the claims. For example, elements and components described herein may be further divided into additional components or joined together to form fewer components for performing the same functions.
Any of the functions disclosed herein may be implemented using means for performing those functions. Such means include, but are not limited to, any of the components disclosed herein, such as the computer-related components described below.
The techniques described above may be implemented, for example, in hardware, one or more computer programs tangibly stored on one or more computer-readable media, firmware, or any combination thereof. The techniques described above may be implemented in one or more computer programs executing on (or executable by) a programmable computer including any combination of any number of the following: a processor, a storage medium readable and/or writable by the processor (including, for example, volatile and non-volatile memory and/or storage elements), an input device, and an output device. Program code may be applied to input entered using the input device to perform the functions described and to generate output using the output device.
Embodiments of the present invention include features which are only possible and/or feasible to implement with the use of one or more computers, computer processors, and/or other elements of a computer system. Such features are either impossible or impractical to implement mentally and/or manually. For example, embodiments of the present invention use computerized automatic speech recognition, natural language understanding, computer-aided diagnostic, and computer-aided physician documentation components to automatically recognize and understand speech, to automatically generate diagnoses, and to automatically understand the context of a clinical note. Such components are inherently computer-implemented and provide a technical solution to the technical problem of automatically generating documents based on speech.
Any claims herein which affirmatively require a computer, a processor, a memory, or similar computer-related elements, are intended to require such elements, and should not be interpreted as if such elements are not present in or required by such claims. Such claims are not intended, and should not be interpreted, to cover methods and/or systems which lack the recited computer-related elements. For example, any method claim herein which recites that the claimed method is performed by a computer, a processor, a memory, and/or similar computer-related element, is intended to, and should only be interpreted to, encompass methods which are performed by the recited computer-related element(s). Such a method claim should not be interpreted, for example, to encompass a method that is performed mentally or by hand (e.g., using pencil and paper). Similarly, any product claim herein which recites that the claimed product includes a computer, a processor, a memory, and/or similar computer-related element, is intended to, and should only be interpreted to, encompass products which include the recited computer-related element(s). Such a product claim should not be interpreted, for example, to encompass a product that does not include the recited computer-related element(s).
Each computer program within the scope of the claims below may be implemented in any programming language, such as assembly language, machine language, a high-level procedural programming language, or an object-oriented programming language. The programming language may, for example, be a compiled or interpreted programming language.
Each such computer program may be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a computer processor. Method steps of the invention may be performed by one or more computer processors executing a program tangibly embodied on a computer-readable medium to perform functions of the invention by operating on input and generating output. Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, the processor receives (reads) instructions and data from a memory (such as a read-only memory and/or a random access memory) and writes (stores) instructions and data to the memory. Storage devices suitable for tangibly embodying computer program instructions and data include, for example, all forms of non-volatile memory, such as semiconductor memory devices, including EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROMs. Any of the foregoing may be supplemented by, or incorporated in, specially-designed ASICs (application-specific integrated circuits) or FPGAs (Field-Programmable Gate Arrays). A computer can generally also receive (read) programs and data from, and write (store) programs and data to, a non-transitory computer-readable storage medium such as an internal disk (not shown) or a removable disk. These elements will also be found in a conventional desktop or workstation computer as well as other computers suitable for executing computer programs implementing the methods described herein, which may be used in conjunction with any digital print engine or marking engine, display monitor, or other raster output device capable of producing color or gray scale pixels on paper, film, display screen, or other output medium.
Any data disclosed herein may be implemented, for example, in one or more data structures tangibly stored on a non-transitory computer-readable medium. Embodiments of the invention may store such data in such data structure(s) and read such data from such data structure(s).
Number | Date | Country | |
---|---|---|---|
62637463 | Mar 2018 | US |