1. Field of the Invention
This invention relates to health related data analysis and more particularly relates to a system and method for an integrated analysis of condition, cost, and duration.
2. Description of the Related Art
Most corporations, including health insurance corporations, maintain a high volume of data. Such data may be analyzed and exploited for valuable information regarding business trends, and other important statistics. Data mining is a common strategy for identifying and analyzing such data.
There are many various forms of data mining. Custom analytic operations may be developed to meet specific needs. Alternatively, commercially available statistical analysis tools such as Statistical Analysis Software (SAS) may be used to identify statistical trends in data.
Health insurance companies typically maintain databases of health insurance claim information, demographic information, and other data about health insurance plan members. Such information may be used to gain valuable insights into disease causes, progressions, and potential cures. Unfortunately, typical methods for analyzing such data are often cumbersome, costly, and require unworkably high processing times and resources.
For example, currently the process to determine cost and duration for a specific condition such as a disability, is a manual process performed by multiple vendors and is based on information gathered from a few physician offices for a couple of patients with similar disabilities. This process takes at least six weeks but can take up to three months, and costs disability carriers approximately $6K per member per year. One disability carrier, like Travelers, spends ˜$40M per year to determine cost and durations for specific individuals with a long-term disability.
The referenced shortcomings are not intended to be exhaustive, but rather are among many that tend to impair the effectiveness of previously known techniques disease management; however, those mentioned here are sufficient to demonstrate that the methodologies appearing in the art have not been satisfactory and that a significant need exists for the techniques described and claimed in this disclosure.
From the foregoing discussion, it should be apparent that a need exists for a program product, system, and method for analysis of condition, cost, and duration.
A system is presented for analysis of condition, cost, and duration. In one embodiment, the system includes a data storage device configured to store a database comprising one or more records, each record having one or more attributes. The system may also include a server in data communication with the data storage device. The server may obtain one or more attributes associated with a subject having a condition, search a database stored on a data storage device for a group of records associated with the attributes, wherein the records comprise cost and duration information associated with the condition, estimate cost and duration in response to the cost and duration information, and generate an output in response to the cost estimate and duration estimate.
In certain embodiments, the condition of the present invention may be a specific condition, or a combination of multiple specific conditions. Non-limiting examples of a specific condition include a short term disability, a long term disability, and a permanent disability.
In one embodiment, the server may filter the records according to a limiting criterion. In a further embodiment, the server may extract the cost and duration information of a predetermined time period. The server may also aggregate records based on the cost and duration information. In still another embodiment, the server may calculate a benchmark for cost and duration of the condition or its associated service. In a still further embodiment, the server may determine ranges and trends for cost and duration of the condition or its associated service.
A method is also presented for analysis of condition, cost, and duration. The method in the disclosed embodiments substantially includes the steps necessary to carry out the functions presented above with respect to the operation of the described system. In one embodiment, the method includes obtaining one or more attributes associated with a subject having a condition, searching a database stored on a data storage device for a group of records associated with the attributes, wherein the records comprise cost and duration information associated with the condition, estimating cost and duration in response to the cost and duration information, and generating an output in response to the cost estimate and duration estimate.
In a further embodiment, the method may include filtering the records according to a limiting criterion. The method may also include extracting cost and duration information of a predetermined time period. Additionally, the method may include aggregating the records based on the cost and duration information.
In a still further embodiment, the method may include calculating a benchmark for cost and duration of the condition or its associated service. In a certain embodiment, the method may also include determining ranges and trends for cost and duration of the condition or its associated service.
A tangible computer program product comprising a computer readable medium having computer usable program code executable to perform operations is also presented. In one embodiment, the operations may comprise obtaining one or more attributes associated with a subject having a condition, searching a database stored on a data storage device for a group of records associated with the attributes, wherein the records comprise cost and duration information associated with the condition, estimating cost and duration in response to the cost and duration information, and generating an output in response to the cost estimate and duration estimate.
In a certain embodiment, the operations may include filtering the records according to a limiting criterion. The operations may also include extracting cost and duration information of a predetermined time period. Additionally, the operations may include aggregating the records based on the cost and duration information. In a further embodiment, the operations may include calculating a benchmark for cost and duration of the condition or its associated service. In addition, the operations may include determining ranges and trends for cost and duration of the condition or its associated service.
The term “coupled” is defined as connected, although not necessarily directly, and not necessarily mechanically.
The terms “a” and “an” are defined as one or more unless this disclosure explicitly requires otherwise.
The term “substantially” and its variations are defined as being largely but not necessarily wholly what is specified as understood by one of ordinary skill in the art, and in one non-limiting embodiment “substantially” refers to ranges within 10%, preferably within 5%, more preferably within 1%, and most preferably within 0.5% of what is specified.
The terms “comprise” (and any form of comprise, such as “comprises” and “comprising”), “have” (and any form of have, such as “has” and “having”), “include” (and any form of include, such as “includes” and “including”) and “contain” (and any form of contain, such as “contains” and “containing”) are open-ended linking verbs. As a result, a method or device that “comprises,” “has,” “includes” or “contains” one or more steps or elements possesses those one or more steps or elements, but is not limited to possessing only those one or more elements. Likewise, a step of a method or an element of a device that “comprises,” “has,” “includes” or “contains” one or more features possesses those one or more features, but is not limited to possessing only those one or more features. Furthermore, a device or structure that is configured in a certain way is configured in at least that way, but may also be configured in ways that are not listed.
Other features and associated advantages could become apparent with reference to the following detailed description of specific embodiments in connection with the accompanying drawings.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
Various features and advantageous details are explained more fully with reference to the non-limiting embodiments that are illustrated in the accompanying drawings and detailed in the following description. Descriptions of well known starting materials, processing techniques, components, and equipment are omitted so as not to unnecessarily obscure the invention in detail. It should be understood, however, that the detailed description and the specific examples, while indicating embodiments of the invention, are given by way of illustration only, and not by way of limitation. Various substitutions, modifications, additions, and/or rearrangements within the spirit and/or scope of the underlying inventive concept could become apparent to those skilled in the art from this disclosure.
Certain units described in this specification have been labeled as modules, in order to more particularly emphasize their implementation independence. A module is “[a] self-contained hardware or software component that interacts with a larger system.” Alan Freedman, “The Computer Glossary” 268 (8th ed. 1998). A module comprises a component of a machine, a machine or a plurality of machines that are suitably programmed to operate according to executable instructions. For example, a module may be implemented as a hardware circuit comprising custom VLSI circuits or gate arrays, off-the-shelf semiconductors such as logic chips, transistors, or other discrete components. A module may also be implemented in programmable hardware devices such as field programmable gate arrays, programmable array logic, programmable logic devices, a controller, or the like.
Modules may also include software-defined units or instructions that, when executed by a processing machine or device, retrieve and transform data stored on a data storage device from a first state to a second state. An identified module of executable code may, for instance, comprise one or more physical blocks of computer instructions which may be organized as an object, procedure, or function. Nevertheless, the executables of an identified module need not be physically located together, but may comprise disparate instructions stored in different locations which, when joined logically together, comprise the module, and when executed by the processor, achieve the stated data transformation.
Indeed, a module of executable code maybe a single instruction, or many instructions, and may even be distributed over several different code segments, among different programs, and across several memory devices. Similarly, operational data may be identified and illustrated herein within modules, and may be embodied in any suitable form and organized within any suitable type of data structure. The operational data may be collected as a single data set, or may be distributed over different locations including over different storage devices.
In the following description, numerous specific details are provided, such as examples of programming, software modules, user selections, network transactions, database queries, database structures, hardware modules, hardware circuits, hardware chips, etc., to provide a thorough understanding of the present embodiments. One skilled in the relevant art could recognize, however, that the invention may be practiced without one or more of the specific details, or with other methods, components, materials, and so forth. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the invention.
Specifically, the system 100 may aggregate records based on the cost and duration information. In still another embodiment, the server 102 may generate an output in response to the cost and duration information. For example, the server 102 may calculate a benchmark or determine ranges and trends for cost and duration of the condition or its associated service.
In one embodiment, the user interface device 110 is referred to broadly and is intended to encompass a suitable processor-based device such as a desktop computer, a laptop computer, a Personal Digital Assistant (PDA), a mobile communication device or organizer device having access to the network 108. In a further embodiment, the user interface device 110 may access the Internet to access a web application or web service hosted by the server 102 and provide a user interface for enabling the service consumer (user) to enter or receive information. For example, the user may enter one or more attributes, such as a predetermined time period, limiting criteria, a selected attribute for aggregating or reporting a statistic, or the like.
The network 108 may facilitate communications of data between the server 102 and the user interface device 110. The network 108 may include any type of communications network including, but not limited to, a direct PC to PC connection, a local area network (LAN), a wide area network (WAN), a modem to modem connection, the Internet, a combination of the above, or any other communications network now known or later developed within the networking arts which permits two or more computers to communicate, one with another.
In one embodiment, the server 102 may receive an attribute, search the database for a group of records associated with the attribute, wherein the records comprise cost and duration information associated with the condition, estimate cost and duration in response to the cost and duration information, and generate an output in response to the cost and duration estimate. Additionally, the server 102 may access data stored in the data storage device 104 via a Storage Area Network (SAN) connection, a LAN, a data bus, or the like.
The data storage device 104 may include a hard disk, including hard disks arranged in an Redundant Array of Independent Disks (RAID) array, a tape storage drive comprising a magnetic tape data storage device, an optical storage device, or the like. In one embodiment, the data storage device 104 may store health related data, such as insurance claims data, consumer data, or the like. The data may be arranged in a database and accessible through Structured Query Language (SQL) queries, or other data base query languages or operations.
In one embodiment, the server 102 may submit a query to selected data storage devices 204-208 to collect a consolidated set of data elements associated with an individual or a group of individuals. The server 102 may store the consolidated data set in a consolidated data storage device 210. In such an embodiment, the server 102 may refer back to the consolidated data storage device 210 to obtain a set of data elements associated with a specified individual. Alternatively, the server 102 may query each of the data storage devices 204-208 independently or in a distributed query to obtain the set of data elements associated with a specified individual. In another alternative embodiment, multiple databases may be stored on a single consolidated data storage device 210.
In various embodiments, the server 102 may communicate with the data storage devices 204-210 over the data-bus 202. The data-bus 202 may comprise a SAN, a LAN, or the like. The communication infrastructure may include Ethernet, Fibre-Chanel Arbitrated Loop (FC-AL), Small Computer System Interface (SCSI), and/or other similar data communication schemes associated with data storage and communication. For example, the server 102 may communicate indirectly with the data storage devices 204-210, the server first communicating with a storage server or storage controller 106.
In one example of the system 200, the first data storage device 204 may store data associated with insurance claims made by one or more individuals. The insurance claims data may include cost and duration data associated with medical services, procedures, and prescriptions utilized by the individual. In one particular embodiment, the first data storage device 202 may include insurance claims data for over 56 million customers of a health insurance company. The database may include claims data spanning over 14 years. Of those 56 million members, 26 million had a five year history or more. In one embodiment, the second data storage device 206 may store summary data associated with the individual. The summary data may include one or more diagnoses of conditions from which the individual suffers and/or actuarial data associated with an estimated cost in medical services that the individual is likely to incur. The third data storage device 208 may store lab test data associated with the individual. For example, the third data storage device 208 may include data associated with the individual's lab test results and/or clinical observations. A fourth data storage device (not shown) may store demographic data. For example, the demographic data may include information relating to the individual's demographics include gender, race or ethnicity, age, income, disabilities, mobility, educational attainment, home ownership, employment status, location, or the like.
The server 102 may host a software application configured for analysis of condition, cost, and duration. The software application may further include modules or functions for interfacing with the data storage devices 204-210, interfacing with a network 108, interfacing with a user, and the like. In a further embodiment, the server 102 may host an engine, application plug-in, or application programming interface (API). In another embodiment, the server 102 may host a web service or web accessible software application.
The computer system 300 also may include Random Access Memory (RAM) 308, which may be SRAM, DRAM, SDRAM, or the like. The computer system 300 may utilize RAM 308 to store the various data structures used by a software application configured for analysis of condition, cost, and duration. The computer system 300 may also include Read Only Memory (ROM) 306 which may be PROM, EPROM, EEPROM, or the like. The ROM may store configuration information for booting the computer system 300. The RAM 308 and the ROM 306 may hold user and system 100 data.
The computer system 300 may also include an input/output (I/O) adapter 310, a communications adapter 314, a user interface adapter 316, and a display adapter 322. The I/O adapter 310 and/or user the interface adapter 316 may, in certain embodiments, enable a user to interact with the computer system 300 in order to input information for authenticating a user, identifying an individual, or receiving health profile information. In a further embodiment, the display adapter 322 may display a graphical user interface associated with a software or web-based application for analysis of condition, cost, and duration.
The I/O adapter 310 may connect one or more storage devices 312, such as one or more of a hard drive, a Compact Disk (CD) drive, a floppy disk drive, a tape drive, to the computer system 300. The communications adapter 314 may be adapted to couple the computer system 300 to the network 108, which may be one or more of a LAN and/or WAN, and/or the Internet. The user interface adapter 316 couples user input devices, such as a keyboard 320 and a pointing device 318, to the computer system 300. The display adapter 322 may be driven by the CPU 302 to control the display on the display device 324.
The present embodiments are not limited to the architecture of system 300. Rather the computer system 300 is provided as an example of one type of computing device that may be adapted to perform the functions of server 102 and/or the user interface device 110. For example, any suitable processor-based device may be utilized including without limitation, including personal data assistants (PDAs), computer game consoles, and multi-processor servers. Moreover, the present embodiments may be implemented on application specific integrated circuits (ASIC) or very large scale integrated (VLSI) circuits. In fact, persons of ordinary skill in the art may utilize any number of suitable structures capable of executing logical operations according to the described embodiments.
The network-based system 400 may include components or devices configured to operate in various network layers. For example, the server 102 may include modules configured to work within an application layer 404, a presentation layer 406, a data access layer 408 and a metadata layer 410. In a further embodiment, the server 102 may access one or more data sets 418-422 that comprises a data layer or data tier 430. For example, a first data set 418, a second data set 420 and a third data set 422 may comprise data tier 430 that is stored on one or more data storage devices 204-208.
One or more web applications 412 may operate in the application layer 404. For example, a user may interact with the web application 412 though one or more I/O interfaces 318 and 320 configured to interface with the web application 412 through an I/O adapter 310 that operates on the application layer. In one particular embodiment, a web application 412 may be provided for analysis of condition, cost, and duration that includes software modules configured to perform the steps of obtaining one or more attributes associated with a subject having a condition, searching a database stored on a data storage device for a group of records associated with the attributes, wherein the records comprise cost and duration information associated with the condition, estimating cost and duration in response to the cost and duration information, and generating an output in response to the cost estimate and duration estimate.
In a further embodiment, the server 102 may include components, devices, hardware modules, or software modules configured to operate in the presentation layer 406 to support one or more web services 414. For example, a web application 412 may access a web service 414 to perform one or more web-based functions for the web application 412. In one embodiment, a web application 412 may operate on a first server 102 and access one or more web services 414 hosted on a second server (not shown) during operation.
For example, a web application 412 for identifying cohorts and/or analyzing cohort cost and duration data, or other information may access a first web service 414 for identifying a first group of individuals associated with one or more attributes associated with an individual, and a second web service 414 for extracting cost and duration information from the records. The web services 414 may receive one or more attributes or generate a profile of the individual. In response, the web service 414 may return a list of records associated with the attributes or profile, statistics, graphs, or the like. One of ordinary skill in the art could recognize various web-based architectures employing web services 414 for modular operation of the web application 412.
In one embodiment, a web application 412 or a web service 414 may access one or more of the data sets 418-422 through the data access layer 408. In certain embodiments, the data access layer 408 may be divided into one or more independent data access layers (DAL) 416 for accessing individual data sets 418-422 in the data tier 430. These individual data access layers 416 may be referred to as data sockets or adapters. The data access layers 416 may utilize metadata from the metadata layer 410 to provide the web application 412 or the web service 414 with specific access to the data sets 418-422.
For example, the data access layer 416 may include operations for performing a query of the data sets 418-422 to retrieve specific information for the web application 412 or the web service 414. In a more specific example, the data access layer 416 may include a query for records associated with individuals who have been diagnosed with degeneration of intervertebral disc or that are associated with an ICD-9 code (e.g., ICD-9-M 722.4) associated with a diagnosis of degeneration of intervertebral disc and also match some additional predetermined attributes, such as age, gender, or residence location.
In one embodiment, the CPU 302 may include one or more software defined modules configured to obtain one or more attributes associated with a subject having a condition, search a database stored on a data storage device for a group of records associated with the attributes, wherein the records comprise cost and duration information associated with the condition, estimate cost and duration in response to the cost and duration information, and generate an output in response to the cost estimate and duration estimate. In one embodiment, these modules may include a search module 504, an estimate module 506, and a generate module 508.
In certain embodiments, the attribute may include a diagnosis code associated with a condition. The condition may be a specific condition or a combination of multiple conditions. For example, the condition may be a short term disability, a long term disability, a permanent disability, a disease, a co-morbidity condition, a health-related condition, a retirement, or a combination thereof. Such a disease may include any disease known to persons of ordinary skill in the art, such as a neoplasm.
The attribute may, in certain circumstances, include a plurality of attributes. This may be referred to as a “profile.” In certain embodiments, it may be helpful for a user of the present program product, system, and method to identify occurrences of a particular combination of conditions, diagnoses, events, characteristics, or the like. For example, the attributes contain a diagnosis of the condition and one or more demographic or geographic features. In such an example, an insurance company may desire to know the information and estimates of males in the age of 44 who have malignant neoplasm of colon (ICD-9-CM Diagnosis Code 153) and live in the New York state. Thus, the attributes in this example may include the diagnosis code, an age attribute having a value of ‘44;’ a gender attribute having a value of ‘male,’ a demographic residence of ‘New York state’ The attributes may also comprise a profile including a range of values or a broader category corresponding to certain attributes of an individual, for example, a profile of a person being within the age range of 40-49, male, having malignant neoplasm of large intestine, and residing in the Northeastern area of the United States. Other examples of attributes may include age, gender, area, minority status, income level, etc.
In a further embodiment, the attributes may include a temporal component. For example, the attribute may include a time period associated with the condition a long term disability such as, for example, malignant neoplasm of large intestine. In such an embodiment, the attribute would include the disability condition and a related time frame, such as a median life expectancy of the cancer. In such an example, the group of records may include all large intestine cancer patients with cost and duration information within the specified time frame of cancer onset or initial diagnosis (which could be either an ICD9 code or a lab reading or both).
Although the various functions of the server 102 and the CPU 302 are described in the context of modules, the methods, processes, and software described herein are not limited to a modular structure. Rather, some or all of the functions described in relation to the modules of
Generally, the interface module 502 may receive user inputs and display user outputs. For example, the interface module 502 may receive one or more attributes. The interface module 502 may further receive a predetermined time period, limiting criterion, and other user inputs. In a further embodiment, the interface module 504 may display cost and duration analysis results. Such analysis results may include statistics, tables, charts, graphs, recommendations, or the like.
Structurally, the interface module 502 may include one or more of an I/O adapter 310, a communications adapter 314, a user interface adapter 316, and/or a display adapter 322. The interface module 502 may further include I/O ports, pins, pads, wires, buses, and the like for facilitating communications between the processor 302 and the various adapters and interface components 310-324. The interface module may also include software defined components for interfacing with other software modules on the CPU 302.
In one embodiment, the CPU 302 may load and execute software modules configured to obtain one or more attributes associated with a subject having a condition, search a database stored on a data storage device for a group of records associated with the attributes, wherein the records comprise cost and duration information associated with the condition, estimate cost and duration in response to the cost and duration information, and generate an output in response to the cost estimate and duration estimate. These software modules may include a search module 504, an estimate module 506, and a generate module 508.
In a specific embodiment, the CPU 302 may load and execute computer software configured to generate, retrieve, send, or otherwise operate SQL instructions. For example, the search module 504 may communicate an SQL query to the data storage device 104 which is configured to search the database for a group of records associated with the attribute, wherein the attribute is associated with a subject having a condition, such as a disease or a disability. In addition, the records may contain cost and duration information associated with the condition. Such a group of records may be associated with individuals identical or similar to the subject in terms of the attributes used by the search module 504, but may be different in some other aspects.
In a certain embodiment, the records may be obtained from a database comprising insurance claim information history of enrollment in a healthcare claim, demographic data, insurance claims data, lab data, pharmacy data, race/ethnicity data, psychographic data, disability, absenteeism, worker compensation data for a large number of customers of a health insurance company or the like. In a particular embodiment, such a database may include historical data spanning ten or more years for several million customers. In such an embodiment, the breadth and depth of the database may provide detailed information regarding a large number of diseases, disabilities, conditions, and their associated stages, services including treatments, procedures, lab test, prescriptions, and outcomes.
In order to search the database, an example of the attribute may include a field value associated with the condition of the subject, such as a healthcare billing or medical code stored in a database of healthcare insurance information, in combination with a field value associated with the subject, for example, a field value that indicates certain specified characteristics of the subject, such as age, gender, minority status, a geographic feature, lab tests, lab results, comorbidity diagnosis, other diseases or diagnoses, use of medication, a genetic feature (presence or absence of a genetic marker such as single nucleotide polymorphism, a specific genotype, a specific gene or gene cluster, a characteristic of a specific chromosome), and the like, or a combination thereof.
In a specific embodiment, the analysis of cost and duration may provide benchmarks based on attributes including co-morbidity conditions shared among a group of individuals, which have not been currently accomplished to the inventors' knowledge.
In one embodiment, such a field value may be a medical code. Non-limiting examples of a medical code may include Episode Treatment Groups® (ETG®), International Classification of Diseases (ICD-9), Current Procedural Terminology (CPT), Ambulatory Surgical Center (ASC), Ambulatory Payment Classifications (APC), Current Dental Terminology (CDT), Code on Dental Procedures and Nomenclature (CDPN), Diagnosis Related Grouping (DRG), Health Care Financing Administration Common Procedure Coding System (HCPCS), National Drug Codes (NDC), Revenue Codes, and Per Diems.
In a certain embodiment, the ETG grouper technology could be used in combination with certain aspects of the invention to create the profile associated with the subject of interest and the subject's condition, for example, an injury profile of the subject. For example, the subject's claim may be processed through an ETG software or algorithm, and a specific ETG could be determined for this subject, which could be used for subsequence search for matched profiles or cohorts. In another aspect, the profile may be created by Natural History of Disease (NHD) technology and tool, by itself or in combination with the ETG or identification of any other attributes.
In one embodiment related to the search operation, the data in the database or datasets may be manipulated using Statistical Analysis Software (SAS) or the like, for example, be categorized. In an exemplary embodiment, the age range for categorization may be divided into: <18, 19-34, 35-49, 50-65, and 65+. For another example, the region of residence for data categorization may be determined by the census: pacific, middle west, northeast, and south, which can be sub-divided into states. For ETGs having low patient count, grouping of several ETGs may be needed for categorization of data.
In a further embodiment, the cost and duration analysis may be updated at a predetermined interval. The update may include updates of new age range and/or region mappings, ETG definitions, etc. The subsequent search and analysis could be updated accordingly.
In a specific embodiment, the search operation may identify a group of individuals having records that include a specified ICD-9 diagnosis code and a demographic feature corresponding to the subject's profile. For example, the search may identify a group of records in the database associated with individuals that have a profile including a specific diagnosis and certain age ranges, gender, residence, and/or comorbidity that may be associated with (e.g., identical or closely related to) the profile of the subject.
In a certain embodiment, the estimate module 506 may be configure to estimate cost and duration of the group of records in response to the cost and duration information contained in the records. For example, the search module may retrieve a group of records associated with a profile matching the profile of the subject having a condition, such as a disability or a disease. The estimate module 506 may then extract the cost and duration information from the identified records. In one embodiment, the cost and duration information may include cost and duration of one or more services associated with the condition. Examples of such services may include treatment, procedures, hospital admissions, rehabilitation, emergency room visit, prescriptions, and the like.
For example, the estimate module 506 may include analogue or digital logic, firmware, or software configured o carry out one or more calculations for estimation according to one or more predefined logic functions. In a further embodiment, the CPU 302 may include a software defined estimate module 506 configured to perform analysis and generation of statistic outputs in response to cost and duration estimates.
In a specific embodiment, the search module 504 may feed retrieved data into a spreadsheet configured to perform one or more estimations by the estimate module 506 on the data. For example, an Excel® spreadsheet may include one or more embedded functions or operations configured to estimate by calculating statistics such as averages, odds ratios, other probabilities, counts, summations, and the like. The data may be automatically imported into a spreadsheet using a macro, a software-based script, or the like. In an alternative embodiment, the estimate module 506 may include hard-coded or dynamically variable software functions for calculating such statistics and generating cost and duration estimates for a user.
In a further embodiment, the estimate module 506 may summarize the cost and duration information by calculating a statistic in response to the cost and duration information extracted. For example, the statistic may include mean, median and standard deviation of cost and duration associated with the condition, such as cost and duration of a treatment for the condition.
In specific embodiments, the cost may be a cost associated with a predetermined time period, or a distribution of cost along the duration period for the treatment. The estimate module 506 may provide estimation of cost in a subset of areas based on episode-related indicators, such as hospital admissions, rehabilitation visit, prescription, emergency room visit, and the like. The cost estimate may further be divided by the service categories such as inpatient, outpatient, physician, or pharmacy. One advantage of this aspect is certain subsets of information may have large fluctuations than the whole records and categorization of cost estimate may minimize the influence of such fluctuations. The estimate module may also provide cost estimate along a continuous time dimension by duration of the associated service, such as treatment of week 1, week 2, week 3 . . . .
In certain embodiment, the duration estimate generated by the estimate module 506 may include utilization statistics of specific places of service, types of service, provider types, or therapeutic classes along the duration of a specific condition or its associated service, such as a long term treatment. In a further embodiment, the estimate module 506 may estimate the duration of a condition, for example, by summarizing frequency distribution of records or individuals with similar profiles or attributes along a continuous time period related to the condition such as a time after the onset of a long term disability or permanent disability, which may include counting records or individuals within each time range of the time period.
In one embodiment, the generate module 508 may generate an output in response to the cost estimate and duration estimate. For example, the generate module 508 may calculate a benchmark for cost and duration of the condition or its associated service, such as costs and duration of treatments from initial diagnosis through all associated time points of the condition. The generate module 508 may provide a statistically-valid benchmark specific for a condition, for example, a benchmark of a future treatment cost for 90% of all injuries of this type.
Alternatively, the generate module 508 may determine ranges and trends for cost and duration of the condition or its associated service: for example, ranges and trends of costs of one or more treatments by stages of condition progression, types of services and providers; and ranges and trends of duration and frequency of treatments in outpatient procedure codes, lab codes, prescriptions, inpatient hospitalizations, or the like. These ranges and trends are based on a group of individuals sharing a similar set of attributes or a similar profile, such as a combination of geographic region, gender, age range, expected future lifetime, or medical code, etc.
In a further embodiment, the filter module 602 may filter the group of records according to a limiting criterion. The filter module 602 may filter the records for one that are more relevant to the condition, or separate the records according to selected categories. For example, the filter module 602 may filter the group of records by restricting search parameters before the search is performed. Alternatively, the filter module 602 may filter, remove, or otherwise delete the search results according to the criterion. In a certain embodiment, multiple limiting criteria may be used to restrict the scope of the returned search results. In one embodiment, a limiting criteria may include a field value, such as a record date, age, gender, a demographic factor, a geographic factor, a genotype, a phenotype, a condition, a comorbidity diagnosis, a temporal attribute, or the like.
In one embodiment, the extract module 604 may extract the cost and duration information from the group of records. For example, the extracted cost and duration information may have a time span of at least or about one year, two years, three years, five years, ten years, 15 years, 20 years, or any range derivable therein, or any time period associated with the specific condition, such as a period corresponding to an expected life expectancy following initial diagnosis, or a minimal time period needed for recovery of an injury. In certain embodiments, the extract module 604 may extract cost information according to time increments along the duration of a condition or treatment. In one embodiment, the extract module 604 may sort through the records, search cost and duration information, and transfer cost and duration information into a new data storage device.
In a further embodiment, the server 102 may include an aggregate module 606 configured to aggregate records from the group (e.g., a cohort group) according to a selected attribute. For example, the selected attribute may be a temporal index date. The temporal index date maybe a start date of certain event or occurrence associated with the condition, therefore being shared among the group of records. For example, the temporal index date may be the start date of initial diagnosis, treatment, drug administration, procedure, lab tests, or the like. In a certain embodiment, the groups of records may not share the same event but may have a closely related event in each record, such as a similar treatment or administration of a different drug with similar functions. The start date or a characteristic date of such a closely related event may also be used as the index date. By determining and using an index data for the group of records, the aggregate module 606 may normalize or align the records along a time period. The index data could be seconds, hours, years, or combination thereof, depending on circumstances, such as types of events. In one embodiment, the aggregate module 602 may also determine sub-ranges for aggregation of records, such as a series of time ranges of a treatment, e.g., treatment records are aggregated for 0-3 weeks after the index date, 4-6 weeks after the index date, etc.
The server 102 may also include cost estimate module 608 and duration estimate module 610. The estimate modules may estimate cost and duration separately or in combination. For example, the cost estimate module 608 may estimate total cost of a specific treatment associated with a condition of the subject. In another embodiment, the duration estimate module 610 may estimate duration of a condition after initial diagnosis. In certain embodiments, the cost estimate module 608 and the duration estimate module 610 may estimate an integration of cost and duration, such as a plurality of cost estimates along the estimated duration of a condition.
In one embodiment, the server may also include a calculate module 612 configured to calculate one or more benchmarks for specific services, costs and durations. The benchmarks may be calculated in response to the cost and duration estimates provided by the estimate module 506. The calculate module 612 may calculate benchmarks based on conventional statistical analysis of cost and duration estimate and aggregated records, such as regression analysis, simulation, or the like. After the initial benchmarks are set, the calculate module 612 may have the ability to fine-tune the benchmarks by limiting specific characteristics of the records and generate more restricted benchmarks to reduce fluctuations.
In another embodiment, the server may include a determine module 614 configured to determine trends and ranges of cost and duration, such as future treatment costs and patterns for various workers' compensation injuries, specific for a predetermined profile of a combination of a geographic region, gender, age range, expected future lifetime, and/or a medical code, etc. The determine module 614 may also generate a probability of the determined future treatment pattern or cost in response to the cost and duration estimate derived from the records.
In a further embodiment, the server may also include a graph module 616. The graph module 616 may generate, format, or provide a graphical representation of the outputs, such as the benchmarks, the trends and ranges, the statistics, or a graphic user interface for interactive and real time analysis.
These modules 602-616 may be stand-alone modules implemented in hardware, firmware, or software. Alternatively, the functions may be accomplished through commercial calculation products or spreadsheets, software or SQL instructions that are integrated with the other functions of the server 102. In a specific embodiment, the estimate module 506, including some or all of its component modules 606-610, or the generate module 508, including some or all of its component modules 612-616, may communicate the statistics, estimates, and outputs with the interface module 502 for display or communication to a user. The schematic flow chart diagrams that follow are generally set forth as logical flow chart diagrams. As such, the depicted order and labeled steps are indicative of one embodiment of the presented method. Other steps and methods may be conceived that are equivalent in function, logic, or effect to one or more steps, or portions thereof, of the illustrated method. Additionally, the format and symbols employed are provided to explain the logical steps of the method and are understood not to limit the scope of the method. Although various arrow types and line types may be employed in the flow chart diagrams, they are understood not to limit the scope of the corresponding method. Indeed, some arrows or other connectors may be used to indicate only the logical flow of the method. For instance, an arrow may indicate a waiting or monitoring period of unspecified duration between enumerated steps of the depicted method. Additionally, the order in which a particular method occurs may or may not strictly adhere to the order of the corresponding steps shown.
The method 700 may continue when the estimate module 506 estimates 706 cost and duration in response to the cost and duration information comprised in the group of records. In a further embodiment, the generate module 508 may generate 708 one or more outputs in response to the cost estimate and duration estimate of a condition and its associated service(s). For example, a spreadsheet program may generate an output such as cost distribution associated with a condition progression map comprising one or more progression states, which may be specific for the certain attributes used in search 704. Additional outputs may include ranges and trends for treatment patterns and costs, as well as statistics including averages, probabilities, and other computational products.
Limiting criterion may include windowing values configured to limit or restrict the time frames from which records could be searched, restrictions on minimal enrollment time, minimum number of records, gender restriction, age restrictions, ethnicity restriction, and other similar threshold and limiting values. The filter module 602 may incorporate the limiting criterion into a query used to search 804 the database for the group of records associated with the attributes. For example, the query may search for all records associated with individuals that have been diagnosed with neck injury, but the query may be restricted to return only results associated with individuals that have at least two years of records after injury in the database.
Alternatively, after search 804 the filter module 602 may filter 806 the records for having cost and duration information matching a predetermined temporal criteria, such as cost and duration information with a time span sufficient to analyze a specified condition. The method 800 may continue when the extract module 604 extracts 808 the cost and duration information meeting the predetermined temporal criteria.
In one embodiment, the aggregation module 606 may aggregate 810 records from the search 804 directly or from the extraction 808. The aggregation 810 may be based on a selected attribute, such as a continuous temporal period divided into several condition progression states, wherein cost and duration information can be aggregated for each condition progression state, such as cancer stages. In another embodiment, the aggregation 810 may aggregate 810 records based on types of services, places of services, types of providers or the like. The estimate module 506 may then estimate 812 cost and duration in response to the aggregated information.
The method 800 may end when the generate module 508 generates useful outputs and graphs for the cost and duration specific for the condition(s) as exemplified in steps 814-816. In one embodiment, the calculate module 612 may calculate 814 one or more benchmarks for cost and duration of the specific condition(s) or the associated services(s). The calculation may generate an average, an odds ratio, or other statistics. In a further embodiment, the determine module 614 may determine 816 future trends and costs for the condition(s) and the associated service(s). The graph module 616 may generate graphs for the calculation and determination of cost and duration.
For example, the GUI 2200 contains the user interface which include fields for entering a patient or a claimant's information (“patient information”), which could be used as constraints to find the matched cohorts in accordance with the search module 504. The fields of the patient information may usually include: age, gender, state of residence, and up to two diagnosis codes of a conditions (e.g., a disability) based on ICD-9. In one embodiment, the matched cohorts may consist of patients having the same gender and age range, living in the same region, and diagnosed with a similar disability as determined by the ETG algorithm. These characteristics may be presented under “matched cohorts” corresponding to the fields of “patient information” on the top panel 2202 of the GUI 2200.
In certain embodiments, using the patient information that are entered in the user interface, the matched cohorts' information may be extracted by the extract module 604, and processed in accordance with cost estimate module 608 and duration estimated module 610. The statistics estimates of cost and duration may include mean, median, percentiles (25th percentile and 75 percentile), and standard deviation of cost and duration as presented on the left panel 2204 of the GUI 2200 and illustrated in a line graph 2208. The patient count distribution of diagnoses that make up the matched cohorts may be also calculated and presented in a graph format such as a pie chart 2206. The details driving the mean cost and duration could be summarized and available for print in accordance with the generate module 508 as providing a benchmark for the cost and duration of the treatment of a condition, such as a long term disability. The summary could show the expected total service counts and costs by categories, including the acceptable medical codes.
In accordance with the filter module 602, the cost and duration analysis maybe processed and viewed by results from subsets of the standard matched cohorts to reduce potential large fluctuations between records from different subset groups. Subsets may be determined by the indicators of: recurrence of current diagnosis (“occurrence” field); additional co-morbidity in concurrence with current diagnosis (“co-morbidity” field); inpatient admits in previous year (“admit” field); and outpatient rehabilitation visits in previous year (“rehab” field). These indicators may be chosen via drop-downs (containing Y, N, and All as choices) of these fields in the panel 2210 of “Patient Characteristics”). The choice of the drop-down field determines if those patients with recurring episodes could be included in the selection of the matched cohort, which may be further explained in
Progression of cost and utilization can also be parsed into subsets that may have large fluctuations from the total matched cohort group, for example, by the dropdown list 2304 with episode-related indicators of: hospital admit (‘Admits’), rehabilitation visit (‘Rehab’), prescription (‘Scripts’), and emergency room visit (‘ER’). The first level of drivers of cost and utilization during specific duration increments (e.g., weeks 3 through 5) may be presented as the top 5 places of service, tope 5 types of service, tope 5 provider types, and tope 5 therapeutic classes in panel 2306. The length of increments are dynamic and can be chosen via scroll bar on the top of panel 2306.
Graphical interpretations generated by the graph module 616 in this GUI 2300 may include: line graph of cumulative distribution of cost by duration; high-low graph of cost and utilization at increments of treatment; plot graphs of medical codes (CPT, HCPCS, and/or ICD-9 codes) by duration; pie graphs of distribution of cost in each of the first level drivers, e.g., the top groupings of service categories and/or provider types.
The entry fields for the patient's initial data (“Patient Information” on the GUI 2600) may include the following items (the restrictions listed may not be enforced by the tool, but could help the tool to find a matched cohort group and the selections made could also be reflected on all other tabs with these items):
For Matched Cohort records that are retrieved in accordance with the search module 504, certain characteristics could be displayed on the GUI 2600, including:
In accordance with
In one embodiment, “Treatment Duration” may include:
In a further embodiment, “Cohort Characteristics” may include:
In certain embodiments, the GUI 2600 may further display output in accordance with the generate module 508, such as calculated benchmarks and ranges and trends of cost and duration. For example, the GUI 2600 shows Summary graphs such as a “Billed Range Graph,” a “Days Range Graph,” a “Billed PMPM graph” (Billed per member per month graph), and a “Billed per Episode by Condition Month Graph.”
In one embodiment, buttons may be used to execute a certain function or the like. In the GUI 2600, the Reset button may clear all of the entry fields on all tabs when clicked. the Co-morbid Diagnosis button could populate the Co-Morbid Diagnosis dropdown list based on the Diagnosis Code entered. The Cohort Match button could populate the Matched Cohort, Treatment Billed/Duration, and Cohort Characteristics fields, as well as the Summary graphs, based on the information in the entry fields.
In another embodiment, the GUI 2600 may contain the Condition Summary Printout link that could take the user to the Condition Summary tab in the GUI 2700 when clicked. The Patient Characteristics link could take the user to the Patient Characteristics tab in the GUI 2800.
The selection fields for the Condition Summary tab include the following items: rank, patient characteristics, episode characteristics, condition summary, detailed grid, buttons, and links.
The Rank radio buttons of the GUI 2700 may include Episode Count, Service Count, and Billed per Service as choices. The selection determines the descending sort order, within Condition Month and data type (medical or drug), of the items in the grid. When the Rank is changed on this tab, it could also be changed on the other tabs containing the Rank radio buttons.
The Patient Characteristics are in the top line of dropdown lists of the GUI 2700, including the following items:
The Episode Characteristics are in the second line of dropdown lists of the GUI 2700, including:
The detailed grid in the GUI 2700 is populated based on the entries on the Cost and Duration tab and the selections made on this tab. The top 5 in rank for each condition month and data type (medical or drug) is listed in the grid, followed by an “Other” line for anything not in the top 5 in rank. The rows with a white background contain medical data and the rows with a gray background contain drug data.
For the GUI 2700, some buttons and links may also be included. The Reset button could clear all of the entry fields on all tabs when clicked. The Condition button could populate the detailed grid on the tab based on the selections made. The Print button could send the Condition Summary and detailed grid to the user's default printer. The Summary link could take the user to the Cost and Duration tab in the GUI 2600 when clicked.
The Patient Characteristics of the GUI 2800, including the following fields: Recurrence, Admits, and Rehab as described in the GUI 2700. The Treatment grid 2802 is populated when the Similar Patients button is clicked, and it shows the statistics with the further restrictions of the selected patient characteristics. Queries and calculations are handled on the Patient Data tab.
The GUI 2800 may also include summary graphs such as Billed Range Graph, Days Range Graph, Billed PMPM Graph, Billed per Episode by Condition Month Graph, Episode Count by Condition Month Graph, which include both the base data from the Cost and Duration tab and data showing the further restrictions of the selected patient characteristics.
The GUI 2900 include entry filed of Patient Characteristics and Episode Characteristics as described above. More items of the Treatment Progression tab may include:
The Cost and Utilization Driver tab include several previously described fields such as Rank, Patient Characteristics, and Episode Characteristics, and more fields including:
The Top 10 Codes section of the GUI 3400 includes the following five categories, with a dropdown list and episode information for each. The lists are populated when the Injury Profile button is clicked, and are each sorted in descending order of the rank selected. The episode count and amount per episode could change as the selection in the dropdown list above it is changed. The DRG dropdown list contains all DRG codes that were included in the claims for the current matched cohort. The episode count and amount per episode are displayed beneath the selected DRG code. The Revenue CD dropdown list contains all Revenue codes that were included in the claims for the current matched cohort. The episode count and amount per episode are displayed beneath the selected Revenue code. The ICD9 dropdown list contains all ICD9 codes that were included in the claims for the current matched cohort. The episode count and amount per episode are displayed beneath the selected ICD9 code. The CPT dropdown list contains all CPT codes that were included in the claims for the current matched cohort. The episode count and amount per episode are displayed beneath the selected CPT code. The NDC dropdown list contains all NDC codes that were included in the claims for the current matched cohort. The episode count and amount per episode are displayed beneath the selected NDC code.
The Patient Selector section of the GUI 3400 includes the following checkboxes. The ‘Patient Must Incur ALL Checked Codes’ checkbox controls whether the selected codes for the selected code types could be added to the query with ‘and’ or ‘or’ between them. If the checkbox is unchecked, the default ‘or’ could be used and a claim could be included for the matched cohort if it contains any of the codes selected; if it is checked, ‘and’ could be used and a claim could be included only if it contains all of the codes selected. This could only apply if more than one code type is selected. For example, if the DRG checkbox is selected, the selected DRG code could be used to filter the matched cohort. If there is no DRG code, it could not affect the query.
The first grid in the Patient Selector section of the GUI 3400 includes the following four categories, with a dropdown list and episode information for each. The lists are populated when the Select Patients button is clicked, and are each sorted in descending order of the rank selected. The episode count and billed per service amount could change as the selection in the dropdown list above it is changed. The Inpatient dropdown list contains the inpatient provider categories from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected inpatient category. The Outpatient dropdown list contains the outpatient provider categories from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected outpatient category. The Physician dropdown list contains the physician provider categories from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected physician category. The Pharmacy dropdown list contains the pharmacy provider categories from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected pharmacy category.
The second grid in the Patient Selector section of the GUI 3400 includes the following four categories, with a dropdown list and episode information for each. The lists are populated when the Select Patients button is clicked, and are each sorted in descending order of the rank selected. The episode count and billed per service amount could change as the selection in the dropdown list above it is changed. The Place of Service dropdown list contains the place of service categories from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected place of service category. The Service Type dropdown list contains the service type categories from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected service type category. The Provider Type dropdown list contains the provider type categories from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected provider type category. The NDC AHFS Rollup dropdown list contains the NDC AHFS rollups from the claims included in the matched cohort. The episode count and amount per episode are displayed beneath the selected NDC AHFS rollup category.
All of the methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the system and methods of this invention have been described in terms of preferred embodiments, it could be apparent to those of skill in the art that variations may be applied to the methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the invention. In addition, modifications may be made to the disclosed system and components may be eliminated or substituted for the components described herein where the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope, and concept of the invention as defined by the appended claims.
This application claims priority to U.S. Provisional Application No. 61/258,969 filed Nov. 6, 2009, the entire contents of which is specifically incorporated herein by reference without disclaimer. This application is related to U.S. application Ser. No. 12/605,697 filed on Oct. 26, 2009 and U.S. application Ser. No. 12/562,608 filed on Sep. 18, 2009, the entire disclosures of which are specifically incorporated herein by reference in their entirety without disclaimer.
Number | Date | Country | |
---|---|---|---|
61258969 | Nov 2009 | US |