Clinical trials are conducted to validate the efficacy and uncover toxicity of a healthcare product, such as a pharmaceutical product or a medical device.
In one aspect, some implementations provide a computer-implemented method to model a clinical trial, the method including: receiving data encoding parameters defining a study plan of the clinical trial with more than one participant clinical sites, the parameters including a total enrollment target of human subjects as well as targeted milestone dates of the study plan, the study plan including more than one cohorts, and the clinical sites having varying capabilities in administering each cohort; adding a first cohort to the study plan, the first cohort characterized by a first set of attributes commonly possessed by a first group of human subjects; specifying the first set of attributes as well as a corresponding target enrollment number of the first group of human patients to define the first cohort; adding a second cohort to the study plan, the second cohort characterized by a second set of attributes commonly possessed by a second group of human subjects, the second set of attributes different from the first set of attributes; specifying the second set of attributes as well as a corresponding target enrollment number of the second group of human patients to define the second cohort, the second group of human subjects having no overlap with the first group of human subjects; subsequent to onset of the clinical trial, receiving, from data servers at the clinical sites, information encoding attributes of participant human subjects at each of the clinical sites, the information devoid of identifier information capable of identifying an individual human subject; parsing the received information to map the participant human subjects to a particular cohort of the study plan; in response to receiving, from the data servers at the clinical sites, update information encoding attributes of the participant human subjects at each of the clinical sites, longitudinally tracking, by a processor, the participant human subjects as mapped to corresponding cohorts of the study plan as the clinical trial progresses at the clinical sites; and providing analytics of the cohorts of the study plan based on the longitudinal tracking
Implementations may include one or more of the following features. Providing analytics of the cohorts of the study plan may include: analyzing the mapped human subjects for each cohort to generate actual enrollment statistics for the cohorts at the clinical sites; and presenting the actual enrollment statistics for each cohort of the study plan.
Analyzing the mapped human subjects may further include: generating longitudinal actual enrollment statistics for each cohort from the corresponding clinical sites involved in conducting the particular cohort of the study plan during progression of the clinical trial. The method may further include: projecting enrollment statistics for each cohort of the study plan based on the generated longitudinal actual enrollment statistics from the corresponding clinical sites; and presenting the projected enrollment statistics for each cohort of the study plan. Analyzing the mapped human subjects for each cohort may further include: generating actual enrollment statistics from the clinical sites that span across more than one country and are involved in conducting the particular cohort of the study plan based on the generated longitudinal actual enrollment statistics from the clinical sites. The method may further include: aggregating the generated actual enrollment statistics from the clinical sites involved in conducting the particular cohort of the study plan in each country; and presenting the aggregated actual enrollment statistics for cohorts of the study plan on a country-by-country basis.
The method may further include: presenting a progress indication for each cohort based on the generated longitudinal actual enrollment statistics for the particular cohort as well as the targeted milestone dates for the study plan.
The method may further include: generating actual enrollment statistics from all participating cohorts at a particular clinical site; and presenting a progress indication for each participating cohort at the particular clinical site.
Providing analytics of the cohorts of the study plan may further include: generating progression statistics for each cohort of the study plan being conducted at the clinical sites of the particular cohort. Generating the progression statistics for each cohort of the study plan may include: generating summary statistics on initiating the study plan at the corresponding clinical sites for the particular cohort as measured against the targeted milestone dates of the study plan.
Generating the progression statistics for each cohort of the study plan may include: generating summary statistics on screening human subjects at the corresponding clinical sites of the particular cohort. Generating the progression statistics for each cohort of the study plan may include: generating summary statistics on enrolling human subjects at the corresponding clinical sites for the particular cohort as measured against the total enrollment target of human subjects for the study plan.
Parsing the received information may further include: extracting at least one attribute of each participant human patient; and linking each participant human subject to a particular cohort of the study plan in accordance with the extracted at least one attribute of the participant human subject. Parsing the received information may further include: conducting an Extract, Transform, and Load (ETL) operation on the received information to map participant human subjects to corresponding cohorts of the study plan. Linking each participant human subject to a particular cohort of the study plan may further include: longitudinally linking the participant human subject at least in part based on a matching unique patient identifier of the human subject that does not reveal the human subject's identity. Linking each participant human subject to a particular cohort of the study plan may further include: longitudinally linking the participant human subject as the human subject is seen at more than one clinical site during the clinical trial.
The method may further include: adding a third cohort to the study plan, the third cohort characterized by a third set of attributes commonly possessed by a third group of human subjects, the third set of attributes different from the first set and the second set of attributes. The method may additionally include: specifying the third set of attributes as well as a corresponding target enrollment number of the third group of human patients to define the third cohort, the third group of human subjects having no overlap with the first group and the second group of human subjects.
In another aspect, some implementations provide a computer system comprising one or more processors, configured to perform the operations of: receiving data encoding parameters defining a study plan of the clinical trial to be conducted at more than one clinical sites, the parameters including a total enrollment target of human subjects as well as targeted milestone dates of the study plan, the study plan includes more than one cohorts, and the clinical sites having varying capabilities in handling human subjects for each cohort; adding a first cohort to the study plan, the first cohort characterized by a first set of attributes commonly possessed by a first group of human subjects; defining the first cohort by specifying the first set of attributes as well as a corresponding target enrollment number of the first group of human patients; adding a second cohort to the study plan, the second cohort characterized by a second set of attributes commonly possessed by a second group of human subjects, the second set of attributes different from the first set of attributes; defining the second cohort by specifying the second set of attributes as well as a corresponding target enrollment number of the second group of human patients, the second group of human subjects having no overlap with the first group of human subjects; subsequent to onset of the clinical trial, receiving, from data servers at the clinical sites, information encoding attributes of participant human subjects at each of the clinical sites, the information devoid of identifier information capable of identifying an individual patient; parsing the received information to map the participant human subjects to a particular cohort of the study plan; in response to receiving update information encoding attributes of participant human subjects, longitudinally tracking the participant human subjects as mapped to corresponding cohorts of the study plan as the clinical trial progresses at the clinical sites; and providing analytics of the cohorts of the study plan based on the tracking
In yet another aspect, some implementations provide a computer-readable medium, comprising software instructions, which when executed by a processor of a computer, causes the computer to perform the operations of: receiving data encoding parameters defining a study plan of the clinical trial to be conducted at more than one clinical sites, the parameters including a total enrollment target of human subjects as well as targeted milestone dates of the study plan, the study plan includes more than one cohorts, and the clinical sites having varying capabilities in handling human subjects for each cohort; adding a first cohort to the study plan, the first cohort characterized by a first set of attributes commonly possessed by a first group of human subjects; defining the first cohort by specifying the first set of attributes as well as a corresponding target enrollment number of the first group of human patients; adding a second cohort to the study plan, the second cohort characterized by a second set of attributes commonly possessed by a second group of human subjects, the second set of attributes different from the first set of attributes; defining the second cohort by specifying the second set of attributes as well as a corresponding target enrollment number of the second group of human patients, the second group of human subjects having no overlap with the first group of human subjects; subsequent to onset of the clinical trial, receiving, from data servers at the clinical sites, information encoding attributes of participant human subjects at each of the clinical sites, the information devoid of identifier information capable of identifying an individual patient; parsing the received information to map the participant human subjects to a particular cohort of the study plan; in response to receiving update information encoding attributes of participant human subjects, longitudinally tracking the participant human subjects as mapped to corresponding cohorts of the study plan as the clinical trial progresses at the clinical sites; and providing analytics of the cohorts of the study plan based on the tracking
The details of one or more aspects of the subject matter described in this specification are set forth in the accompanying drawings and the description below. Other features, aspects, and advantages of the subject matter will become apparent from the description, the drawings, and the claims.
Like reference symbols in the various drawings indicate like elements.
This disclosure generally describes a system and method for providing data analytics from cohort trials including multiple cohort groups of human subjects participating in clinical trial. Each cohort group of human subjects are linked based on comparable characteristics and are followed over time. Each cohort group may be exposed to a particular stimulus variable, such as a particular pharmaceutical product at a given dose range. The cohort trial study may include multiple cohort groups. These groups may be distributed to various facilities during the trial. To facilitate management of the trial study being conducted, human patient data may be gathered at each facility and then reported to a central facility, for example, on a daily basis. When the human patient data is gathered at each facility, such data is de-identified to maintain confidentiality. In other words, the de-identified data is void of information capable of identifying the actual patient or human subject. After such de-identified data is reported to a central facility, the de-identified data for a particular human patient can be longitudinally tracked. In other words, de-identified data for the same patient or human subject may be linked to generate longitudinal data over time. Thereafter, statistical ensemble analysis may be conducted to reveal summary data regarding patient enrollment at each participating facility, patient enrollment for each cohort group, as well as projection enrollment analysis. The ensemble analysis may track de-identified patients as they move from one facility to another. The ensemble analysis may reveal trend in patient recruitment or drop-out at a particular facility. The ensemble analysis may also aggregate patient data for a particular cohort group distributed among various facilities. The ensemble analysis may provide real-time patient enrollment status for a particular cohort group at each country. The projection analysis may factor in historical performance at a particular facilities, for a study of a particular disease. The projection analysis may generate statistical confidence indication for inter-group comparisons. The projection analysis may also indicate estimated time to completion for a particular comparison, for example, between two cohort groups.
Data for each participant patient may be recorded at a participating site. The data may include regular measurement data including, for example, urine sample data, blood sample data, blood pressure data, respiratory data, or heart rate data. The data may be electronic. For example, blood sample data may include blood glucose level, triglyceride level, low-density lipoprotein (LDL) level, high-density lipoprotein (HDL) level. In some instances, such electronic data may correspond to lab results from a contracting diagnostic laboratory, such as a contracting research organization (CRO). The lab results may not be limited to chemical analysis results. The lab results may also include image analysis results based on, for example, diagnostic images.
Data for each participant patient, as recorded at the participant site, is de-identified such that the data does not include information capable of identifying the particular participant patient. Examples of such identifying information include: patient's name, patient's insurance identification number, patient's Medicare/Medicaid identification number, patient's social security number, patient driver's license number, etc. In some implementations, such identifying information may be converted by a one-way hash-function to generate an alpha-numerical string. The alpha-numerical string conceals the identity of the individual participant patient, thereby maintaining confidentiality of the data as the data is being reported, for example, daily from the sites 104A to 104G to the central server 102. There, data corresponding to the same participant patient may be linked by virtue of the matching alpha-numerical string. Thus, data for the same participant patient may be longitudinally tracked as the clinical trial unfolds, without compromising confidentiality of the individual patients.
In some implementations, however, the de-identified data may be further encrypted before the data is reported to central server 102 to update database 112. For illustration, data 114A-114G may be encrypted using a symmetric encryption key specific to each participant site. The symmetric encryption key may only be known to the particular participant site and central server 102. Thus, only the participant site can encrypt the de-identified data with the symmetric key and only the central server 102 can decrypt the encrypted de-identified data with the particular symmetric key. In another illustration, a public-key infrastructure (PKI) may be used such that the reported data may be encrypted with the public key of the central server 102 so that only the central server 102 can decrypt using its private key. In other illustrations, the central server 102 and participant sites 104A-104G may exchange messages using the PKI to establish an agreed-on symmetric key.
Returning to
Referring to
Returning to
As illustrated in
Referring to
The longitudinal actual enrollment statistics from the corresponding clinical sites may also be received from multiple sites involved in one cohort that span more than one country (228). Thereafter, the actual enrollment statistics data for the particular cohort may be aggregated at the country level to reveal actual enrollment statistics from a particular country in real-time (230). The country-specific actual enrollment statistics may be further aggregated to reveal enrollment statistics at the regional level, such as the continent level.
As illustrated in
Some implementations may provide backward compatibility for visualizing data reported from facilities without capabilities in handling cohorts as well as data from facilities capable of handling such cohorts.
Some implementations disclosed herein allow participant patient data from a particular cohort group that involves multiple participant sites to be tracked in real-time. Some implementations disclosed herein allow participant patient data from multiple cohort groups at one given particular site to be tracked in real-time. Some implementations disclosed herein allow trending analysis based on the tracked data. Some implementations may blend trending analysis with historical performance at comparable time points in a study to project estimated time to completion or predict needs for further recruitment, to cope with, for example, patient drop-out or screening failure.
Implementations of the subject matter and the functional operations described in this specification can be implemented in digital electronic circuitry, in tangibly-implemented computer software or firmware, in computer hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Implementations of the subject matter described in this specification can be implemented as one or more computer programs, i.e., one or more modules of computer program instructions encoded on a tangible non-transitory program carrier for execution by, or to control the operation of, data processing apparatus. The computer storage medium can be a machine-readable storage device, a machine-readable storage substrate, a random or serial access memory device, or a combination of one or more of them.
The term “data processing apparatus” refers to data processing hardware and encompasses all kinds of apparatus, devices, and machines for processing data, including, by way of example, a programmable processor, a computer, or multiple processors or computers. The apparatus can also be or further include special purpose logic circuitry, e.g., a central processing unit (CPU), a FPGA (field programmable gate array), or an ASIC (application-specific integrated circuit). In some implementations, the data processing apparatus and/or special purpose logic circuitry may be hardware-based and/or software-based. The apparatus can optionally include code that creates an execution environment for computer programs, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them. The present disclosure contemplates the use of data processing apparatuses with or without conventional operating systems, for example Linux, UNIX, Windows, Mac OS, Android, iOS or any other suitable conventional operating system.
A computer program, which may also be referred to or described as a program, software, a software application, a module, a software module, a script, or code, can be written in any form of programming language, including compiled or interpreted languages, or declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data, e.g., one or more scripts stored in a markup language document, in a single file dedicated to the program in question, or in multiple coordinated files, e.g., files that store one or more modules, sub-programs, or portions of code. A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network. While portions of the programs illustrated in the various figures are shown as individual modules that implement the various features and functionality through various objects, methods, or other processes, the programs may instead include a number of sub-modules, third party services, components, libraries, and such, as appropriate. Conversely, the features and functionality of various components can be combined into single components as appropriate.
The processes and logic flows described in this specification can be performed by one or more programmable computers executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., a central processing unit (CPU), a FPGA (field programmable gate array), or an ASIC (application-specific integrated circuit).
Computers suitable for the execution of a computer program include, by way of example, can be based on general or special purpose microprocessors or both, or any other kind of central processing unit. Generally, a central processing unit will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a central processing unit for performing or executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. However, a computer need not have such devices. Moreover, a computer can be embedded in another device, e.g., a mobile telephone, a personal digital assistant (PDA), a mobile audio or video player, a game console, a Global Positioning System (GPS) receiver, or a portable storage device, e.g., a universal serial bus (USB) flash drive, to name just a few.
Computer-readable media (transitory or non-transitory, as appropriate) suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The memory may store various objects or data, including caches, classes, frameworks, applications, backup data, jobs, web pages, web page templates, database tables, repositories storing business and/or dynamic information, and any other appropriate information including any parameters, variables, algorithms, instructions, rules, constraints, or references thereto. Additionally, the memory may include any other appropriate data, such as logs, policies, security or access data, reporting files, as well as others. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
To provide for interaction with a user, implementations of the subject matter described in this specification can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube), LCD (liquid crystal display), or plasma monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input. In addition, a computer can interact with a user by sending documents to and receiving documents from a device that is used by the user; for example, by sending web pages to a web browser on a user's client device in response to requests received from the web browser.
The term “graphical user interface,” or GUI, may be used in the singular or the plural to describe one or more graphical user interfaces and each of the displays of a particular graphical user interface. Therefore, a GUI may represent any graphical user interface, including but not limited to, a web browser, a touch screen, or a command line interface (CLI) that processes information and efficiently presents the information results to the user. In general, a GUI may include a plurality of user interface (UI) elements, some or all associated with a web browser, such as interactive fields, pull-down lists, and buttons operable by the business suite user. These and other UI elements may be related to or represent the functions of the web browser.
Implementations of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (LAN), a wide area network (WAN), e.g., the Internet, and a wireless local area network (WLAN).
The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any invention or on the scope of what may be claimed, but rather as descriptions of features that may be specific to particular implementations of particular inventions. Certain features that are described in this specification in the context of separate implementations can also be implemented in combination in a single implementation. Conversely, various features that are described in the context of a single implementation can also be implemented in multiple implementations separately or in any suitable sub-combination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a sub-combination or variation of a sub-combinations.
Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be helpful. Moreover, the separation of various system modules and components in the implementations described above should not be understood as requiring such separation in all implementations, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.
Particular implementations of the subject matter have been described. Other implementations, alterations, and permutations of the described implementations are within the scope of the following claims as will be apparent to those skilled in the art. For example, the actions recited in the claims can be performed in a different order and still achieve desirable results.
Accordingly, the above description of example implementations does not define or constrain this disclosure. Other changes, substitutions, and alterations are also possible without departing from the spirit and scope of this disclosure.