This application claims priority from provisional application 61/940,727, filed 17 Feb. 2014.
Research and Development (R&D) are investigative activities that a business or other organizations conduct with the intention of making discoveries that can either lead to the development of new products or procedures, or to improvement of existing products or procedures. R&D may proceed in linear or non-linear manner and typically involve several steps over long periods of time.
Every field of industry engages in extensive efforts of Research and Development for New Product Development. In many industries, such R&D may last for years or decades and costs may reach or exceed the multi-billion dollar range (as for example in Pharmaceutical development, Defense and other fields of application). A major problem in managing such R&D is that of optimally allocating resources to competing R&D activities since it is not generally known which research activities are most likely to “convert” to scientific-technological results that facilitate new products. Another problem is to accelerate the successful R&D efforts and eliminate the unsuccessful ones as early as possible.
For example in the Life Sciences, the process of “Translational Research” describes the research activities that eventually lead to practical applied innovations such as new diagnostic technologies/products, new drugs, improvements in the guidelines that determine the standard of care etc. Both private industry (e.g., Pharmaceutical companies) and the public sector (e.g., Federal Funding agencies such as the NIH) are faced with the pressing problem of allocating limited resources to a small number of efforts out of many candidate R&D initiatives. In many cases, one has to decide which R&D programs that have yielded partial results should be prioritized over other incomplete or yet-to-begin ones. In addition, since the time-to-market directly affects profitability (e.g., at the tune of>1 billion USD/year for “blockbuster” drugs), it is highly desirable to accelerate the R&D that is likely to be successful and eliminate the R&D that is likely to be unsuccessful as early as possible.
The same considerations are true for all industries where R&D plays a significant role in New Product Development (NPD). Examples include: electronics, telecommunications, computer and information technology, defense, aeronautics, aviation and aerospace, Internet commerce, financing and investing, energy, automotive and transportation, marketing and advertising to name a few.
The present invention provides a method, process and apparatus for:
Users of the invention may use it for:
Table 1 lists example input features for Model Training in the Translational Research Field of Application. These features can either be content-based or meta-data (e.g., bibliometric) features. Content features are based on document content such as the title or abstract. Bibliometric features are information based on the authors, publication, or other metadata.
Table 2 lists the top 10 important features for two use cases with different training corpora in Translational Research Field of Application.
The invention method comprises 3 stages, which are implemented in the system described and claimed.
Creating this Knowledge Base involves the following elements:
For example, in the domain of life sciences R&D, an appropriate unit of prediction may be the stage of research toward a new drug as evidenced by development and publication of basic science or clinical findings. The unit of prediction will typically be a complex relationship of objects; for example in drug development it can be the usefulness, applicability or potential of a particular molecule for a safe and efficacious new drug.
that constitute or represent archetypes or milestones of success of the R&D process. In the new drug development example, these may be clinical trials that prove the improved efficacy or safety of a new drug over the best drugs currently in market.
In the drug development example, such a network can be a citation graph among articles, websites and patents that indicate how various molecules, pathways, assaying technologies etc. gradually support the development of a new drug. The nature of influences in the Dependency Network may vary dramatically among distinct fields of application and needs be tailored accordingly. Appropriate networks include citation influences in a citation network of articles or web pages, causal relationships in a causal graph, information transfer relationships in an information network, resource input relationships, or any other appropriate network representation of how stages of R&D influence and depend on one another.
Creating this model and decision support system involves the following elements:
The model can now be used to assess retroactively (i.e., “historically”) the impact of a stage of R&D to successful endpoints by using standard graph algorithms for determining all paths from a stage or stages of interest to one or more success exemplars of interest. Existence of one or more paths is direct evidence for the impact of a stage of R&D to the success of the overall effort, lack thereof is evidence for lack of impact. Other ways to describe and infer macro properties of the R&D process modelled by the graph model and identify critical components include a variety of standard Network Science analytics tools (e.g., clustering coefficient, hubs, percent shortest path, characteristic path length, Betweeness Centrality, clusters etc.)
Creating this model and decision support system involves the following elements:
While construction of Markov Process models follows procedures in Decision Analysis, Operations Research and Applied Mathematics that are related to those of the prior art, the construction of predictive models uses established principles of predictive modeling highly customized for the purposes of the invention.
The steps followed include:
The provided technical report (attached hereto as Appendix 1, and incorporated herein by reference) provides details of the method as applied to the specific field of application of R&D for the Life Sciences (also commonly labeled as “Translational Research”). It demonstrates empirically that the invention leads to accurate predictions and in depth understanding of R&D process in a real-life complex domain (that of translational biomedical research leading to new drug development).
Differences from General-Purpose Text Categorization and Classification Methods
The invention categorizes not the internal content or other de-contextualized properties of a single stage in the R&D process but a specific type of complex relationship of a single stage with the set of R&D successes. That is what is classified and predicted is the future relationship of a stage of the R&D with yet-to-be realized (possible) endpoints of R&D process, directly or through other R&D stages.
These endpoint exemplars are NOT training exemplars for predictive modeling but need to be coupled with the dependence network that tracks paths from any stage of interest to the endpoint exemplars.
These include specialized processing methods for trimming the dependency network from false positive links; specialized filtering procedures for restricting the space of all stages to stages that are most relevant to the R&D success prediction task; and a multi-level modeling approach whereby the overall transition from initiation of R&D to success or failure endpoints is modeled via a Markov Process and transition probabilities are provided by predictive modeling.
While the invention has been described in its preferred embodiments, it is to be understood that the words which have been used are words of description rather than of limitation and that changes may be made within the purview of the appended claims without departing from the true scope and spirit of the invention in its broader aspects. Rather, various modifications may he made in the details within the scope and range of equivalents of the claims and without departing from the spirit of the invention. The inventors further require that the scope accorded their claims be in accordance with the broadest possible construction available under the law as it exists on the date of filing hereof (and of the application from which this application obtains priority, if any) and that no narrowing of the scope of the appended claims be allowed due to subsequent changes in the law, as such a narrowing would constitute an ex post facto adjudication, and a taking without due process or just compensation.