The present invention refers to a system and a method for performing functional analyses making use of a plurality of inputs.
The number of commercially-usable technologies in technical and end user product fields is growing. Despite a specialization of technologies, efforts are also made to spread technological solutions out of their original area, leading to a much higher number of practical applications in new products and processes.
The opportunity of accessing economical and strategic advantages through patents is often delayed and sometimes reduced by difficulties related both to the number of existing patents to be analyzed and to the formal language used to describe the invention.
Every day, many and many thinkers, inventor and companies waste time trying to evaluate the level of innovation, the feasibility to patent an invention and the originality of contents.
For instance, it is known that for about 50 years well-proven theories (TRIZ by Altshuller G., “Creativity as an Exact Science: The Theory of the Solution of Inventive Problems”, Gordon & Breach Science pub., 1984) have been issued to compare and evaluate levels of different inventions. These theories were based on human labor in their application and a great amount of this labor is dedicated to less intensive and valuable applications of the human brain, such as reading and building a mental image of relations between components of invention subject, instead of understanding concepts and the ontology of the invention.
Several software packages do exist that represent technical or technological contents based on the functional representation an engineering subset of mathematical theories of graph representation. Specialized tools have been proven really efficient to speed up the comprehension of concepts, but though with these aids, a lot of time is required to build functional representations of inventions.
The functional analysis is a powerful tool for designers who do not want to introduce only slight optimizations in an existing mechanical system, but also significant innovations in order to build really new products. The functional description of a product is a description at an abstract level, so that different design possibilities can be explored, at a conceptual design stage, by developing functional variants. Moreover, the functional analysis helps the designers in following a systematic approach also in the study of complex systems, by breaking up functions into simpler sub-functions and subdividing the problem into more manageable parts. Finally, the functional analysis can play an important role also in patent-breaking activities: a careful study of a patent from a functional point of view leads to the identification of ambiguous definitions such that it is possible with unremarkable changes to overcome the patent links. Conversely, when writing a new patent, a text-based functional analysis is an effective test of the suitability of the work done.
Functional modeling is often used also at a detailed design stage: following the Suh approach (Suh N. P., “The Principles of Design”, Oxford Press, 1990), the function is considered as the desired output and the design is decomposed into ;functional requirements which are mapped directly with the design parameters at any abstraction level.
Since products are defined and should be designed by the function they have to perform, several authors have classified functions for mechanical design in order to associate subsystems (assemblies of physical parts or components) and sub-functions such that the designer is supplied with a set of modular units to develop the final product. A comprehensive description of those works is given by Kirschman (Kirschman C. F. et al, “Classifying Functions for Mechanical Design”, Journal of Mechanical Design (ASME), Vol. 120, September 1998).
Several works have been published to propose comprehensive representations of functions which represent the different aspects of the designers' intention, that is a crucial issue for developing computer-aided conceptual design systems; aim of these works is defining effective ways to represent also the relationships among the functions, i.e. decomposed-into, conditioned-by, enhanced-by and described-as relations (Shimomura et al., “Representation of Design Object Based on the Functional Evolution Process Mode”, Journal of Mechanical Design (ASME), Vol. 120, June 1998).
In this context, the inventors have been developing tools, systems and methods to help the designer in performing functional analyses making use of several kinds of inputs. According to the definition of product architecture given by Ulrich (Ulrich, K. T., “The Role of a Product Architecture in the Manufacturing Firm”, Research Policy, vol. 24, 1995), that is the way in which the designer relates functional elements to physical components, much research in the area has been directed toward the process of mapping functions to components. The definition of some general criteria to map functions to the assembly features of a CAD system is understandably useful; this task has been approached also by Line et al. (Line J. K. et al, “Calculation of Product Architecture Metrics Within a Solid Modeler”, accepted for publication on Proceedings of the Institution of Mechanical Engineers Part B—Journal of Engineering Manufacture, 2001) even if with a completely different strategy.
Nowadays it is possible to use several types of techniques and technologies to analyze digital text and produce a facilitated way to access and read the contents. Especially for scientific and patents digital files, these technologies demonstrate their value, but a lot of work has still to be done in allowing fast and efficient comparison between similar document, identifying novelty in the publication or order publication according to novelty level.
Despite enormous efforts in the conception and development of software for automatic analysis of digital documents, both information extraction and information representation are usually from the “same field” of implemented technologies/techniques:
In such a background it is a logical consequence that automatic text analyses lead to categorization of concepts via taxonomy or similar text-based techniques and that well structured data such as programming languages or databases can be represented via graph theory representations; these representations both are natural in their field and indicate real advantages in evolving a further step in these field, hybridizing text- and graph-based analysis of digital documents with text and graph representation of their contents.
A computer-based software system and a method according to the principles of the present invention solve the problems described in the previous paragraph due to the ability to perform automatic analyses and comparisons of patents and technical descriptions of engineering systems (as technical reports, usage and maintenance handbooks, overhaul manuals etc.), capable to translate the text of the invention into a block diagram whose graphical representation is made according to the main market standards, i.e. functional models, IDEF models etc.
In fact, the proposed system, according to the principles of the present invention, is capable to recognize the components of the system described in the analyzed document, even if they are referenced with several nomenclatures. Further, the hierarchy of the recognized components, in terms of detail/abstraction level, can be identified and a classification in categories like “assembly”, “part”, “portion” can be performed. Furthermore the system is capable to analyze the functional interactions among the components and to identify the secondary products and the main product of the analyzed system.
The above and other objects and advantages of the invention, as will appear from the following description, are obtained by a as method and a system as claimed in claims 1 and 15, respectively. Preferred embodiments and non-trivial variations of the present invention are claimed in the dependent Claims.
These tasks are performed by the Text Analyzer Modules that interact with each other and make use of the content of a customizable database of stop words and analogue words as described below. The results of this analysis can be augmented making use of a semantic processor, as the one described in U.S. Pat. No. 6,167,370, capable to analyze an electronic document in order to extract from each sentence the Subject, the Action and the Object. The results of the analysis of the Text Analyzer Module can be supplied to the user in different ways, according to the functionalities of the Post Processing Module of the present invention.
(1) The first output mode consists in supplying the user with a comprehensive graphical representation of the analyzed system, that allows a quick overview of the system's components, their hierarchy classification and their functional interactions.
(2) A comparison between the results of different analyses can be executed according to a set of quantitative means capable to evaluate the relevance, the complexity and the level of innovation (according to the Altshuller definition) of the examined systems and (optionally) to follow their evolution pattern.
(3) The functions performed by or accomplished on a component in all the examined systems are stored in a database so that all possible functional uses of that component can be easily retrieved.
(4) The components capable of performing a given function, according to the results of the accomplished analyses, are stored in a database, in order to have a tool for an easy retrieval of technical solutions to perform a given task.
The databases described in (3) and (4) of the previous list can be easily integrated with CAD systems and conceptual design software tools.
The present invention will be better described by some preferred embodiments thereof, given as a non-limiting example, with reference to the enclosed drawings, in which:
One exemplary embodiment of the present invention, as depicted in
A user can input the text to be analysed by typing it directly with the keyboard 12 or with other computer devices (i.e. scanner 14 and OCR system, microphone 18 and speech recognition software etc.), or by selecting the documents on local storage means, LANs or on the Web.
According to the scheme of
The text to be analysed, entered by the user, is stored into the Temporary Storage Database 20. Such text is processed by the Text Analyser Module 30, making use of the Database of Stop Words and Analogue Words 40 and (optionally) of a commercially available semantic processor (external to the system described in the present invention), according to the methodology described in the next paragraphs.
The results of the text analysis are stored in the Database of the Extracted Information 50.
The Post Processing Module 60 supplies the user with the information contained in the Database 50 organized in several formats as described in the following paragraphs, for an effective and rapid usage of the analysis results. The output can be sent to any user device, to a local storage database or to the Web.
The Components Recognition Module 31 allows the identification of all system components described in the examined text (i.e., for a patent, the components of the invention).
This task can be performed following two different techniques: keywords search and semantic search (
The keyword search (
The following procedure is adopted:
The steps 4) and 5) can be properly inverted without any changes in the results.
The semantic search (
The Component Recognition task can be performed following alternatively or together the keyword search or the semantic search approaches. Combining these two techniques, it is possible also to identify supersystems: Tools and Artifacts not belonging to the reference numbered components of the examined system, clearly are external to the system itself.
The product of the Component Recognition Module is a list of names, representative of the conceptual components of the examined system; a reference number (the same reference number used in the text to describe the system) is associated with each component name as well as a list of the different denominations used in the system description to identify that component (synonyms, pronouns, adjectives that can be lost due to the intersection at step 5) of the keyword search approach, if sometimes they have been omitted).
The results of the Components Recognition Module 31 are stored in the Database of the Extracted Information 50.
The Components Classification Module 32 orders and classifies the components identified by the Components Recognition Module 31 according to three main criteria: detail/abstraction level; main product and secondary products of the system; role in the assembly. An exemplary embodiment of the Components Classification Module 32 according to the principles of the present inventions is shown in
A schematic representation of the method for classifying the components of the systems according to the principles of the present invention is shown in
The procedure consists in the following steps:
This search is performed taking into account also all the alternative denominations for each system component stored by the Components Recognition module 31.
As well as for the Components Recognition task, the capabilities of the present inventions can be augmented by using a semantic processor, since a more reliable identification of the components that effectively are the subject or the object of the descriptive verb can be guaranteed: let us consider the previous example modified as follows: “the arm support comprises two guide rails, a screw drive arranged between the opposite guide rails, and a carrier”; in this case the clause “arranged between the opposite guide rails” does not contain any objects of the verb “comprise”, but it contains an identified component of the system (the “guide rails”).
A Detail Level (DL) is assigned to each component so that the maximum abstraction level is represented by a DL=0 and the DL of each subsystem is one level greater than the DL of the corresponding supersystem.
The results of this first classification phase can be further processed to identify the role of a component in the assembly described in the text: it is possible to assign to each component an attribute as “assembly”, “part” or “portion” in order to allow a more effective data exchange with CAD/PDI/CRM systems; this information can be transferred through commonly used data exchange formats like IGES, STEP, IDEF etc.
Such classification allows a direct link to the Feature Tree of a Part model and/or to the Assembly Tree of an Assembly model in a CAD system, hence integrating the conceptual model of a mechanical system to its embodiment.
A third way to classify the components of the examined system consists in identifying which of them are secondary products of the system itself and among these which is the main product. This classification requires the results of the Interaction Analysis Module 33 and is described below.
The results of the Components Classification Module 32 are stored in the Database of the Extracted Information 50.
The Interactions Analysis Module 33 allows the identification of the functional links existing between the recognized components of the examined system. Again two different approaches (keywords search based and semantic based) can be followed:
An alternative embodiment of the same feature makes use of a semantic processor: for each TFA triad, if both the Tool and the Artifact are components of the system and the Field is a verb not belonging to the list f) of the Stop Words and Analogue Words Database 40, then that TFA triad is assumed as a basic functional block of the system; otherwise if just one among the Tool and the Artifact is a component of the system, but the Field is a verb belonging to the set c) of the Stop Words and Analogue Words Database 4 then the missing Tool/Artifact is assumed as an External Component of the system and the complete triad is assumed as a basic functional block of the system. Finally it may happen that the Tool is a component of the system, but the pair Field-Artifact itself is a function of the system (for example, provide-tension=tighten); if a pair Field-Artifact among those extracted by the semantic processor belongs to set g) of the Stop Words and Analogue Words Database 40, then the subject of the verb is assumed as the Tool of the triad and the pair Field-Artifact is translated according to set g) table of the Stop Words and Analogue Words Database 40 in a functional Field. The search for the Artifact of such a function can be demanded to the user or performed by looking for the first identified component following the preposition typically associated to that pair Field-Artifact (In the same table of the Stop Words and Analogue Words Database 40 one or more typical preposition following each pair Field-Artifact are cited). For example, in the sentence “the upper end of the piston contacts the arm to provide tension on the arm which . . . ” the semantic processor identifies the triad “upper end—provide—tension” from which the Tool “upper end”, the Field “tighten” and the Artifact “arm” (as the component following the preposition “on”) are extracted.
At the end of the Interactions Analysis phase, all Tools/Artifacts that have not been identified as system components by the Component Recognition Module 31 are assumed as External Systems and they can be used to link or combine two or more inventions, technical systems and/or to highlight the relations between the examined system and the other systems belonging to the State of the Art of that field of application.
Typically, if the text to be analysed is a patent, the Interactions Analysis is performed on the “Detailed description of the invention” and/or on the Abstract and/or on the Claims.
At the end of the interactions analysis performed by the Interactions Analysis Module 33, all identified triads TFA are stored in the Database of the Extracted information 50, as well the position in the examined text of the sentence from where such a triad has been extracted: the position is evaluated just numbering with a sequential order all the sentences of the examined text, distinguishing a sentence from another on the basis of the character “.” or the ASCII character Carriage Return.
Combining the results of the Components Recognition Module 31, the Hierarchy of Detail Identification Sub-Module 34 and the Interactions Analysis Module 33, it is possible to identify all secondary products and among these the main product of the examined system. In the exemplary embodiment of the present invention. shown in
The Post Processing Module 60 supplies the content of the Database of the Extracted Information 50 to the user, organizing such data in four different forms as function of the scope of the analysis. An exemplary embodiment of the Post Processing Module 60 according to the principles of the present invention is composed of four sub-modules as shown in
The Graphical Mapping of Text Content Module 61 is an effective tool to output all information extracted by the analysis of the Text Analyser Module 30: the results of the Components Recognition, the Components Classification and the Interactions Analysis phases are represented in a diagram, as in
The Text Comparison Module 62 allows the comparison between two or more systems descriptions according to the following parameters:
The Database of Functional Usage of Components in Different Systems 63 stores all functional interactions associated with homonymous components in all examined texts, recording the reference to the source text and the role of the component in the TFA triad.
A comparison between the way a same component is used in different systems can be supplied to the user.
The Database of Components Capable of Performing a Given Function 64 stores:
A comparison between the way a same action (Field) or a same function (Field-Artifact) is performed in different examined systems can be supplied to the user.
The Stop Words and Analogue Words Database 40 is composed of eight set of words, all customisable by the user both manually or through an automatic extraction procedure described below:
The Stop Words and Analogue Words Database 40 allows the customisation of the system described in the present invention, but some other customisations are also available:
Number | Date | Country | Kind |
---|---|---|---|
02425149.8 | Mar 2002 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP03/01554 | 2/17/2003 | WO | 10/29/2004 |