The present subject matter relates generally to interrelation of multiple data streams, and more particularly to tagging and correlation of data within aviation-related data systems and other asset-related applications.
Vast quantities of data are generally available related to performance tracking for transportation fleets and individual assets. For example, the aviation industry gathers aircraft operational data from a variety of particular sources. Data can be collected from aircraft via Quick Access Recorders (QARs), which can provide airborne recordation of raw flight data parameters received from a number of aircraft sensors and avionic systems. Data can be collected from maintenance records from an airline's Maintenance, Repair and Overhaul (MRO) systems. Data also can be collected from pilot reports or Aircraft Condition Monitoring Systems (ACMS) communications. Still further, Aircraft Communications Addressing and Reporting System (ACARS) messages can include relevant data including aircraft movement events, flight plans, weather information, equipment health, status of connecting flights, and the like.
Predictive analysis of aircraft operational data can offer useful information for maintenance and prognostics for individual aircraft or entire fleets. Many existing systems rely primarily on human interpretation of these vast amounts of data, which can be cumbersome, tedious and time consuming. In addition, enterprise-level analytic systems that consume multiple data streams can sometimes require a composite view of all available data for an asset. Knowledge inferred from such a composite view can be required to build an accurate situational awareness picture for a fleet of assets. These known options can yield limited accuracy and effectiveness in making intelligent decisions about maintenance actions to take and the urgency of taking those actions.
Aspects and advantages of embodiments of the present disclosure will be set forth in part in the following description, or may be learned from the description, or may be learned through practice of the embodiments.
One example aspect of the present disclosure is directed to a computer-implemented method for interrelation of multiple streams of asset-related data. The method can include providing, by one or more processors, one or more portions of data from a plurality of asset-related data streams as input text to a natural language processing engine. The method can further include identifying, by the one or more processors, one or more data entities within the asset-related data streams using the natural language processing engine. The method can further include generating, by the one or more processors, one or more processing rules for applying information to the data entities identified within the asset-related data streams. The method can further include storing, by the one or more processors, the one or more processing rules as part of a statistical model for evaluating subsequent portions of asset-related data for performance of one or more maintenance events.
Another example aspect of the present disclosure is directed to a system for interrelation of multiple streams of asset-related data. The system can include one or more processors and one or more memory devices. The one or more memory devices can store computer-readable instructions that when executed by the one or more processors cause the one or more processors to perform operations. The operations can include providing one or more portions of data from a plurality of asset-related data streams as input text to a natural language processing engine. The operations can further include identifying one or more data entities within the asset-related data streams using the natural language processing engine. The operations can further include generating one or more processing rules for applying information to the data entities identified within the asset-related data streams. The operations can further include storing the one or more processing rules as part of a statistical model for evaluating subsequent portions of asset-related data for performance of one or more maintenance events.
Variations and modifications can be made to these example aspects of the present disclosure.
These and other features, aspects and advantages of various embodiments will become better understood with reference to the following description and appended claims. The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments of the present disclosure and, together with the description, serve to explain the related principles.
Detailed discussion of embodiments directed to one of ordinary skill in the art are set forth in the specification, which makes reference to the appended figures, in which:
Reference now will be made in detail to embodiments of the invention, one or more examples of which are illustrated in the drawings. Each example is provided by way of explanation of the invention, not limitation of the invention. In fact, it will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the scope or spirit of the invention. For instance, features illustrated or described as part of one embodiment can be used with another embodiment to yield a still further embodiment. Thus, it is intended that the present invention covers such modifications and variations as come within the scope of the appended claims and their equivalents.
Example aspects of the present disclosure are directed to systems and methods for interrelation of multiple data streams associated with composite analytics for a fleet of assets, such as aircraft or the like. Some example embodiments can provide features for building a closed-loop system capable of combining quantitative input in the form of machine learning with qualitative input in the form of user instructions for automated analysis of multiple data streams. These features can serve as part of a data tagging and correlation framework. In this way, example aspects of the present disclosure can have a technical effect of providing a robust and extensible data analysis pipeline that leverages rules engine capability and natural language processing to extract meaning from free text input to identify data entities and to correlate identified relationships among data entities in the system.
Automated generation of facts and information about an asset or fleet of assets can have a technical effect of providing meaningful outputs to assist with making intelligent decisions regarding operations and reliability of a fleet of assets. In some examples, processing rules developed according to the disclosed techniques can have a technical effect of implementing tagging of faults and warnings within post-flight report (PFR) data. In some examples, maintenance, repair and operations (MRO) data can be processed and correlated to provide meaningful output information. Identified and generated datasets, rules, tagged data and/or resulting analytics can be provided for display to a user via an interactive user interface or other display options.
In one example implementation, a method for interrelation of multiple data streams can include the application of multiple data streams of asset-related data to a processing pipeline that includes natural language processing algorithms and/or business rule processing algorithms. In some examples, the asset-related data can correspond to aviation-related data including but not limited to aircraft faults and warnings data, post-flight report data, and/or aircraft maintenance report data.
The natural language processing algorithm can receive one or more portions of asset-related data as input and identify one or more data entities from free input text contained in the data streams. In some examples, the natural language processing algorithm also can identify one or more relationships among the identified data entities. One or more processing rules can be generated for applying information to the identified data entities. In some examples, the processing rules can include tagging rules for associating one or more keywords to one or more data entities identified within the asset-related data streams. In some examples, the processing rules can include relationship rules for identifying relationships among two or more data entities identified within the asset-related data streams. The generated processing rules can be stored as part of a statistical model for evaluating subsequent portions of asset-related data.
The statistical model can be developed at least in part by machine learning processes that receive datasets of asset-related data and corresponding processing rules. These datasets are used to help train classifiers for processing subsequent portions of asset-related data. In some examples, the datasets provided as input to the statistical model include asset-related data (e.g., identified data entities) and associated user instructions for tagging or correlating the identified portions of the asset-related data. These correlated associations between data and tagging or relationship rules can be used to generate additional processing rules for further development of a statistical model for analyzing subsequent data. In examples that combine both machine learning and user input, a flexible and robust system can facilitate both the manual and automated extraction of knowledge from asset-related data.
Referring now to the FIGS.,
Data available within the ASN Database 102 can be collected from a variety of particular sources maintained by one or more particular airlines, by general aviation tracking systems, by third party data collection and analysis entities authorized by an airline or other organization to track relevant data, or other entities. For instance, Faults and Warnings Data 104 can be collected from aircraft via Quick Access Recorders (QARs), which can provide airborne recordation of raw flight data parameters received from a number of aircraft sensors and avionic systems. Post-Flight Reports (PFR) Data 106 can include an electronic form of data that is collected automatically from aircraft systems and/or from information provided by pilot data entry that is pertinent for tracking customized information about particular aircraft flights. MRO Data 108 can be collected from maintenance records from an airline's Maintenance, Repair and Overhaul (MRO) systems. Tagging Data 110 can include different predetermined tagging options that can be employed by various processing algorithms to label keywords and define relationship for identified data entities. Tagging Data 110 also can include datasets of correlated items, including data entities and associated keyword tags or relationship tags that can be used as training input for machine learning of statistical models that define various processing algorithms. Data streams provided within ASN Database 102 can come from still further sources, including but not limited to pilot reports, Aircraft Condition Monitoring Systems (ACMS), and/or Aircraft Communications Addressing and Reporting System (ACARS) messages that include relevant data such as aircraft movement events, flight plans, weather information, equipment health, status of connecting flights, and the like.
Referring still to
Windowed data 114 extracted from data streams via data extraction and windowing algorithm 112 can be provided to natural language processing engine or algorithm 116. Natural language processing engine or algorithm 116 can generally involve the interpretation of natural language input or free text input from asset-related data streams. Natural language processing engine 116 can include one or more particular operational items, including a sanitization operation 118, a name entity recognition operation 120, a co-reference resolution operation 122 and a relationship extraction operation 124.
Sanitization operation 118 is configured to perform spelling corrections and remove or replace irrelevant characters within free text fields that are provided as part of the asset-related data streams from ASN database 102. Customizable sanitization processing rules can use example datasets to update probability models for tokenizing free text and filtering out unwanted results.
Name entity recognition operation 120 can include a process of identifying one or more data entities from text within various asset-related data streams. In the case of data streams from ASN database 102, example data entities for potential identification can include but are not limited to aircraft systems and subsystems, part numbers and names, aircraft tail numbers, aviation abbreviations, task numbers, actions, problems, and/or relationships among these entities and others. Customizable dictionaries can be stored for access by name entity recognition operation 120 to assist with the identification of entities that use terms common to the native language (e.g., English) of the system. Additional dictionaries can be customized to help handle domain specific entities by storing data entries for known entities that may not be common to a native language. Dictionaries also can store related synonyms used for resolving co-reference data within co-reference resolution operation 122.
Co-reference resolution operation 122 can identify when a particular data entity is co-referenced by other mentions within input text including pronouns, abbreviations or the like. In some cases, plural pronouns can be identified as co-referencing multiple data entities in the input text.
Relationship extraction operation 124 can identify relationships among different data entities identified via name entity recognition operation 120 and/or co-referenced mentions of data entities identified via co-reference resolution operation 122. Identification of relationships via relationship extraction operation 124 can be facilitated at least in part by part of speech tagging and/or phrase grouping. Part of speech tagging can label input text based on definition and context, while phrase grouping collects tokens and part of speech tags into related groups such as phrases.
The natural language processing algorithm 116 can advantageously offer a flexible system that can be readily updated to accommodate advancing capabilities and competencies of newly emerging natural language processing rules engines. Additional advantages can be realized in part when the natural language processing algorithm 116 is accomplished via machine learning processes. Such processes can be periodically or incrementally trained to maintain and enhance the accuracy of results. This training can be accomplished via user feedback and actions which can be translated and formatted into consumable datasets for the algorithms.
Processed data 126 can be received as output from the natural language processing engine 116 and provided as input to a business rule processing algorithm 128. Business rule processing engine 128 can include a relationship rule application 130 and a tagging rule application 132. In general, the processing rules provided within business rule processing engine 128 can continuously evolve as part of a statistical model that is progressively updated as new data is made available to the system. As such, business rule processing engine 128 can be configurable to facilitate maximum extensibility and also to support rules that are unique to specific customers and/or asset data. Business rule processing engine 128 also can be provided with linked access to particular libraries or dictionaries defining various data entities and relationships among those entities, similar to the dictionaries available to natural language processing engine 116.
Relationship rule application 130 corresponds to an algorithm of computer-executed operations that are configured to define relationships between one identified data entity and another. The data entities for which relationships can be defined can correspond to the data entities identified within the natural language processing algorithm 116. For example, relationship rule application 130 can provide a framework for automatically relating alerts to faults or other relevant entries from Post-Flight Report (PFR) data. In other examples, configurable processing rules can provide a framework for automatically correlating alerts with MRO data entries. An additional example of relationship tagging rules is illustrated in
Tagging rule application 132 corresponds to an algorithm of computer-executed operations that are configured to apply one or more tags, or keywords, to an entity. The types of tags or keywords available to associate with an entity can be predetermined or customizable. Some examples of available tags within aviation data can serve to implement data tagging of faults and warnings from post-flight report data. Pre-defined tags in such examples can include indicators such as but not limited to “Undetectable,” “Undetected,” and “Out of Scope.” Another example involves tagging of a maintenance defect as a “Miss.” An additional example of tagging rules is illustrated in
Additional user-defined tagging criteria also can be utilized. General tags also can be available to facilitate troubleshooting and notification of poor data quality. For example, a “Requires Manual Attention” tag can be used to indicate either an error while processing or the successful execution of a business rule designed to detect problems in the data. In another example, an “Indecipherable” tag can be used to indicate that the underlying data is indecipherable and thus cannot be processed. An example application of the “Indecipherable” tag would be for free text with no recognizable data entities.
In some examples, data tagging and correlation system 100 can include a post processing algorithm 134 that serves to update the system 100 with processing rules that are identified in the natural language processing algorithm 116 and/or the business rule processing algorithm 128. For example, when processing rules including tagging rules and/or relationship rules are developed within the business rule processing algorithm 128, those operational rules can be added to working memory within ASN database 102 so that the processing rules are available for subsequent data processing on a persistent basis. Results 136 from the business rule processing algorithm 128 can be further condensed in a condense results operation 138 or processed in a process results operation 140 to yield a proper form for storing within ASN database 102. In some examples, the processing rules are stored within ASN database 102 as part of a statistical model for evaluating subsequent portions of asset-related data.
Referring now to
Client systems 150 can correspond to computing devices and related storage databases for gathering and collecting data specific to the operational assets of a particular client, such as an airline or the like. Data from client systems 150 can be relayed via a Secure File Transfer Protocol (SFTP) process to an ASN Decoder 152. ASN Decoder 152 can be configured to decode larger text fields with varying degrees of structure depending on the particular type of analyzed assets and the particular client system 150. For example, in ASN database analysis, ASN Decoder 152 can be configured to decode fields including but not limited to Faults, Warnings and Maintenance Messages.
User 154 can correspond to one or more engineers, technicians, managers, or other specialists within an airline maintenance organization who help solve various aircraft maintenance problems. Such a user 154 can be provided with access to the data tagging and correlation system 100 through an ASN user interface and markup widget 156. Via the user interface 156, users can tag and relate the various data entities identified in the system with predefined system tags and/or user-defined tags. In particular, users 154 can enter mark-ups and/or corrections via user interface 156 that result in the identification of new data entities, corrected data entities, new tags or corrected tags for storing within the ASN database 102. This capability of providing instructional information via the user interface 156 can support the derivation of datasets which can be used to drive machine learning and derive processing rules for algorithms within the rules-based engine 148. Example actions that can be implemented via user instructions provided at user interface 156 include tagging identified alerts as having “No Value,” tagging identified faults as “Out of Scope” or “Undetected,” and/or relating alerts to relevant Fault, Warning and/or MRO data.
Rules based engine 148 can more particularly include a statistical model formulated by a machine learning training loop that identifies processing rules as training classifiers for processing of subsequent portions of asset-related data. As shown in
Each server 202 and client 222 can include at least one computing device, such as depicted by server computing device 204 and client computing device 224. Although only one server computing device 204 and one client computing device 224 is illustrated in
The computing devices 204 and/or 224 can respectively include one or more processor(s) 206, 226 and one or more memory devices 208, 228. The one or more processor(s) 206, 226 can include any suitable processing device, such as a microprocessor, microcontroller, integrated circuit, logic device, one or more central processing units (CPUs), graphics processing units (GPUs) dedicated to efficiently rendering images or performing other specialized calculations, and/or other processing devices. The one or more memory devices 208, 228 can include one or more computer-readable media, including, but not limited to, non-transitory computer-readable media, RAM, ROM, hard drives, flash drives, or other memory devices. In some examples, memory devices 208, 228 can correspond to coordinated databases that are split over multiple locations.
The one or more memory devices 208, 228 store information accessible by the one or more processors 206, 226, including instructions that can be executed by the one or more processors 206, 226. For instance, server memory device 208 can store instructions 210 for implementing processing rules, operations and algorithms 212 for performing various functions disclosed herein. In some examples, processing rules and algorithms 212 can include but are not limited to the natural language processing algorithm 116, business rule processing algorithm 128 and post-processing algorithm 134 depicted in
Computing devices 204 and 224 can communicate with one another over a network 240. In such instances, the server 202 and one or more clients 222 can respectively include a network interface used to communicate with one another over network 240. The network interface(s) can include any suitable components for interfacing with one more networks, including for example, transmitters, receivers, ports, controllers, antennas, or other suitable components. The network 240 can be any type of communications network, such as a local area network (e.g. intranet), wide area network (e.g. Internet), cellular network, or some combination thereof. The network 240 also can include a direct connection between server computing device 204 and client computing device 224. In general, communication between the server computing device 204 and client computing device 224 can be carried via network interface using any type of wired and/or wireless connection, using a variety of communication protocols (e.g. TCP/IP, HTTP, SMTP, FTP), encodings or formats (e.g. HTML, XML), and/or protection schemes (e.g. VPN, secure HTTP, SSL).
The client 222 can include various input/output devices for providing and receiving information to/from a user. For instance, an input device 236 can include devices such as a touch screen, touch pad, data entry keys, and/or a microphone suitable for voice recognition. Input device 236 can be employed by a user to provide marked up text and user-defined data for entities, tags or other components of the disclosed data tagging and correlation systems. An output device 238 can include audio or visual outputs such as speakers or displays for indicating data tagging and correlation outputs, user interfaces, and the like.
The technology discussed herein makes reference to servers, databases, software applications, and other computer-based systems, as well as actions taken and information sent to and from such systems. One of ordinary skill in the art will recognize that the inherent flexibility of computer-based systems allows for a great variety of possible configurations, combinations, and divisions of tasks and functionality between and among components. For instance, server processes discussed herein may be implemented using a single server or multiple servers working in combination. Databases and applications may be implemented on a single system or distributed across multiple systems. Distributed components may operate sequentially or in parallel.
It will be appreciated that the computer-executable algorithms described herein can be implemented in hardware, application specific circuits, firmware and/or software controlling a general purpose processor. In one embodiment, the algorithms are program code files stored on the storage device, loaded into one or more memory devices and executed by one or more processors or can be provided from computer program products, for example computer executable instructions, that are stored in a tangible computer-readable storage medium such as RAM, flash drive, hard disk, or optical or magnetic media. When software is used, any suitable programming language or platform can be used to implement the algorithm.
Tags can serve as both output and input into the algorithmic framework of business rule processing algorithm 128. In some examples, business rule processing algorithm 128 applies preconfigured rules to produce tags as an output relative to various data entities. In some examples, users can assign tags as input associated with identified data entities using the user interface 156. In the latter examples, the user-inputted tags and associated data entities then can be packaged into datasets used for training the processing algorithm using a statistical model.
Referring now to
User interface 320 of
The visualization provided within user interface 320 of
In general, the format of events tracked along a period of time as depicted in
The operational events identified within user interface 330 of
Referring still to
Referring still to
While method (400) of
Referring now to
Referring still to
One or more of the operational events and maintenance events identified at (446) can be provided for display at (448). The events can be tracked over a period of time for each selected aircraft in a chart format such as a Gantt chart. An example of such a chart is shown in the first “Faults and Warnings” interface portion 322 of
Although specific features of various embodiments may be shown in some drawings and not in others, this is for convenience only. In accordance with the principles of the present disclosure, any feature of a drawing may be referenced and/or claimed in combination with any feature of any other drawing.
This written description uses examples to disclose the invention, including the best mode, and also to enable any person skilled in the art to practice the invention, including making and using any devices or systems and performing any incorporated methods. The patentable scope of the invention is defined by the claims, and may include other examples that occur to those skilled in the art. Such other examples are intended to be within the scope of the claims if they include structural elements that do not differ from the literal language of the claims, or if they include equivalent structural elements with insubstantial differences from the literal languages of the claims.