The patent relates generally to electronic mail messages and, more particularly, to detecting content information from an electronic mail message and predicting its level of importance.
A summary of certain embodiments disclosed herein is set forth below. It should be understood that these aspects are presented merely to provide the reader with a brief summary of these certain embodiments and that these aspects are not intended to limit the scope of this disclosure. Indeed, this disclosure may encompass a variety of aspects that may not be set forth below.
Embodiments of the disclosure related to systems and methods for detecting and predicting level of importance of Electronic Mail Message. For example, a method comprises detecting topic information from an electronic mail message using topic detection module, detecting role information of one of the sender and the receiver from the electronic mail message using a role detection module, detecting a relationship between the topic information and the role information using a relationship detection module, and predicting level of importance based of the detected topic, role and relationship information. The embodiment further includes at least one of N-grams, Part-of-Speech (POS) Tags, Length features, Content features, meta features and the like located in one or more of a topic detection, a sender and receiver role detection, and relationship detection. Each detection includes in a non-transitory machine accessible medium when accessed by a machine, cause the machine to perform operations.
In alternate embodiment, a non-transitory machine accessible medium comprises a first detection module for detecting topic information from an electronic mail message, a second detection module for detecting role information of one of the sender and the receiver from the electronic mail message, a third detection module for detecting a relationship between the detected topic information and the detected role information. Based on the detected topic, role and relationship information the non-transitory machine accessible medium predicts level of importance of the electronic mail message. The feature includes at least one of N-grams, Part-of-Speech (POS) Tags, Length features, Content features, meta features and the like. Such feature is located in one or more of the detection modules.
The first detection module is a topic detection module, the second detection module is a sender role and receiver role detection module, the third detection module is a relationship detection module.
These and other features, aspects, and advantages of this disclosure will become better understood when the following detailed description of certain exemplary embodiments is read with reference to the accompanying drawings in which like characters represent like arts throughout the drawings, wherein:
One or more specific embodiments will be described below. In an effort to provide a concise description of these embodiments, not all features of an actual implementation are described in the specification. It should be appreciated that in the development of any such actual implementation, as in any engineering or design project, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related and business-related constraints, which may vary from one implementation to another. Moreover, it should be appreciated that such development effort might be complex and time consuming, but would nevertheless be a routine undertaking of design, fabrication, and manufacture for those of ordinary skill having the benefit of this disclosure.
The following description is presented to enable any person skilled in the art to make and use the described embodiments, and is provided in the context of a particular application and its requirements. Various modifications to the described embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the described embodiments. Thus, the described embodiments are not limited to the embodiments shown, but are to be accorded the widest scope consistent with the principles and features disclosed herein.
The network 102 can comprise one or more sub-networks, and can be installed between any combination of the client machines 106A-106D, the server 104, computing machines and appliances included within the network computing system 100. In some embodiments, the network 102 can be for example a local-area network (LAN), a metropolitan area network (MAN), a wide area network (WAN), a primary network 104 comprised of multiple sub-networks located between the client machines 106A-106D and the server 104, a primary public network with a private sub-network, a primary private network with a public sub-network, or a primary private network with a private sub-network 104. Still further embodiments include a network 102 that can be any network types such as a point to point network, a broadcast network, a telecommunication network, a data communication network, a computer network, an ATM (Asynchronous Transfer Mode) network, a SONET (Synchronous Optical Network) network, a SDH (Synchronous Digital Hierarchy) network, a wireless network, a wireline network, and the like. Depending on the application, other networks may be used so that data exchanged between the client machine and the server can be transmitted over the network. Network topology of the network 102 can differ within different embodiments which may include a. bus network topology, a star network topology, a ring network topology, a repeater-based network topology, or a tiered-star network topology. Additional embodiments may include a network of mobile telephone networks that use a protocol to communicate among mobile devices, where the protocol can be for example AMPS, TDMA, CDMA, GSM, GPRS, UMTS, LTE or any other protocol able to transmit data among mobile devices.
The communication link L1-L4 may be wired, wireless, or combination thereof. The system 100 may be used in commonplace in offices, enterprise-wide computer networks, intranets, internets, public computer networks, or combination thereof. The wireless communication link may include cellular protocol, data packet protocol, radio frequency protocol, satellite band, infrared channel, or any other protocol able to transmit data among client machines. The wired communication link may include any wired line link.
The computer readable media 204 may be partitioned or otherwise mapped to reflect the boundaries of the various subcomponents. The computer readable 204 typically includes both volatile and non-volatile media, removable and non-removable media. For example, the computer readable media 204 includes computer storage media and communication media. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology, CD-ROM, DVD, optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage device, or any other medium which can be used to store the desired information and which can accessed by a client machine. For example, computer storage media can include a combination of random access memory (RAM), read only memory (ROM) such as BIOS. Communication media typically includes computer readable instructions, data structures, program modules, or other data in a modulated data signal such a carrier wave or other transport mechanism and include any information delivery media. Communication media may also include wired media such as a wired network or direct-wired communication, and wireless media such as acoustic, RF, infrared (IR) and other wireless media. Communications of the any of the above should also be included with the scope of computer readable media.
The input/output subsystem 208 includes various end user interfaces such as a display, a keyboard, joystick, a mouse, a trackball, a touch pad, a touch screen or tablet input, a foot control, a servo control, a game pad input, an infrared or laser pointer, a camera-based gestured input, and the like capable of controlling different aspects of the machine operation. For example, user can input information by typing, touching a screen, saying a sentence, recording a video, or other similar inputs. The communication interface 212 allows software and data to be transferred between the computer system and other external electronic devices in the form of signals which may be, for example, electronic, electromagnetic, optical, or other signals capable of being received by the communication interface 212. The communication interface 212 may be for example a modem, a network interface, a communication port, a PCM-CIA slot and card, or the like.
The processor 214 may be a general or special purpose microprocessor operating under control of computer executable instructions, such as program modules, being executed by a client machine. Program modules generally include routines, programs, objects, components, data structure and the like that perform particular tasks or implement particular abstract types. In one embodiment, some or all of the sub-processors may be implemented as computer software tangibly stored in a memory to perform their respective functions when executed. In alternate embodiment, some or all of the sub-processors may be implemented in an ASIC.
The manager system module 216 can be a single module and coupled to other computer implemented device via the system bus. The manager system module 216 can also be integrated into at least one of the computer readable media 204, the processor 214, or any computer implemented device. The manager system module 216 includes instructions for detecting object information and content information in at least one electronic mail message, detecting the relationship between the detected object information and detected content information, and predicting level of importance of the electronic mail message based on the detected object information, detected content information, and detected relationship. In one embodiment, the manager system module 216 may be integrated into the server 104 for detecting object information and content information in at least one electronic mail message, detecting the relationship between the object information and content information, and predicting level of importance of the electronic mail message. In alternate embodiment, more than one manager system module 216 may be coupled or integrated into the client machine 106, the server 104, and any computer implemented device. The detection and prediction of the object information, content information, the relationship, and level of importance may be either performed by a single manager system module or several manager system. The object information may be a topic information and the content information may be the roles of the sender and the receiver. A topic modeling in a form of computer executable instructions stored in the manager system model 216 is used to detect the topic information.
At least one of topic detection 304, role detection 306, and relationship detection 308 of one or more incoming emails can be performed by any server, manager system module, processor, computer implemented device, and the like. The topic detection 304 includes detecting of topic information in at least one of the first and second context section 14, 16 and anywhere else in the email using a topic modeling. In some embodiment, the topic modeling may be in a form of computer executable instructions stored in the topic detection 304 configured to detect the topic information. The detected topics can be a list of keywords extracted or derived from the content of the email. The topic information may be categorized/structured into one or more groups/community such as business contact group, personal contact group, social contact group, advertisement contact group, and the like. The personal contact group may be further categorized/structured into several sub-groups such as family and friends. Similarly, the business contact group may also be categorized/structured into several sub-groups such as internal business/co-workers and external business liaisons. Various levels or hierarchy can be assigned to one of the groups or sub-groups, depending on the application. For example, family sub-group of the personal contact group can be divided into levels or hierarchy distant family, direct family, and the like. Other topic information includes as junk, span, or the like may be discarded or removed without assigning a specific group.
Roles or title of the sender and the receiver in at least one of the first context section 14, second context section 16, and elsewhere in the email can be performed by the role detection 306. The role of the title of one of the sender and receiver may include such as business title or job function, family role, social role, in various environments. In one embodiment, the role or title of one of the sender and receiver may not be visibly available in the first context section 14, the role detection 306 continue to detect information in either the second context section 16 or the elsewhere in the email. In this case, the information can be title or role of the sender/receiver appeared in a signature field or among the content of information in text format or other format. In another embodiment, the role or title of one of the sender and receiver may be outside the business context such as personal (friend, family), social, community, and the like.
The relationship between at least one of the object information, content information, topic information, sender, receiver in the email can be identified and detected using the relationship detection 308. The relationship may be close-close relationship, close-distant relationship, distant-distant relationship, direct-indirect relationship, and the like. The relationship can be further categorized into various content-role, environment-role, and content-environment relationships. For example, the relationship includes employer-to-employee relationship, peer-to-peer relationship, husband-wife relationship, parent-child relationship, client-business relationship, friend-friend relationship, and the like. In one embodiment, one or more of the detections 304, 306, 308 may be performed either in a single step or multiple steps using a single computer implemented device or various computer implemented devices. In another embodiment, some or all the detections 304, 306, 308 may be performed simultaneously in a single step.
Each detections 304, 306, 308 include at least one feature extractor. The feature extractor includes N-grams, Part-of-Speech (POS) Tags, Length features, Content features, and the like, suitable for detecting at least one of the topic information, object information, or content information in at least one electronic mail message, identifying the role of the sender and receiver, detecting the relationship between the at least one of the topic information, object information, content information, or roles of the sender and receiver, and predicting level of importance of the electronic mail message. For example, POS tags analyze input word's characteristics based on word, context and the like. Any known POS techniques available in the field may be used to analyze input word's characteristics. Length features analyze information such as length of content, number of word counts, and the like. In one embodiment, POS tags and Length features are two independent feature extractor for performing different analysis. In another embodiment, POS tags and Length features can be integral into a single feature extractor for performing some or all the analysis. Content feature extract hints of the email message content such as number of question marks, data information, capitalized words, and the like.
A statistical model 310 in the form of software or firmware are provided in the manager system module for detecting topics in an electronic mail message, identifying the role of the sender and receiver, detecting the relationship between the at least one of the topic information, or roles of the sender and receiver, and predicting level of importance of the electronic mail. The model for detecting the topics, roles or relations can be statistical or rule-based. As illustrated in
The embodiments described above have been shown by way of example, and it should be understood that these embodiments may be susceptible to various modifications and alternative forms. It should be further understood that the claims are not intended to be limited to the particular forms disclosed, but rather to cover all modifications, equivalents, and alternatives falling with the sprit and scope of this disclosure.
While the patent has been described with reference to various embodiments, it will be understood that these embodiments are illustrative and that the scope of the disclosure is not limited to them. Many variations, modifications, additions, and improvements are possible. More generally, embodiments in accordance with the patent have been described in the context or particular embodiments. Functionality may be separated or combined in blocks differently in various embodiments of the disclosure or described with different terminology. These and other variations, modifications, additions, and improvements may fall within the scope of the disclosure as defined in the claims that follow.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2016/079447 | 12/1/2016 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62261530 | Dec 2015 | US |