A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the xerographic reproduction by anyone of the patent document or the patent disclosure in exactly the form it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The present invention relates generally to computer-generated social network analysis mechanisms, and more particularly to methods and a computer-based apparatus for a new organizational metric to predict business performance based on longitudinal social network analysis.
Albert Einstein called quantum entanglement “spooky action at a distance” (Einstein et al., 1935), predicting that quantum mechanics should allow objects to influence each other's action at great distance. It took other Nobel prize winning physicists decades after Einstein's death to confirm his prediction. The current study proposes a similar social entanglement effect between people.
“You share everything with your bestie. Even brain waves.” (Angier, 2018). This is how the New York Times summarized the work of Parkinson et al. (2018), who found that brain scans of close friends show similar patterns as they watch a series of short videos. Using these results, the researchers trained a computer algorithm to predict the strength of a social bond between two people based on the relative similarity or synchronization of their neural response patterns. Such neural synchronization patterns are also observed in various other studies in different contexts, e.g., to determine neural contingencies between musical performers and their audiences. Hou et al. (2020) assess the neural synchronization between violinist and audience and the relation to the popularity of violin performance. Their findings suggest that neural synchronization between the audience and the performer might serve as an underlying mechanism for the positive reception of musical performance. Further, neural synchronization can be confirmed by analyzing verbal group communication (Liu et al., 2019). Individuals try to achieve neural and body synchronization in order to facilitate fluid interaction (Fairhurst et al., 2013; Yun et al., 2012). Experiments show that synchrony of fingertip movement and neural activity between two persons increases after cooperative interaction (Yun et al., 2012).
Hence, engaging individuals in synchronized activities like walking, dancing, etc. is an effective way of increasing subsequent cooperation between those individuals. However, the studies mentioned above focus on neural or body synchronization and are not applied in typical work environments or contexts. However “being in sync” or “in flow” in work environments is a relevant research topic and should be considered by decision-makers to determine the impact of such behavior on employee performance.
However, there exist opportunities to analyze online communication data in near-real time for continuous monitoring of team learning and performance. Metrics based on communication flow from person to person or the amount of communication are suitable for real-time processing. In addition, studies have shown that analyzing online communication data in organizational contexts (de Oliveira et al., 2019; Gloor et al., 2017b) could be used as a predictor for job-related constructs, such as employee turnover or employee performance. The speed of responding to an e-mail, for example, is a good predictor of individual and team performance (Gloor et al., 2020). It might be a proxy for the passion of the person who is responding to an e-mail (Gloor, 2017), or for other external reasons such as urgency, power differentials, etc.
There are multiple teachings that have been disclosed to facilitate the Team synchronization and flow state but unfortunately, there is not a straightforward study and solution proposed in this domain.
The current inventions propose a sophisticated system where a structured methodology is introduced to answer these questions by introducing a metric called entanglement, which measures the synchronization of e-mail communication behaviors of team members and their flow state over time. This metric is grounded in SNA and identifies the similarity of the time series of SNA metrics. The metric is validated by conducting four case studies, with different datasets from different organizations. Each case study is in a different context and variants of the entanglement measure are used as a predictor of different individual and group performance indicators
In light of the disadvantages of the prior art, the following summary is provided to facilitate an understanding of some of the innovative features unique to the present invention and is not intended to be a full description. A full appreciation of the various aspects of the invention can be gained by taking the entire specification, claims, drawings, and abstract as a whole.
The primary object of the invention is related to an advancement of a computer-based novel metric to measure how synchronized communication between team members is.
It is further the objective of the invention to provide a computer-based structured approach to calculate the Euclidean distance among team members' social network metrics time series.
It is also the objective of the invention to promote a new and versatile indicator for automatic computer-based analysis of employees' communication, analyzing the hitherto underused temporal dimension of online social networks which could be used as a powerful predictor of employee and team performance, employee turnover, and customer satisfaction.
It is also the objective of the invention to provide a novel computer-generated synchronization metric, called entanglement, which is based on computer-generated SNA of e-mail communication between different actors.
It is again the objective of the invention to provide an easy and simple computer-generated metric entanglement that can also predict individual employee turnover and might help such studies to improve their prediction model quality.
It is also the objective of the invention to provide a computer-generated Gini coefficient of betweenness entanglement, which is automatically calculated from time series of betweenness centrality of each employee in the email network, demonstrating that it is associated with individual employee performance. A high Gini index of betweenness entanglement—indicating that an employee is strongly entangled with a small team, while being weakly entangled with the rest of the organization—significantly increases the chance of being a top performer.
This summary is provided merely for purposes of summarizing some example embodiments, so as to provide a basic understanding of some aspects of the subject matter described herein. Accordingly, it will be appreciated that the above-described features are merely examples and should not be construed to narrow the scope or spirit of the subject matter described herein in any way. Other features, aspects, and advantages of the subject matter described herein will become apparent from the following Detailed Description, Figures, and Claims
The following non-limiting and non-exhaustive embodiments of the present invention are described with reference to the following drawings. The system and method of the present invention will now be described with reference to the accompanying flow chart drawing figure, in which:
Skilled artisans will appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions of some of the features in the figures may be exaggerated relative to other elements to improve understanding of embodiments of the present invention. The apparatus and method components have been represented where appropriate by conventional symbols in the drawings, showing only those specific details that are pertinent to understanding the embodiments of the present invention so as not to obscure the disclosure with details that will be readily apparent to those of ordinary skill in the art having the benefit of the description herein.
Detailed descriptions of the preferred embodiment are provided herein. It is to be understood, however, that the present invention may be embodied in various forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but rather as a basis for the claims and as a representative basis for teaching one skilled in the art to employ the present invention in virtually any appropriately detailed system, structure, or manner.
The current invention in its preferred embodiment aims to provide a sophisticated system based on the idea of using structured communication data to measure different categories of individual and organizational performance.
Synchronization is a fundamental element of life. Besides neuronal synchronization mentioned in the introduction, one finds studies that deal with the synchronization of human activities (Guastello and Peressini, 2017). Synchronization is often defined as the manifestation of unintended coordination. It is part of the natural behavior of a human being and takes place so invisibly that we usually do not notice it. It is triggered by audio-visual stimuli, haptic perception, or simply by the presence of certain people. Synchronization can be analyzed as neuromuscular coordination, where there is a relatively exact or proportional tracking of body, hand, and head movements, autonomic arousal, or electroencephalogram (EEG) readings between two or more people (Guastello and Peressini, 2017). For example, N'eda et al. (2000) show that the audience of a concert synchronizes its applause after an asynchronous start and Fairhurst et al. (2013) and Yun et al. (2012) show that people synchronize their finger tapping to improve coordination. While these studies only look at synchronization as neuromuscular coordination and task coordination, there are research efforts currently underway to uncover connections between synchronization in cognition, task structures, and performance outcomes in teams (Gipson et al., 2016). Better work performance outcomes would also be expected when teams are similarly synchronized (Elkins et al., 2009; Stevens et al., 2013). The hypothesis that team synchronization leads to better performance is further motivated by the theory of flow state. While the concept of synchronization in the above-mentioned studies applies a natural science perspective, human sciences like positive psychology consider synchronization as a part of the flow state (Gloor et al., 2012) and expect the flow state to cause better performance. A team is in the flow state (Csikszentmihalyi, 1996) when members create a sense of shared confidence and empathy, which culminates in a collective mental state in which individual intentions harmonize and are in-sync with those members of the group. This condition is also referred to as achieving a “group mind”, which is marked by a deep emotional resonance that enables e.g., jazz musicians to be completely coordinated throughout the improvisational flow. In other words, group flow manifests itself in physical and verbal activities, for instance, people mirroring each other and quickly finishing each other's sentences using the same words and phrases, indicating a “parallel synchronization of thought” (Armstrong, 2008). The more the team members are in-sync, the more likely it is to observe group flow.
Group flow can be analyzed by applying “interaction analysis”, which entails closely observing and categorizing the interactions, movements, and body language of group members. However, it cannot be limited to neurological studies of particular participants of the group's emotional conditions or subjective memories (Sawyer, 2003). Thus, group flow cannot be split down into specific tasks; rather, it is a process that arises from group dynamics and has the ability to improve job satisfaction, intrinsic motivation, vigor, performance, or efficiency (Delarue et al., 2008; Sawyer, 2003; van den Hout et al., 2018). Hence, flow represents rather an oscillating dynamic state that combines continuous and sudden changes across time (Ceja and Navarro, 2012) than a static one.
The flow concept can be transferred into the organizational context (Heyne et al., 2011). Bakker (2005) defines work-related flow as a short-term peak experience at work that is characterized by absorption, work enjoyment, and interest. Teams “are in flow” if there is a certain balance between challenges and the skill sets of the individual team members. Work-related flow leads to better productivity and performance (see
Each team member, or individual, may have a computing identifier to identify the member in the computing environment.
The idea of the entanglement measure is to determine how a person is in sync with his/her group and shares the same flow with the other team members, with regard to communication over a period of time. In an attempt to conceptualize entanglement, a multidisciplinary approach is proposed, bringing together concepts from several disciplines, ranging from quantum mechanics to human and social sciences. A result of this phenomenon is that when one measures the quantum state of one particle, one simultaneously determines the quantum state of the other particle. A quantum state (of a particle) is a representation of knowledge or information about an aspect of the system or reality (Pusey et al., 2012). In this study, we interpret the reality as the state about a person-to-person relationship. Thus, the two particles are seen as two individuals that have potentially interacted with “others”, not necessarily with each other, and have therefore become entangled. Our idea of synchronicity is that people are in-sync when they show similar behavioral patterns, such as communication activity. Hence, two persons are entangled even when they are physically separated or not involved in a (local) interaction with each other but share a similar communication behavior (an example is provided in
Similar concepts have previously been described in psychology and sociology. “Entrainment” describes a process where one system's motion or oscillation frequency synchronizes with another system, for instance, the brainwaves of two people rocking together in their chairs. Cross et al. (2019) defines interpersonal entrainment as the synchronization of organisms to a rhythm, for example singing, dancing, or even walking together. Much earlier, early twentieth-century French sociologist Emile Durkheim defined collective effervescence as the similar but broader notion of synchronized action between humans (Durkheim, 2008), to describe when a community or society comes together to communicate the same thought or participate in the same action. This concept has been picked up by sociologist Randall Collins through his construct of “Interaction Ritual Chains” (Collins, 2005), which explain collective action through shared emotional energy. The common theme of all these constructs is colocation, people creating and experiencing emotional energy by being together at the same location. We therefore prefer the term “entanglement” to describe synchronous action between humans independent from where they are located, to describe in the words of Albert Einstein, “spooky action at a distance”.
Human communication is fundamentally synchronous and rhythmic, two important characteristics of individual and interactional behavior (Condon, 1986). The synchronization of interactional behaviors helps to generate a sense of flow state for the persons involved (Condon, 1986). Further, it always takes other people for a person to reach the state of flow (Collins, 2005), while the other people do not have to be physically present. Thus, entanglement leads to a flow state of two persons analogous to the “mysterious change” of a particle's quantum state. Intuitively, we propose that the “more similar the communication” of two persons A and B is, the more person A is in sync and is able to share the same flow of communication with person B over a period of time. Individuals who are in flow might have higher abilities to productively channel their cooperative spirit when working together.
Thus, we can state that the distance of the data points representing the communication intensity between two or more persons in a specific time window is an indicator of their synchronization. Here, we use the Euclidean distance, a straight-line distance between two points in Euclidean space. We calculate the Euclidean distance d of two data points x and y of a communication metric A of the same time window t with:
This Euclidean distance specified in the formula above is calculated for every pair of nodes and time window t. An essential requirement to determine if persons are entangled is to consider both team synchronization and team flow. Team flow is based on flow experienced in relational embeddedness (Burt, 2005) which can be established by e.g., communication and collaboration. To address this structural feature of communication, we propose to apply SNA. SNA offers a suitable methodology to study group dynamics as well as to investigate the role of the individuals within these dynamics (Wasserman and Faust, 1994). It focuses on various aspects of the relational structures and the flow of information, which characterize a network of people, through graphs and structural measures.
To better illustrate the concept of “entanglement” we consider an email network, characterized as a graph made of a set of nodes (e-mail accounts) and a set of directed edges (weighted by the number of emails) connecting these nodes. The direction of an edge specifies the source (e-mail sender) and target (e-mail receiver) node; the weight of an edge shows the relation intensity (number of e-mails) between two nodes (see
To illustrate the idea and calculation of entanglement with an example, we use an individual mailbox representing a dataset of e-mails of persons who work together on several projects. First, we collected the mailbox and stored it in a database, where the e-mail data was structured from a network perspective. In order to calculate the entanglement of the mailbox owner and his/her colleagues, we take the inverse of the Euclidean distance of the time series of the communication activity represented by messages sent over time for each node/actor in the network. This value will get larger the more similar the activity time series of the two actors are. However, we have to distinguish between two pairs of actors at different locations in the network, one pair embedded into a tight cluster communicating with many other actors, while the other pair is exchanging the same number of e-mails as the first pair, but is only weakly connected to other actors. To make this metric comparable among pairs of actors with different levels of activity in the same network, we multiply it by the product of the degree centralities of both actors. Degree measures the centrality, sometimes seen as a proxy of popularity, of a node in a network, by counting the number of its nearest neighbors (Freeman, 1978).
Further, it can be a proxy for the level of engagement within a group, team, or organization (Gloor et al., 2020). Communication activity via e-mail (Gloor et al., 2014) indicates the number of e-mail messages sent by a person within a time interval.
Accordingly, we define the activity entanglement EA (xT, yT) between two individuals, named x and y in a specific time window T, as:
where CD (xT) and CD (yT) are the degree centralities of the two individuals x and y, and d(A(xT), A(yT)) is their Euclidean distance, with respect to communication activity A in a defined time window T. In other words, the entanglement of two individuals x and y is given by the multiplication of the number of their direct contacts in the e-mail network divided by their synchronization of communication activity. As has been said above, it is necessary to include the product of the degree centralities of x and y into the entanglement formula to provide for the differences in centralities between actors: assume that actor x has low degree, if x is synchronized with highly connected actor y having high degree centrality, the high degree of actor y will boost entanglement of actor x in comparison with all other actors in the network. In other words, we want our metric to reward less influential actors who are synchronized with influential actors.
Similarly, we could consider not just communication activity, but also individuals' synchronization in weighted and unweighted betweenness centrality. Betweenness is a well-known metric in social network analysis. It is the sum of the fraction of all-pairs shortest paths that pass through a node v (Freeman, 1977):
where V is the set of nodes, σ (s, t) is the number of shortest paths from s to t, and σ (s, t|v) is the number of those paths passing through node v (Brandes, 2001). Inverse arc weights are considered for the determination of node distances. To control for network size, the above index is usually normalized between zero and one.
If the betweenness centrality time series of two individuals are in sync, it means that they share similar network positions, and levels of influence, at the same time. Individual betweenness entanglement EB is the product of the degree of two individuals divided by their Euclidean distance in betweenness centrality over a period of time.
In addition, we speculate on the possibility of evaluating how much an individual is in sync with the aggregated flow of the entire network. As a proxy of the aggregated rhythm of the team, we take Freeman's group betweenness centralization, CGB (Freeman, 1978). Group betweenness centralization is the sum of the differences between the betweenness centrality of the most central node, CB(v*), and that of all other nodes in the network (Freeman, 1978; Wasserman and Faust, 1994), normalized by its maximum value which is (G−1)2(G−2) where G is the total number of nodes:
This definition of group betweenness centralization is appropriate for this use case, as we compare how entangled an individual node is with all other nodes with regard to betweenness.
Formally, we measure group betweenness entanglement EGB by dividing group betweenness centralization CGB by the Euclidean distance of group betweenness centralization and normalized betweenness centrality of the actor being analyzed over a time period. CGBT—as a metric of variation—is an indicator for the centralization of the group in time window T, the individual betweenness centrality CB (xT) in this sense is an influence on CGBT, i.e., how much an actor impacts CGBT. Intuitively, this metric reflects the contribution of this actor to the level of centralization of its group. In other words, it measures how far away the normalized betweenness centrality of an actor is from the betweenness centralization of its group at any point in time. If an actor's betweenness is high and its group betweenness centralization is high, the actor is probably responsible for the centralized network structure-thus the Euclidean distance between group betweenness centralization and an actor's betweenness centrality is small, and therefore the actor's group betweenness entanglement high. On the other hand, if an actor's betweenness is low and its group betweenness centralization is high, it means somebody else is central and the actor is unimportant in betweenness centrality terms, thus less entangled with the group.
betweenness entanglement, EGB(xT) of x as:
To show the inequality in individual group betweenness entanglement we calculate the Gini coefficient for EGB:
The same formula can also be used for activity entanglement to calculate G(EA). Intuitively, the Gini coefficient measures inequality in the distribution of entanglement among all actors in a network. This is based on the observation that for an actor x being resource-poor or resource-rich in a network—the resource being entanglement in this case—can be highly predictive for the behavior or performance of x. It therefore makes sense to put the entanglement of x in relationship to the entanglement of all other actors in the network through Gini entanglement.
This is illustrated by four case studies (Table 1) that show how the proposed entanglement metric can be used with e-mail data to predict work-related outcome variables, such as team performance and employee turnover. The four cases are related to different business contexts and consider different dependent variables. In all cases we analyze email data, illustrating the suitability of the entanglement metric for online communication data. Our goal here is not to directly compare results across case studies, deriving general conclusions, or claiming causality. Rather we want to show the versatility of our entanglement metrics, which can be adapted to study business interaction dynamics in different scenarios.
indicates data missing or illegible when filed
Case study A—learning behavior and performance: This case study was conducted as a pilot in a healthcare organization to determine if activity entanglement EA between 53 team members of 11 medical innovation teams could predict performance and learning behaviors. The performance and learning behaviors of each team were rated and triangulated every other month for the duration of a year by three overall project managers. They individually rated the team performance and the capability of the team to learn new things. At the same time, all e-mails of the project members were collected and analyzed. Individual activity entanglement of each actor with all other actors was calculated, and then the average was taken for each actor. Finally, for each team average and standard deviation of activity entanglement over all team members were computed. We find that team performance and learning behavior are significantly correlated with the standard deviation of activity entanglement of team members, as shown in
Case study B—turnover prediction: In our second case study, we conducted a pilot study at a global professional services firm. In this case, we wanted to evaluate the possible association of entanglement with executives' decision to leave the firm, through voluntary resignation. Turnover of highly important employees such as senior executives is critical for companies, because it has negative implications for firm performance (Hancock et al., 2013; Zylka and Fischbach, 2017). Eight months of e-mail data of 113 senior executives at a large global services company was collected from May to December 2014 (see
From a preliminary t-test, we immediately notice that there is a significant difference in the Gini index of activity entanglement, between senior executives who leave the company (M=0.0457, SD=0.0070) and those who stay (M=0.0488, SD=0.0059), t(111)=−2.513, p=0.0013. On average, Gini entanglement is significantly higher for those who stay.
Past studies have shown that managerial disengagement might depend on multiple factors and that communication-based and social network analysis metrics, captured from e-mail communication, can reveal it (Gloor et al., 2017b). Accordingly, we present Pearson's correlations (in Table 2) and logistic regression models (in Table 3), to see if the effect of the entanglement variable remained significant when combined with other predictors. The highest correlation of entanglement is with the contribution index, which however does not lead to collinearity issues. A high contribution index is an indication of “spammers”, the higher the contribution index, the more somebody sends compared to receiving e-mail. If there is a spammer, s/he will be entangled with many, while others who are sending much less, will thus be less entangled. This results in a high Gini entanglement for that person. Extending this effect to all users will lead to high correlation between the two values.
p <
p <
indicates data missing or illegible when filed
We first tested a model with only the control variables of rank, tenure, and time since the last promotion (TSLP) was measured in months. In the subsequent models, we added the other predictors in blocks showing, in Model 4, that the only significant predictor, before adding entanglement, is Ego ART. This suggests that managers who leave the company are less responsive to e-mails and take more time to answer. In the full model, Ego ART, messages sent, contribution index, and Gini activity entanglement are significant. Including this last predictor in the model leads to a significant improvement of the McFadden's pseudo-R-squared, which more than doubles (going from 0.08 to 0.18). As we can see from Model 5, a higher Gini entanglement makes the probability of leaving the company smaller.
To evaluate the possibility of using the entanglement variable for making predictions, we used machine learning. In particular, we used a tree boosting model named CatBoost and its related Python library (Prokhorenkova et al., 2018). This boosting approach is now well-known and has proven its usefulness in past research, where it also sometimes outperformed other supervised machine learning methods, such as Support Vector Machines (SVM) and Random Forest Models (Huang et al., 2019). The model performance has been assessed through Monte Carlo Cross Validation (Dubitzky et al., 2007), with 300 random splits of the dataset into train and test data (75% vs 25%). Thanks to the contribution of our variables, we could achieve an average accuracy of predictions of 80.25%, with an average value of the Area Under the ROC-Curve (AUC) of 0.81.
In a second step, we considered the average model resulting from cross-validation and used it to interpret the impact of each variable on predictions (calculated as the average of its absolute Shapley values). We used the SHapley Additive explanations (SHAP) Python package (Lundberg and Lee, 2017). This method proved to be particularly suitable for tree ensembles and to work well also with respect to other approaches (Lundberg et al., 2020, 2018). As
Case study C—employee performance: We analyzed the e-mail interactions of 81 managers working for a big international services company. Every year the performance of managers was evaluated by their bosses and by the HR department. Whereas the rating of almost all of these managers was “exceeded expectations” for the year 2015, we noticed that 15 of them obtained a lower rating. Like in the case study B of resigning senior executives, we were interested in understanding if entanglement could be related to individual work performance. Carrying out a t-test, we could see that there is a significant difference between the Gini coefficients of betweenness entanglement EB scores of tops (M=0.0508, SD=0.0061) and low (M=0.469, SD=0.0028) performers, t(79)=2.432, p=0.0017.
As we did for leavers in case study B, we additionally built logistic regression models to assess the combined impact of variables on the probability to be a low performer. Pearson's correlations among our predictors are presented in Table 4. The highest correlation of entanglement is again with the contribution index, but this time lower than case study B.
p <
p <
indicates data missing or illegible when filed
As Table 5 shows, in the full model the p-value of Gini entanglement is only <0.1; however, the inclusion of this variable leads to a good improvement of the McFadden's pseudo-R-squared, from 0.2314 (Model 4) to 0.2803 (Model 5). A significant performance improvement is also obtained by including weighted betweenness centrality.
indicates data missing or illegible when filed
The usefulness of the entanglement predictor is confirmed by the results of the CatBoost model that we trained to classify managers into top and low performers. We followed the same procedure as in the previous case study B—i.e., a Monte Carlo cross-validation with 300 repetitions—and obtained good average results (Accuracy=74.73%, AUC=0.68).
Case study D—Customer Satisfaction: In this case study, we show that entanglement is significantly related to team performance, measured as customer satisfaction through the Net Promoter Score (NPS). 13 teams within the company participated in our study, comprising a total of 82 managers. Each team was dedicated to a specific client.
We measured betweenness entanglement of each team by taking the group betweenness entanglement of each member and considering group dispersion by means of the Gini coefficient.
We find that high group betweenness entanglement inequality is positively related to team performance—this time measured as customer satisfaction. Running a Pearson's correlation test, we find a significant association of the Gini group betweenness entanglement with team performance (r=0.0522, p=0.0002). For each team, we have repeated measures over three time periods. Therefore, we used multilevel linear models (Hoffman and Rovine, 2007; Nezlek, 2008; Singer and Willett, 2009) as a more appropriate technique to evaluate the possible effect of entanglement on customer satisfaction. We nested repeated measures into groups (level 2). Results are presented in Table 6.
As the table shows, the biggest variance proportion can be attributed to team characteristics: the intraclass correlation coefficient is 0.7604, meaning that 76% of the empty model variance is at level 2 (Model 1). Including the entanglement variable in the model (Model 2) reduces this variance by 30.56%, which is a highly significant result for a single predictor. The higher the inequality in group betweenness entanglement is, the happier the customer is. Similarly, to case study A, this confirms that selective communication of teams, where some team members are highly entangled and others are not, leads to happier customers.
Table 7 shows the summary of the cases.
1) A data retrieval module 1102: This module interfaces with email servers or other communication platforms to collect electronic communication data 1114. It uses secure APIs or database queries to extract relevant metadata such as sender, recipient, timestamp, and message content.
2) An entanglement analysis module 1104: This core component implements the algorithms for calculating entanglement metrics. It processes the raw communication data 1114 to construct social network graphs and compute various centrality measures. This module sends performance measures 1116 to the outcome prediction module 1106
3) An outcome prediction module 1106: This module uses the calculated entanglement metrics along with other relevant data to predict performance outcomes, employee turnover, or other organizational behaviors of interest.
4) A processor 1110: A central processing unit that executes the computations required by the data retrieval module 1102, entanglement analysis module 1104, outcome prediction module 1106, and other modules.
5) Memory 1112: Both volatile (RAM) and non-volatile (e.g., SSD) storage for holding data and program instructions, including instructions for the data retrieval module 1102, entanglement analysis module 1104, and outcome prediction module 1106. Memory 1112 also holds the system's database.
6) User interface 1108: A graphical interface for displaying results and allowing user interaction with the system. The User interface 1108 may show the results from the outcome prediction module 1106 and may collect parameters for the data retrieval module 1102.
The modules interact as follows: The data retrieval module 1102 periodically collects new communication data 1114 and stores it in the system's database. The entanglement analysis module 1104 processes this communication data 1114 to calculate up-to-date entanglement metrics, called performance measures 1116. The outcome prediction module 1106 then uses these metrics, along with other relevant data, to generate predictions or insights. Results are presented to users through the graphical User interface 1108.
The entanglement analysis module 1104 implements the following algorithm to calculate activity entanglement, as seen in
1) For each pair of individuals (x, y) in the network (each pair of individuals 1304):
a) Construct time series A(xT) and A(yT) representing their communication activity over time window T (time series is constructed of the communications activity 1306).
b) Calculate the Euclidean distance d(A(xT), A(yT)) between these time series (Euclidean distance 1308).
c) Compute the degree centralities CD(xT) and CD(yT) for both individuals (compute centrality measures 1310).
d) Calculate the activity entanglement EA(xT, yT) (entanglement formula 1312) using the formula:
2) Repeat the process for betweenness entanglement, using betweenness centrality time series instead of activity.
3) For group betweenness entanglement:
a) Calculate the group betweenness centralization CGBT for each time window.
b) For each individual x, calculate EGB (xT) using the formula:
4) Calculate Gini coefficients for the distribution of entanglement scores across the network.
The system implements these calculations using efficient matrix operations and parallel processing techniques to handle large-scale networks (Gini Coefficient 1208).
Before calculating entanglement metrics, the system performs several preprocessing steps:
1) Data cleaning: Removing or correcting invalid entries, handling missing data, and resolving inconsistencies in the communication records.
2) Time window selection: Dividing the data into appropriate time windows (e.g., daily, weekly) based on the analysis requirements.
3) Activity normalization: Scaling communication activity measures to account for differences in overall activity levels between individuals or time periods.
4) Graph construction: Building the social network graph representation from the communication data, including handling of multi-recipient messages and thread structures.
Next, the entanglement analysis module 1104 constructs a social network graph 1204. In one embodiment, this graph is a database of nodes linking to each other. Each email or social media communication may include a record of the originator of the message, the destinations of the message, the date and time of the message, the type of message (email, LinkedIn, Facebook, a calendar entry, a meeting entry, an instant message, a text, a phone call, a video call, etc), and other pertinent information.
Next, the entanglement analysis module 1104 calculates the entanglement metrics 1206 using the formulas described in
Optionally, the Gini Coefficient 1208 is calculated. The performance measures 1116, the results of the Gini calculation, and the entanglement metrics, are sent to the outcome prediction module 1106 to predict outcomes 1210. The Gini Coefficient 1208 is calculated using the following formula:
The predict outcomes 1210 step predicts team performance, individual performance, employee turnover, and customer satisfaction, as described above.
A time series is constructed of the communications activity 1306 by sorting each pair by date.
Then the Euclidean distance 1308 between the time series is calculated using the following formula:
The compute centrality measures 1310 are then calculated, determining the degree and betweenness factors. The results are then applied to the entanglement formula 1312:
The hardware configuration may include a Communications 1420 subsystem that provides a Wired 1422 and Wireless 1424 access to external devices through direct connection, local area networks, wide area networks, and the Internet or the Cloud 1436. Within the Communications 1420 subsystem could be interfaces to the Internet/Web Interface 1426 (such as support for web browsers and web servers), an Email Interface 1428 (such as ports for receiving and sending email, and an access mechanism for retrieving email databases for analysis), a Facebook Interface 1430 (providing access to a database of Facebook posts and chats), a LinkedIn Interface 1432 (that is able to retrieve the LinkedIn database of connections, chats, and posts), a Twitter Interface 1434 (with the ability to retrieve tweets and chats from X/Twitter), interfaces to a calendar, a list of meetings, instant messages, texts, phone calls, video calls, as well as access to other social media databases. In some embodiments, some or all of this functionality may be moved to the processor 1110 and Memory 1112 or remotely to a server accessible through the Wired 1422 or Wireless 1424 network interfaces.
The Memory 1112 could be made up of ROM 1404, RAM 1408, Disk Drives 1410, optical storage, and similar storage devices. The Memory 1112 could be local to the processor over the Bus 1402 or remote or any combination thereof. The Memory 1112 could include a System Database 1412 of the communication data 1114 retrieved by the data retrieval module 1102. The Memory 1112 could also include Modules 1416 such as the data retrieval module 1102, the entanglement analysis module 1104, the outcome prediction module 1106, and other modules.
While a specific embodiment has been shown and described, many variations are possible. With time, additional features may be employed. The particular shape or configuration of the platform or the interior configuration may be changed to suit the system or equipment with which it is used.
Having described the invention in detail, those skilled in the art will appreciate that modifications may be made to the invention without departing from its spirit. Therefore, it is not intended that the scope of the invention be limited to the specific embodiment illustrated and described. Rather, it is intended that the scope of this invention be determined by the appended claims and their equivalents.
The Abstract of the Disclosure is provided to allow the reader to quickly ascertain the nature of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in various embodiments for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separately claimed subject matter.
This application is a continuation-in-part of U.S. patent application Ser. No. 17/812,606, filed by Peter A. Gloor on Jul. 14, 2022, said application incorporated herein in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | 17812606 | Jul 2022 | US |
Child | 18794037 | US |