This invention pertains to semantic analysis, and more particularly to analyzing content to determine if a candidate is compatible with a job or a corporate culture.
In a hiring process, candidates generally present themselves to a potential employee through résumés. Additionally, these days additional information about the candidate can be found in their public online activity. Further, as the hiring process continues, additional information is available in the form of interviews, e-mail exchanges, questionnaires, etc.
Recruiters can use this information to develop an assessment of the candidate. The recruiter generally forms an assessment based on many factors, including his or her own experience, understanding of the job opening or corporate culture, reading between the lines of what the candidate is presenting, etc. Additionally, the recruiter may use his or her instinct to decide whether to recommend hiring a candidate or not.
Some aspects of these assessments are quantitative, like education level, specific degree in a specific discipline, years of experience, etc. Other aspects are qualitative like the candidate's ability to be creative, work in teams, be forceful or be courteous, etc.
The existing approach to measure, assess, and match the qualitative aspects of a candidate and a job involve: a) interviews in which people representing the job opening ask questions and evaluate the responses, b) self-assessment questionnaires in which the candidate is asked to comment upon his or her own qualitative aspects, and c) feedback from references who have worked with the candidate in the past.
The problem with the current approach is that the current approach is time consuming and does not scale up to considering large number of candidates at the same time. Moreover, assessments made by someone representing the job, the candidate himself, or a reference will not be consistent from one person to another or over time.
A need remains for a way to address these and other problems associated with the prior art.
In an embodiment of the invention, a computer can store a taxonomy. A scanner can scan content to identify phrases that correspond to grammatical patterns in the taxonomy. The computer can then calculate percentages of occurrences, for both individual grammatical patterns and combinations of grammatical patterns. The calculated percentages can then be output.
In another embodiment of the invention, the calculated percentages can be compared to calculated percentages for another source content, such as a job description or a corporate culture. The comparison can be used to determine how close a fit the content is to the source content.
The foregoing and other features, objects, and advantages of the invention will become more readily apparent from the following detailed description, which proceeds with reference to the accompanying drawings.
Memory 130 can store taxonomy 135. Taxonomy 135 provides a mapping between grammatical patterns and qualities of language. This taxonomy provides a way to analyze content about a job candidate and determine whether the job candidate is a good fit, to whatever end is desired by the reviewer. For example, one embodiment of the invention can determine whether the job candidate is a good fit for a job, whereas another embodiment of the invention can determine whether the job candidate is a good fit for a corporate culture. Although
Computer system 105 can also include scanner 140, percentage calculator 145, and outputter 150. Scanner 140 can scan a provided content to identify phrases in the content that are grammatical patterns as determined by taxonomy 135. Percentage calculator 145 can then calculate the percentage of occurrences for each grammatical pattern, relative to all grammatical patterns identified in the content. Percentage calculator can also calculate the percentage of occurrences for each combination of grammatical patterns, relative to all combinations of grammatical patterns. These calculated percentages of occurrences provide a profile of the candidate, which can be compared with other content, as needed. Finally, outputter 150 can output the calculated percentages of occurrences, as the profile of the candidate, for other uses.
Additional components of computer system 105 can include comparator 155, ranker 160, and character profile creator 165. Comparator 155 can be used to compare the calculated percentages of occurrences for one content with calculated percentages of occurrences for a second content. In this manner, the system can determine if a job candidate is a good match for either the job description or the corporate culture. For example, the second content can be a description of the job. This content will have its own grammatical patterns, which can be identified against taxonomy 135 to calculate percentages of occurrences for the second content. By comparing the calculated percentages of occurrences in the content for the job candidate with the calculated percentages of occurrences in the content for the job description, the system can determine if the candidate is a good match for the job description. A person skilled in the art will recognize that the use of a job description as the second content is an arbitrary choice, and other content can be used to determine whether the candidate is a good match. Thus, the second content could be a description of the corporate culture instead.
Ranker 160 can take calculated percentages for multiple candidates and rank them based on how closely they are a match to another content, such as a résumé or a corporate culture. Ranker 160 is discussed further with reference to
Character profile creator 165 can take the calculated percentages of occurrences and create a character profile from the calculated percentages of occurrences. The character profile can then be stored, in either short-term or long-term storage in computer system 105, or elsewhere, for later comparison with other content, either for determining a good match or for ranking purposes.
The content that is analyzed according to embodiments of the invention can be any content. For example, the content can include a résumé by a job candidate, or written material from the job candidate, a transcript of an interview with the candidate, e-mails, or essays, among other possibilities.
Taxonomy 135 does not need to cover all possible words in the language (shown as English in the drawings, but embodiments of the invention are equally applicable to other languages as well). Parts of the language that do not fit a grammatical pattern can be ignored. That is, when calculating the percentages of occurrences, the percentages of occurrences are calculated only relative to all phrases that correspond to grammatical patterns. But it is possible to calculate percentages of occurrences relative to all text in the content. In that case, the sum of all calculated percentages of occurrences can be less than 100%.
The distance between two source contents can be calculated in any desired manner. For example, distance can be measured as a count of the number of differences (between calculated percentages of occurrences for each quality) between the two source contents. Or, the distance can be adjusted by weighting different qualities differently, to reflect certain qualities that are considered more or less significant. Or, distance can be calculated by creating a vector for each source content, where each coordinate in the vector is a calculated percentage of occurrence for a quality. The distance between two source contents can then be calculated as the distance between the two vectors in N-dimensional space, again using any desired distance formula. Thus, the distance between two N-dimensional vectors can be measured using a Euclidean distance formula, or using taxicab distance, among other possibilities.
The comparison itself can be achieved by comparing the calculated percentages of occurrences for each grammatical pattern in the contents. For example, auxiliary verbs might constitute 2% of the résumé, but might constitute 4% of the job description. This difference can suggest that the candidate is less accepting than might be desired for the job. Other differences between the calculated percentages of occurrences in the contents can reflect other concerns that might exist with the candidate. The closer the candidate's content comes to matching the other content (in terms of calculated percentages), the better a match the candidate is for the job or corporate culture.
At block 640 (
At block 655, the system can rank contents based on distances from a base content (such as a job description or a corporate culture). Block 655 can be omitted, as shown by dashed arrow 660.
At block 665, the entire process (e.g., blocks 605-655, including or omitting all optional blocks, as desired) can be repeated additional times using other taxonomies, to provide alternative analyses for the content. Block 665 can be omitted, as shown by dashed arrow 670.
At block 675, a character profile can be created from the calculated percentages of occurrences, and at block 680 the character profile can be output. Blocks 675-680 can be omitted, as shown by dashed arrow 685.
The embodiments of the invention represented in the above flowcharts are merely exemplary, and are not intended to represent the only operative embodiment of the invention. Various blocks can be omitted, and the sequence of blocks can reordered, without affecting the employability of the embodiments of the invention. While the drawings might show specific ways in which blocks can be omitted or arranged, other arrangements are also possible and are intended to be covered by embodiments of the invention.
The following discussion is intended to provide a brief, general description of a suitable machine in which certain aspects of the invention can be implemented. Typically, the machine includes a system bus to which is attached processors, memory, e.g., random access memory (RAM), read-only memory (ROM), or other state preserving medium, storage devices, a video interface (185), and input/output interface (185) ports. The machine can be controlled, at least in part, by input from conventional input devices, such as keyboards, mice, etc., as well as by directives received from another machine, interaction with a virtual reality (VR) environment, biometric feedback, or other input signal. As used herein, the term “machine” is intended to broadly encompass a single machine, or a system of communicatively coupled machines or devices operating together. Exemplary machines include computing devices such as personal computers, workstations, servers, portable computers, handheld devices, telephones, tablets, etc., as well as transportation devices, such as private or public transportation, e.g., automobiles, trains, cabs, etc.
The machine can include embedded controllers, such as programmable or non-programmable logic devices or arrays, Application Specific Integrated Circuits, embedded computers, smart cards, and the like. The machine can utilize one or more connections to one or more remote machines, such as through a network interface (185), modem, or other communicative coupling. Machines can be interconnected by way of a physical and/or logical network, such as an intranet, the Internet, local area networks, wide area networks, etc. One skilled in the art will appreciate that network communication can utilize various wired and/or wireless short range or long range carriers and protocols, including radio frequency (RF), satellite, microwave, Institute of Electrical and Electronics Engineers (IEEE) 545.11, Bluetooth, optical, infrared, cable, laser, etc.
The invention can be described by reference to or in conjunction with associated data including functions, procedures, data structures, application programs, instructions, etc. which, when accessed by a machine, result in the machine performing tasks or defining abstract data types or low-level hardware contexts. Associated data can be stored in, for example, the volatile and/or non-volatile memory, e.g., RAM, ROM, etc., or in other storage devices and their associated storage media, including hard-drives, floppy-disks, optical storage, tapes, flash memory, memory sticks, digital video disks, biological storage, and other tangible, physical storage media. Associated data can also be delivered over transmission environments, including the physical and/or logical network, in the form of packets, serial data, parallel data, propagated signals, etc., and can be used in a compressed or encrypted format. Associated data can be used in a distributed environment, and stored locally and/or remotely for machine access.
Having described and illustrated the principles of the invention with reference to illustrated embodiments, it will be recognized that the illustrated embodiments can be modified in arrangement and detail without departing from such principles, and can be combined in any desired manner. And although the foregoing discussion has focused on particular embodiments, other configurations are contemplated. In particular, even though expressions such as “according to an embodiment of the invention” or the like are used herein, these phrases are meant to generally reference embodiment possibilities, and are not intended to limit the invention to particular embodiment configurations. As used herein, these terms can reference the same or different embodiments that are combinable into other embodiments.
Consequently, in view of the wide variety of permutations to the embodiments described herein, this detailed description and accompanying material is intended to be illustrative only, and should not be taken as limiting the scope of the invention. What is claimed as the invention, therefore, is all such modifications as can come within the scope and spirit of the following claims and equivalents thereto.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 61/885,415, titled “SYSTEM AND METHOD FOR SEMANTIC ANALYSIS OF CANDIDATE INFORMATION TO FIND COMPATIBILITY WITH A JOB”, filed Oct. 1, 2013, and U.S. Provisional Patent Application Ser. No. 61/885,418, titled “SYSTEM AND METHOD FOR SEMANTIC ANALYSIS OF CANDIDATE INFORMATION TO FIND COMPATIBILITY WITH A COMPANY CULTURE”, filed Oct. 1, 2013, both of which are incorporated herein by reference for all purposes. This application is also a continuation-in-part of U.S. patent application Ser. No. 13/706,044, titled “METHODS AND SYSTEMS FOR TEAM SELECTION AND HIRING BY ANALYZING TEXT”, filed Dec. 5, 2012, now pending, which claims the benefit of U.S. Provisional Patent Application Ser. No. 61/567,746, titled “METHODS AND SYSTEMS FOR TEAM SELECTION AND HIRING BY ANALYZING TEXT”, filed Dec. 7, 2011, both of which are incorporated herein by reference for all purposes. This application is also a continuation-in-part of U.S. patent application Ser. No. 13/923,164, titled “RÉSUMÉ SCREENING”, filed Jun. 20, 2013, now pending, which claims the benefit of U.S. Provisional Patent Application Ser. No. 61/662,577, titled “RÉSUMÉ SCREENING”, filed Jun. 21, 2012, and is a continuation-in-part of U.S. patent application Ser. No. 13/706,044, titled “METHODS AND SYSTEMS FOR TEAM SELECTION AND HIRING BY ANALYZING TEXT”, filed Dec. 5, 2012, now pending, which claims the benefit of U.S. Provisional Patent Application Ser. No. 61/567,746, titled “METHODS AND SYSTEMS FOR TEAM SELECTION AND HIRING BY ANALYZING TEXT”, filed Dec. 7, 2011, all of which are hereby incorporated by reference for all purposes.
Number | Date | Country | |
---|---|---|---|
61885415 | Oct 2013 | US | |
61885418 | Oct 2013 | US | |
61567746 | Dec 2011 | US | |
61662577 | Jun 2012 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13706044 | Dec 2012 | US |
Child | 14495294 | US | |
Parent | 13923164 | Jun 2013 | US |
Child | 13706044 | US | |
Parent | 13706044 | Dec 2012 | US |
Child | 13923164 | US |