The invention relates generally to a system and method for implementing a natural language processing platform that supports loan financing and other industries.
The U.S. residential mortgage industry has been transformed from a straightforward product and process into a highly complex, extremely interdependent ecosystem that remains characterized by redundant and manual practices. Complexity exists at every step, and the industry has been gradually adopting technology to streamline the origination process to make the consumer experience at the earliest stage of the process smoother and faster.
This complexity, however, has led to a significant decline in efficiency and customer experience in the mortgage servicing stage of the mortgage lifecycle, which suffers from a host of inadequacies due to its distinct set of regulatory, agency, and market-driven requirements as well as its dependence on an archaic, legacy technology platform first developed over 50 years ago. These problems are exacerbated by the fact that loan repayment takes place over decades, creating a cumulative snowball of inefficiencies that cost servicers precious time and money.
Homeownership is an important financial goal and taking out a mortgage loan and purchasing a home is the biggest financial decision and largest—and longest—investment most people will make in their lifetimes. The lifecycle of a mortgage loan begins with the origination process, which includes applying for a loan, processing the application, credit checks and considerations by the lending institution, and the closing process. Once the loan is closed, the debt becomes the legal responsibility of the borrower, and the lending institution retains a legal lien on the property until the final payment is made.
Post-closing, the most common action is for a lender to sell off portions of the loan or even the entire loan to other lenders and investors in what is known as the secondary market. The repayment phase of the loan is by far the longest, lasting 15 to 30 years or longer. This phase is managed by a mortgage servicer. Once the loan is repaid in full, the lien is removed from the property and the borrower owns the home completely.
Outstanding U.S. residential mortgage debt is approximately $12.5 trillion and the ecosystem comprises four major stakeholders-originators, investors, servicers, and customers, as well as peripheral players, including the Consumer Financial Protection Bureau (CFPB), rating agencies, auditors, and government-sponsored enterprises (GSEs).
Mortgage servicing is the backbone of the mortgage ecosystem. Without rethinking the entire servicing operation, implementing industry-wide transformation is difficult, if not unachievable. The mortgage servicing industry's inability to innovate and implement new technologies has been hampered by its ties to an antiquated legacy technology system. The current system hampers efficient workflows, costs an excessive amount of money to run and maintain, and requires an increasing number of staff members to create and manage the integration of endless third-party systems and workaround solutions. In short, the current system lacks the agility necessary to reckon with the unprecedented digital transformation hurtling toward the industry. Servicers are forced to integrate third-party systems that raise technological and regulatory risk as well as overall costs.
Currently, there are no tools in the existing legacy infrastructure to effectively manage workflow and streamline operations.
These and other drawbacks exist.
According to an embodiment, the invention relates to a system that implements a natural language processing engine. The system comprises: an interface that communicates with one or more client systems via a communication network; and a natural language processing engine comprising a computer processor coupled to the interface and a memory component, the computer processor further configured to: receive, via the interface, an input data comprising one or more guidelines; extract, via the computer processor, a plurality of text segments from the input data using a verb base approach where at least one verb forms a root of each text segment; label, via the computer processor, each of the plurality of text segments with a depth-parameter number based on a weight-based approach wherein the weight-based approach determines a weight that represents text relevance using a set of factors based on text size, text weight, indent and text format; generate, via the computer processors, a node-tree structure that represents the weight for each of the plurality of text segments; and apply, via the computer processor, the weight-based approach within a predetermined scope as defined by a data-dictionary and a knowledge graph.
According to another embodiment, the invention relates to a method that implements a natural language processing engine. The method comprises the steps of: receiving, via an interface, an input data comprising one or more guidelines wherein the interface communicates with one or more client systems via a communication network; extracting, via a natural language processing engine that comprises a computer processor, a plurality of text segments from the input data using a verb base approach where at least one verb forms a root of each text segment; labeling, via the computer processor, each of the plurality of text segments with a depth-parameter number based on a weight-based approach wherein the weight-based approach determines a weight that represents text relevance using a set of factors based on text size, text weight, indent and text format; generating, via the computer processors, a node-tree structure that represents the weight for each of the plurality of text segments; and applying, via the computer processor, the weight-based approach within a predetermined scope as defined by a data-dictionary and a knowledge graph.
The system and method may utilize a specially programmed computer system comprising one or more computer processors, interactive interfaces, electronic storage devices, and networks. The computer implemented system, method and medium described herein provide unique benefits and advantages to consumers/borrowers, investors, auditors and other participants, users or entities, according to various embodiments of the invention.
An embodiment of the present invention recognizes the challenges facing the mortgage industry and those of servicers in particular. An embodiment of the present invention is directed to providing the tools and customized technology to streamline and enhance existing operations while breaking down barriers to entry for those looking to enter the servicing business.
The innovative platform architecture and module design provide customers the option of a complete or single, customizable module solution. An embodiment of the present invention provides an efficient cloud-based implementation with lower coordination and support costs. The transformation of servicing creates a more secure, efficient, digitized mortgage ecosystem enabling all stakeholders to leverage decentralized finance.
These and other advantages will be described more fully in the following detailed description.
In order to facilitate a fuller understanding of the present invention, reference is now made to the attached drawings. The drawings should not be construed as limiting the present invention, but are intended only to illustrate different aspects and embodiments of the invention.
The following description is intended to convey an understanding of the present invention by providing specific embodiments and details. It is understood, however, that the present invention is not limited to these specific embodiments and details, which are exemplary only. It is further understood that one possessing ordinary skill in the art, in light of known systems and methods, would appreciate the use of the invention for its intended purposes and benefits in any number of alternative embodiments, depending upon specific design and other needs.
An embodiment of the present invention is directed to an innovative platform that improves servicing of mortgage loans. Participants of the mortgage business may include customers/borrowers who take a loan to purchase residential real estate and make monthly payments towards the loan or mortgage. Investors own the loan while servicers are responsible for collecting the monthly payments which are placed in a custodial account and then remitting the proceeds to the investors (or owners of the loan). In addition to collecting the monthly payments, servicers also apply the principal, taxes, insurance and make payments such as property tax. If payment is not received, then the servicer contacts the borrower. Auditors perform an audit that involves a comprehensive analysis of mortgage data to determine any violations or discrepancies associated with a borrower's mortgage. This may involve ensuring compliance with various regulations.
An embodiment of the present invention is directed to a data-driven, analytics-based system that increases efficiencies and reduces costs for all stakeholders in the ecosystem. An embodiment of the present invention is directed to leveraging Artificial Intelligence (AI) and Machine Learning (ML) technologies focused on complex borrower issues and prioritizing customer care.
For originators, AI and ML models may use data to predict conversion rates and help originators focus on high-value leads more likely to take out loans. These models may also flag at-risk loans in the origination/underwriting process and detect potential fraud. For servicers, AI and ML may be applied to build models for call/contact prioritization, e.g., “best time to call,” based on customer behaviors and identification of borrowers looking for workout agreements or facing changes in their finances. Investors may benefit from improved default and prepayment models, which help develop hedging strategies designed to mitigate risk. Borrowers may access personalized products, advice, and better servicer communication. In addition, self-service decision support may assist borrowers to evaluate the financial impact of workout options when facing payment challenges as well as simplify and speed up the loan modification process.
Operational efficiencies may be improved across the platform as blockchain technology, artificial intelligence, and machine learning eliminate or substantially reduce inefficient and arcane processes while ensuring regulatory compliance. Efficiencies may be realized in other areas in the servicing operation, including loan boarding, loan administration, escrow, tax and insurance, cash processing, letter writing, records and collateral, loss mitigation, foreclosure, bankruptcy, claims processing, etc.
An embodiment of the present invention is directed to implementing a platform with an intuitive user interface that substantially reduces the need for systems training and increases productivity. According to an embodiment of the present invention, the user interface may enable borrowers to easily access information about their loans, make payments accurately, upload and receive documentation easily, and communicate seamlessly with their servicers in a self-service digital manner. In addition, a fully responsive browser and mobile app will allow customers and operations professionals to securely work and manage loans from anywhere.
System 130 may interact with various entities as an integrated system/component or as a third party service entity. For example, an entity, such as a financial institution, may host System 130. Users may interact with System 130 via Network 120. System 130 may receive data from various sources including Databases 180, 182, Data Source(s) 186, etc. Databases 180, 182 may store and manage data relating to real estate properties, loans, transactions, analytics, visualizations, etc. Various forms of data may be stored and managed. Other implementations and relationships may be realized.
Interface 144 may enable various users to interact with components and services provided by System 130. Interface 144 may include browser or other user interactive interface. User interaction may be supported through various user devices, including computers, laptops, mobile devices, smartphones, tablets, etc. Users may include Originator 102, Servicer 104, Investor 106, Borrower 108, Auditor 110 and other users represented by Other User(s) 112, such as various agencies. While a single entity is shown for illustration purposes, multiple users and/or participants may be supported.
For example, Investor 106, such as a loan owner, may interact with System 130 for investor reporting, portfolio surveillance, modeled versus actual comparisons, etc. Customer/Borrower 108 may interact with System 130 through a website, mobile app, chatbot as well as other self-service options. For example, Customer/Borrower 108 may make payments, receive loss mitigation assistance, etc. Other User(s) 112 may represent Accounting Services to perform automated bank reconciliation, automated general ledger entries, forecasting models, etc. Other User(s) 112 may include Call Centers that interact through an Intelligent Agent Portal. Other operations that may be supported include Non-Performing Loan Operations (e.g., loss mitigation, foreclosure/bankruptcy, claims, etc.) and Performing Loan Operations (e.g., loan boarding, loan administration, escrow, tax and insurance, cash processing, letter writing, records and collateral, etc.).
For non-performing assets, borrowers may agree to a modified payment plan for delinquent loans. The new terms of the payment plan, along with borrower communications, may be stored and tracked on a blockchain. This promotes efficiency and ensures compliance with various regulations and other requirements, including timely notifications, etc.
Blockchain API 168 may communicate with a Blockchain Node, NFT Platform 176 and/or other networks to register a secured token on a blockchain or distributed ledger.
System 130 may support various services including loan processing (e.g., originating loans, loan servicing, etc.), communications and services, payment processing, accounting services, auditing services, etc.
Loan Processing Module 132 may support loan set-up, transfers and processing, as shown by Loan Boarding 134 and Data Processing Tool 136.
Loan Boarding Module 134 is directed to facilitating loan setup and transfer, reduce the costs, and deliver benefits that percolate throughout the loan lifecycle. Loan Boarding may involve the process where loans change hands from lenders to servicers or servicers to servicers. Handling sensitive data, such as borrower details, loan terms and conditions, and original documents at this stage makes this already data-heavy process extremely vulnerable to errors. The process, which can take from 30 to 90 days, includes receiving the data and documents from the prior lenders/servicers, extracting and mapping data from the files to servicers' systems, identifying discrepancies in data/documents, and preparing the portfolio for servicing after remediating the discrepancies.
Data Processing Tool 136 is directed to processing and deciphering data coming from multiple sources that lacks standardization—and to do it in the shortest possible time by mapping data to standard data fields. Data Processing Tool 136 focuses on at least three core aspects of loan boarding data: Data Transfer, Data Mapping and Data Storage.
Data Transfer may involve a Blockchain-based loan transfer. Data Transfer may offer security and easy accessibility to authorized personnel. It further achieves faster, reliable, and cost-efficiency as data moves seamlessly over a secure network and records changes that occur during the process.
Data Mapping may include a ML-based mapping module and support NLP-based document recognition and indexing. This further reduces manual intervention and improves accuracy as well as increases the turnaround time for processing each set of data.
Data Storage provides a reliable and cost-efficient cloud infrastructure to store data. This offers flexibility and scalability in addition to easy accessibility and management and further drastically reduces the cost by eliminating huge investments in storage infrastructure.
Communication Module 140 supports various modes of communication, including via website, mobile device, chat feature, etc. Communication Module 140 may also provide an enhanced customer service experience shown by 142 and interactions via various interfaces, shown by 144. An embodiment of the present invention is directed to facilitating a customer's interaction with the servicer across multiple mediums.
Customer Service functions may include: website registration and customer onboarding; co-browsing capabilities; multiple payment options; access to bills and updates; real-time payment processing; payoff automation; homeowners assistance; education videos, etc.
Payment Processing Module 150 is directed to simplifying various cashiering and payment processes through user-friendly, automated platforms. This enables servicers to have freedom and control over the processes as they configure them to suit their terms and conditions. Payment Processing Module 150 may support automated and real-time payment applications via payment plans or schemes (e.g., waterfall payment, installment payments, etc.) as shown by Real-Time Payment 152; real-time bank transactions and reconciliation as shown by Real-Time Bank 154; and real-time general ledger (GL) posting and generation as shown by Real-Time GL 156.
Real-Time Payment 152 supports a configurable payment waterfall to enable automated payment application with minimal human intervention. This may include a pre-configured government sponsored enterprise (GSE) waterfall template for a plug-and-play experience for associated servicers. Real-Time Payment 152 provides a faster and reliable process to perform repetitive tasks, reducing personnel cost; and further minimizing regulatory and compliance issues arising out of erroneous application of principal, interest, escrow, fee, and advances. Real-Time Payment 152 may further reduce the error-rate by running processes through an automated, pre-configured program.
Real-Time Bank 154 supports real-time bank transaction and reconciliation. Bank transactions post in real-time, creating an authentic and verifiable transaction trail for reconciliation and audit purposes. Real-Time Bank 154 may mitigate chances of missed or erroneous postings thereby reducing manual reconciliation of historical data. Accordingly, overall costs and resources may be reduced due to reduction or elimination of errors and reversals where real-time data may further support effective and timely decision making.
Real-Time GL 156 supports real-time general ledger (GL) posting and generation. GL entries for transactions may be generated and posted in real-time to appropriate GLs. In addition, GL entries and balances may be available for review at any time. Accounting ledgers may be available with up-to-date transaction history. For example, GL balances and posting may be retrievable in one click or other simplified user interaction. With an embodiment of the present invention, dependable GL posting functionality makes third-party accounting systems redundant, resulting in immense cost savings on systems and personnel.
Communication Tool 140 may represent various forms of communication to support customer service functions, as shown by 142. Communication Tool 140 may also include an Interface 144 that interacts with a user as well as user system or application. For example, Communication Tool 140 may facilitate sending letters and notices to borrowers during a servicing tenure, which is traditionally an extremely cost-intensive.
Customer Portal 1146 may leverage artificial intelligence and machine learning to deliver a personalized and exponentially improved customer experience within an intuitive and customizable user interface.
An embodiment of the present invention may support by various mobile applications, as represented by Mobile App 1147.
An embodiment of the present invention is directed to reducing calls to call centers while increasing customer satisfaction with an improved integrated portal. Driven by machine learning technology, an embodiment of the present invention is directed to proactively predicting customer behavior and delivering situational messaging and notifications based on a customer's previous site activity as well as real-time interactions and recent loan activity. Predictive Actions 1148 provides AI and ML driven custom and predictive calls to action and tailored messaging.
Other features may include: integration with real estate marketplaces for increased engagement and cross-sell opportunities; Integrated Chatbot 1149 provides extensive functionality and superior responsiveness; custom calculators; an Integrated Learning Center, as shown by 1150; and Loan Management 1151 that provides streamlined loan management including integrated flexible payment options.
An embodiment of the present invention is directed to providing a personalized dashboard for real-time loan information, as shown by Dashboard 1155.
An embodiment of the present invention is directed to homeownership support with real-time insights and recommendations for informed decision-making, via a Homeowners Assistance (HOA) App. HOA financial details may be provided via HOA Financials 1152.
For customers facing financial challenges, the HOA App provides self-service tools and resources to request payment assistance. An embodiment of the present invention is directed to reducing default management timelines and increasing successful workout percentages with an easily integrated, user friendly online application that prequalifies customers for a payment assistance plan in minutes, and further decreasing the likelihood of serious delinquency, without a call to a customer service agent.
Other features may include: an intuitive user interface that collects customer information with no or minimal agent intervention necessary; integrated compliance and audit trail information, as shown by 1153; automated completion of various forms; automated and seamless supporting document collection, as shown by 1154; servicer support with dashboards and access to customer survey responses, as shown by 1155; and Decision Engine 1156 for prequalification with viewable waterfall.
An embodiment of the present invention facilitates and simplifies communications by providing a communication tool that may be integrated with a servicing system. In addition to offering customization of letters, the communication tool may be used to schedule recurring events and generate ad hoc letters for one or multiple loans in the portfolio. Other services may include: configurable letter templates and creating recurring and ad hoc mail campaigns. Customizable templates and schedules for repetitive events may be supported.
Investor Accounting Module 160 may provide accounting services including Real-Time Flow 162 and Real-Time Reconciliation 164. An embodiment of the present invention recognizes that investors rely heavily on the financial accounting reports provided by servicers to understand the health of their portfolios and to make crucial decisions. Oftentimes, a multitude of reports are made available on a daily/weekly/monthly basis to investors. The reports may contain information about individual loans, portfolios, bank accounts, and expenses.
Pulling relevant data, performing accounting transactions, and reconciling every detail can be overwhelming for employees. Incompetent systems coupled with outdated technology only make matters worse.
An embodiment of the present invention addresses the gravity of the issue through its servicing system with features that post transactions and generate reports in real-time. The comprehensive accounting and reporting feature provides valuable insights into a portfolio at multiple levels for user-defined durations.
Real-Time Flow 162 supports a real-time flow of a monies dashboard. For example, a dashboard may illustrate a bird's eye view of the portfolio and further enable the user to perform detailed analysis of desired parameters with a simple user interaction, e.g., one click. Managerial review may be supported through the dashboard that shows monies flowing through the servicing lifecycle of loans. This also provides real-time information to enable timely decision making and corrective measures to reduce monetary and reputational losses.
Real-Time Reconciliation 164 provides real-time custodial and bank account reconciliation. Transactions may be posted and recorded in real-time in custodial and bank accounts thereby reducing time and cost of fetching data for reconciliation.
Loan Data on Blockchain 166 and Loan Documents on Blockchain 167 are directed to storing loan data and loan documents on a distributed ledger, such as a private blockchain through a Blockchain API 168 or other interface. This enables access to an entire history of transactions and relevant data associated with a loan. An embodiment of the present invention supports various users, participants, etc. For example, investors may access data on the blockchain in real-time through an interface, such as an investor portal. With current systems, servicers and investors require back and forth communications which tend to lead to delay and inefficiencies. With an embodiment of the present invention, daily cash details are available in real-time. This facilities cash reconciliation, audit processes, etc. Additional details are provided in related application, U.S. patent application Ser. No. ______, titled “SYSTEM AND METHOD FOR IMPLEMENTING A BLOCKCHAIN PLATFORM,” filed concurrently, the contents of which are incorporated herein by reference in their entirety.
Loan Servicing Transfer 170 facilitates transfers from one servicer to another. An embodiment of the present invention recognizes that investors may select different servicers for a particular loan or portfolio of loans. For example, an investor may transition from servicer 1 to servicer 2. With current systems, servicer transitions require a transfer of many years' worth of data with back and forth communications. The current process, however, is time-consuming with a high risk of data loss and errors. With an embodiment of the present invention, storing loan data on the blockchain with supporting data enables users to easily access the data and further ascertain when each document was generated, transmitted, shared, etc. Other loan history data may be available. Accordingly, the servicer transition process of an embodiment of the present invention is highly efficient, simplified with significant cost savings. Additional details are provided in related application, U.S. patent application Ser. No. ______, titled “SYSTEM AND METHOD FOR IMPLEMENTING A BLOCKCHAIN PLATFORM FOR LOAN SERVICING,” filed concurrently, the contents of which are incorporated herein by reference in their entirety.
An embodiment of the present invention recognizes that investors also sell pooled loans in a secondary market where investors may then sell to a new investor. In this scenario, the new investor (or buyer) may want to perform their own due diligence. With an embodiment of the present invention, due diligence may be performed on the blockchain by appraising the loan, accessing historical data and verifying information in real-time.
An embodiment of the present invention is directed to developing an innovative Natural Language Processing (NLP) Engine, as shown by NLP Engine 172. As shown in
Secured Tokens/NFT 174 may communicate with various platforms and marketplaces, as illustrated by NFT Platform 176. An embodiment of the present invention is directed to generating a secured token for a specific property which may be unique identified based on a combination of various characteristics including property address, loan date, loan amount, first payment date, etc. The NFT represents a unique secured token that may be placed on a node of a blockchain. With an embodiment of the present invention, the NFT may be transacted on in a marketplace or platform, such as a NFT marketplace. As equity in the property grows through monthly payments, the owner may extract a portion of the equity on marketplace which may be used for a new transaction or instrument, such as a new line of credit. Accordingly, the NFT marketplace/platform supports a decentralized environment for the mortgage industry. Additional details are provided in related application, U.S. patent application Ser. No. ______, titled “SYSTEM AND METHOD FOR IMPLEMENTING A BLOCKCHAIN PLATFORM THAT CREATES AND MANAGES SECURED TOKENS,” filed concurrently, the contents of which are incorporated herein by reference in their entirety.
For purposes of illustration, an embodiment of the present invention may be applied to mortgages such as residential property. However, the invention is not limited to mortgages and may be applied to other types of loans including credit card loans, automobile loans, student loans, small business loans, etc. Moreover, the embodiments of the present invention may be applied to various other industries, beyond the mortgage industry.
The system 100 of
Networks 120, 122 may be a wireless network, a wired network or any combination of wireless network and wired network. For example, Networks 120, 122 may include one or more of an Internet network, a satellite network, a wide area network (“WAN”), a local area network (“LAN”), an ad hoc network, a Global System for Mobile Communication (“GSM”), a Personal Communication Service (“PCS”), a Personal Area Network (“PAN”), D-AMPS, Wi-Fi, Fixed Wireless Data, IEEE 802.11a, 802.11b, 802.15.1, 802.11g, 802.11n, 802.11ac, or any other wired or wireless network for transmitting or receiving a data signal. Also, Networks 120, 122 may support an Internet network, a wireless communication network, a cellular network, Bluetooth, or the like, or any combination thereof. Networks 120, 122 may further include one, or any number of the exemplary types of networks mentioned above operating as a stand-alone network or in cooperation with each other. Networks 120, 122 may utilize one or more protocols of one or more network elements to which it is communicatively coupled. Networks 120, 122 may translate to or from other protocols to one or more protocols of network devices. Although Networks 120, 122 is depicted as one network for simplicity, it should be appreciated that according to one or more embodiments, Networks 120, 122 may comprise a plurality of interconnected networks, such as, for example, a service provider network, the Internet, a cellular network, corporate networks, or even home networks, or any of the types of networks mentioned above.
Data may be transmitted and received via Networks 120, 122 utilizing a standard networking protocol or a standard telecommunications protocol. For example, data may be transmitted using Session Initiation Protocol (“SIP”), Wireless Application Protocol (“WAP”), Multimedia Messaging Service (“MMS”), Enhanced Messaging Service (“EMS”), Short Message Service (“SMS”), Global System for Mobile Communications (“GSM”) based systems, Code Division Multiple Access (“CDMA”) based systems, Transmission Control Protocol/Internet Protocols (“TCP/IP”), hypertext transfer protocol (“HTTP”), hypertext transfer protocol secure (“HTTPS”), real time streaming protocol (“RTSP”), or other protocols and systems suitable for transmitting and receiving data. Data may be transmitted and received wirelessly or in some cases may utilize cabled network or telecom connections such as an Ethernet RJ45/Category 5 Ethernet connection, a fiber connection, a cable connection or other wired network connection.
While
System 130 may be communicatively coupled to Databases 180, 182. Databases 180, 182 may include any suitable data structure to maintain the information and allow access and retrieval of the information. For example, Databases 180, 182 may keep the data in an organized fashion and may be an Oracle database, a Microsoft SQL Server database, a DB2 database, a MySQL database, a Sybase database, an object oriented database, a hierarchical database, a flat database, and/or another type of database as may be known in the art to store and organize data as described herein.
Databases 180, 182 may be any suitable storage device or devices. The storage may be local, remote, or a combination thereof with respect to Databases 180, 182. Databases 180, 182 may utilize a redundant array of disks (RAID), striped disks, hot spare disks, tape, disk, or other computer accessible storage. In one or more embodiments, the storage may be a storage area network (SAN), an internet small computer systems interface (iSCSI) SAN, a Fiber Channel SAN, a common Internet File System (CIFS), network attached storage (NAS), or a network file system (NFS). Databases 180, 182 may have back-up capability built-in. Communications with Databases 180, 182 may be over a network, or communications may involve a direct connection between Databases 180, 182 and an entity. Databases 180, 182 may also represent cloud or other network based storage.
With all data and transactions on the blockchain, various actions are available using the same blockchain source data. This promotes consistency and accuracy of real-time transaction data. For example, the system may post to an accounting system, perform bank reconciliations, generate investor reports, etc.
An embodiment of the present invention may be applied to various stages of a loan, including loan boarding, loan processing and loan modifications (e.g., new terms, new payment plan). All of this may be captured on the blockchain along with various borrower communications.
Data and documents may be stored on the blockchain from lenders, prior servicers, existing servicers, etc. For example, transactions representing cash flow and other actions may be stored on the blockchain from various sources including payment gateways, borrowers, payments made to vendors, invoices for third parties, taxes being paid, etc.
An embodiment of the present invention is also directed to assisting customers with loss mitigation, modifications, borrower assistance, collection agency interactions, communications, etc. In such instances, there is a constant communication with the borrower through various channels including email, chat, text, regular mail, etc. Accordingly, the communications and hashcodes may be stored on the blockchain. This further supports compliance with servicing guidelines as well as regulatory and other requirements. Moreover, cross referencing and verification procedures and efforts may be simplified and streamlined.
Using the loan identifier, loan details along with the Blockchain Network and Channel Data may be retrieved from a data source, such as relational database management system (RDBMS) 232. User identity may be validated through Fabric SDK 234 which communicates with Blockchain Network 236. App Server 228 may then perform an operation GET/SET on the Blockchain Network 236 through a Specific Channel 238. An embodiment of the present invention may use, an open-source permissioned blockchain framework. Permissioned blockchain framework is a modular framework that offers identity management and access control features.
An embodiment of the present invention is directed to implementing Blockchain in a Mortgage Ecosystem. An embodiment of the present invention combines blockchain technology with project/product technology to enable users to achieve an exponential increase in process automation, such as increased transaction transparency, activity tracking, data security, regulatory compliance and audit, and real-time data availability. For example, by combining blockchain technology with project technology, end users may interact and access loan data stored on a blockchain project user interface, access blockchain data through project APIs, etc.
Additional benefits may include: secure record keeping; immutable data (trust moves from humans to the blockchain as manual interventions are reduced); and increased transparency as all permissioned parties on the network may view records and cross verify with their own systems, reducing manual reconciliation of data. Data verification may be improved due to the blockchain storing all historical transaction data. Smart contracts enable the system to automate rules and validations before a transaction is entered onto the blockchain. The system may automatically accept or reject a transaction based on defined rules in the contract. Smart contracts may be available to all permissioned parties on the network who are then able to verify transactions before submitting their consensus to approve or reject.
In accordance with an embodiment of the present invention, storing mortgages, e.g., the loan data, on a blockchain (or distributed ledger technology) realizes significant efficiencies and benefits, such as saving mortgage servicers time, lower expenses, and reducing the operational burden of meeting regulatory requirements by eliminating manual document tracking and information reconciliation. An embodiment of the present invention further facilitates digitized payments and payment recording with embedded credit, compliance, risk alerts, and reporting.
An embodiment of the present invention is directed to a core servicing solution that moves data from various transactions in the blockchain to a secure data warehouse at the end of the day or other interval. Accordingly, this data may be easily and readily available at any time, with no or minimal security risk or potential manipulation issues. This seamless access to data significantly improves the overall response time of the servicing system. Additionally, the core servicing solution automates a large percentage of the servicing process and is further developed with a clean and intuitive dashboard with features designed to aid in remittance, cash accounting, and reporting. This facilitates real-time data sharing between various participants including servicers, investors, GSEs, regulators, and borrowers.
To address cybersecurity concerns, in particular as they relate to lending institutions, an embodiment of the present invention is directed to encrypting non-public data where the blockchain's inherently decentralized nature effectively executes an information security agenda.
The system of
Initially, an investor may not be prepared to join the blockchain and may want to continue with its traditional means of data communication. In that case, an embodiment of the present invention may add that investor to the blockchain as a proxy organization, ensuring that the system will be ready for a future investor request to participate in the blockchain network. This provides consistency in data storage at the blockchain end along with a future option for the investor to start actively participating in the blockchain. Other scenarios and situations may be realized.
As shown in
With Payment Data/Loan Transactions, represented by 372, after processing the payments received from the borrower, an embodiment of the present invention may generate loan transactions prepared as digital assets and therefore candidates to store into the blockchain. Other activity in the system/application, that may or may not trigger amount movement, may generate a loan transaction that is a candidate for blockchain. An embodiment of the present invention may store loan transactions into the blockchain channel based on the loan investor.
With Custodial Bank Account Balances (e.g., Bank Balances 374, Loan Balances 376, Loan Documents 378, etc.), during the servicing of a loan, various activities taken on an account may require auditing where the reconciliation of accounts and balances may be considered a significant servicer responsibility. An embodiment of the present invention may store investor custodial account balances in the blockchain so that they may be made available to the investor in real-time as the transaction is occurring in the system. An embodiment of the present invention also stores servicer account balances in the blockchain so that the servicer may reconcile their own bank account balances with the system.
An embodiment of the present invention recognizes the importance for a servicer to store communications with borrowers for compliance with audits, adherence to government guidelines and reliance on data as proof in case of bankruptcy or foreclosure of the property. Communication data may include: emails, short message service (SMS), text messages, chat messages, facsimiles; letters, notices, disclosures or other documents sent to borrower electronically or regular mail; phone calls via call centers, representatives, etc. Communication data, which is specific to a borrower, may be prepared as digital assets and stored on the blockchain, which then becomes authenticated and tamper-proof data. The servicer may then rely on this data and further share with auditors or rely on the data for bankruptcy, etc.
An embodiment of the present invention is directed to adding loan details along with all financial transaction information on the blockchain for each loan.
To create transparency in operations, communication and activities and further ensure compliance to various rules defined by government agencies, such as Federal Housing Administration (FHA) or Veterans Affairs (VA), and guidelines defined by investors, such as Fannie Mae or Freddie Mac, a servicer may store loan details and transaction information on the blockchain.
Loan details may include: loan information; financial activities; general ledger entries; loan documents; communications, etc. Loan information may include: Borrower Name, Address, Phone/Mobile, Loan Note Amount, Note Date, current interest rate, etc. Loan information may also include Property Valuation data, Bankruptcy data, Foreclosure data and Loss Mitigation data. Property Valuation may include: Estimated Property Value and Property Valuation Date, etc. Bankruptcy may include: Bankruptcy Chapter, Bankruptcy Case Number, Bankruptcy Setup Date, Bankruptcy Filed Date, Bankruptcy Type, Number of Days in Bankruptcy, etc. Foreclosure may include: Sale Amount, Setup Date, Status Code, Removal Code, Deed Recorded Date, Stop Code, etc. Loss Mitigation may include: Loss Mitigation Open Date, Status Code, Removal Code, Template, Stage, etc. Financial activities may include: Payment Received, Principal Collected, Interest Collected, Payment transferred to Custodial Accounts (principal/interest (PI) & taxes and insurance (TI)) or Suspense Account, Escrow Disbursement which may be recorded as “Loan Transaction” on blockchain.
General Ledger entries resultant from financial activities may also be recorded as “GL Transaction” on blockchain.
Loan documents stored on blockchain may include documents/data: received from borrower throughout the life of loan; created and sent by servicer to borrower; received from third party vendor or government agencies etc.
Communications stored on blockchain may include: emails, mail/letters, facsimile; customer care agents; communications/interactions through website/portal, etc.
Various benefits may be realized by storing loan details on blockchain that relate to compliance, reconciliation, bankruptcy and foreclosure, reporting, etc., in accordance with the various embodiments of the present invention.
Regarding compliance, if the loan information along with all financial details are available on blockchain, an auditor is not required to wait for information or a data extract from system for days. The auditor may login at any time and cross-check all the information and validate the information against guidelines (e.g., FHA, VA, Freddie Mac, or Fannie Mae guidelines) followed or not.
Regarding reconciliation, because the system is recording transactions in real-time on the blockchain, interested parties may cross-check the transactions in real-time against bank transactions and reconcile data at any point of time. As an investor is also participating on the blockchain, the transactions may also be available on a certain channel (Distributed Ledger Technology (DTL)). DTL generally refers to the protocols and infrastructure that allow computer systems in various locations to propose and validate transactions and further update records across a network. Accordingly, investors may match and verify Custodial Bank Account balances with Actual Bank Account balances. This verification may be performed in real-time when this transaction has happened at a servicer end. As a result, the investor does not need to wait for month end files from servicer for reconciliation thereby realizing significant efficiencies.
In case of bankruptcy and foreclosure, blockchain may be useful to prove the authenticity of information, documents and more importantly communication between borrower and servicers.
Regarding reporting, as information is available to various parties (e.g., Servicer, Investors, Auditors, etc.) who are part of a blockchain network, the parties may generate different reports based on their needs. For example, an auditor may create an “Income Statement” or “Trial Balance” using “GL Transactions” available on blockchain. In another example, a servicer may create a “Remittance Report” while an investor may create portfolio analysis.
According to an embodiment of the present invention, loan boarding may represent a process by which an investor transfers a pool of mortgage loans to a servicer. This data may come to the servicer in various formats and files, including .CSV. The servicer may process these files and add the loan data to the system.
Loan boarding may be considered a first step for loans entering the servicing workflow, and it is the natural starting point for adding loan data to the blockchain. Entering data in this manner from the start of the servicing cycle ensures the data is synchronized with the processes and activity that occur post-boarding and throughout the servicing period of the loan.
At step 410, a blockchain channel identifier may be identified. This may involve finding a blockchain channel name. Servicers service loan pools for specific investors. The blockchain network may be built between the servicers and their investor clients, where the privacy of investor data is critical. Security or other concerns may delay investor participation on a platform where data is available on a common network. To address the security concern and maintain the privacy of the data between the investors, a separate channel may be created between the servicer and each investor. Additionally, the information saved on the blockchain may be encrypted and the hash code may not be visible to the client until a privacy key and certificate are applied to decrypt it.
When a new loan object, e.g., asset, is added to the blockchain, the system may be required to know which investor owns the asset so the correct channel is selected for transactions on the blockchain.
At step 412, the system may connect to the blockchain network and validate the identity of the user attempting to post a transaction on the blockchain. Once the identity of the user is confirmed, the system allows the user to proceed.
At step 414, a loan may be created as a digital asset. The system may identify important identifying information about a loan and then bundle the information into a format that may be stored in blockchain as an asset. This data may be encrypted and then used to generate a hash value using an encryption algorithm, such as SHA 256 algorithm.
At step 416, a request may be transmitted to add loan objects (e.g., assets) to the blockchain. The system may submit a request to the blockchain, which redirects it to a smart contract. According to an embodiment of the present invention, the smart contract may represent a component used to validate the transaction, apply business logic and rules, and push a transaction request to a processing engine. In the case of loan boarding, a smart contract may be used to validate the loan information as well as prepare the transaction.
At step 418, the transaction may be committed to the blockchain ledger. Here, the system may wait for consensus (e.g., sign-off) from concerned participants in the blockchain network. The transaction requested for loan boarding may be given to the correct participants to validate its authenticity. If they find it is correct, they will provide sign-off and send it to a caller.
The client application, which has initiated this request, may check for confirmation from all parties and if it is “Yes,” from all, the commit process may be initiated. This step may then prepare a hash value for the loan object (e.g., asset) and add it to the blockchain ledger as a key-value pair. The key must be unique to add a new loan to a blockchain ledger. For example, the loan number may be used as the unique key.
Various benefits may be realized by using blockchain in a load boarding process. For example, by boarding/adding loan data in encrypted format (e.g., hashed data) to a secured channel built between the servicer and the investor, only the parties on this channel may see/access the information and have a secret key. Accordingly, all data/communications are secure and investors cannot see each other's loans on the blockchain.
With an embodiment of the present invention, blockchain enables near real-time data availability for participating organizations to see the data and further verify the data. For example, investors and auditors may be able to see loan information and review transactions in real-time without depending on the servicer to provide data in a file each night.
An embodiment of the present invention facilitates and simplifies loan transfers between servicers. Depending on employees to facilitate the labor-intensive loan boarding process is inefficient and error-prone. According to an embodiment of the present invention, blockchain may decrease transfer timelines and costs by changing the ownership of records (assets) digitally while maintaining data integrity as well as the transaction history and ownership of the loan.
Currently, these transactions are recorded in a traditional database system and shared with other parties (e.g., investors, auditors) at a fixed interval, i.e., daily, weekly, or monthly, so that other parties can also match and reconcile their systems. Servicers also use these transactions to match account balances, custodial account balances, income, and expense balances, e.g., to reconcile them with bank transactions and other balances.
By organizing transactions so that they are treated as assets on the blockchain, an embodiment of the present invention is directed to overcoming existing deficiencies of the current system and facilitating more efficient sharing of data between and within the organizations.
As shown by
For loan-level activities, during the normal course of servicing a loan, a servicer may engage in various activities that may or may not involve the movement of funds, which affects the account balances of parties, including the servicer, borrower, or custodian.
For Activity 1, a borrower makes a payment using a website/mobile app, as shown by 510, which is assigned to the payment processing unit (PPU), at 512. The PPU may process the payment based on the loan due, shortages and/or rules defined by the servicer and may divide the amount into multiple categories, such as principal and interest. Payment may be applied on different loan heads, at 514. For example, loan heads may represent loan balances which may be updated when new payments are received and then distributed under different heads of loans, such as principal, interest, escrow, advances if any. Once the categories are calculated, the system may update the loan balances at 516, including unpaid principal balance (UPB) amount, escrow balance, custodial account balances, using these categories.
According to an embodiment of the present invention, the system may generate transactions for any processes that change balances or has money moving from one account to another, at 520. These transactions are first recorded into a traditional database at 518 and then sent to the blockchain processing unit at 522.
In this example, the loan UPB amount will decrease, the escrow balance will increase, the PI custodial account and TI custodial account will also be credited and so on.
As shown in
For example, each month when this job runs, it searches for loans that are scheduled for disbursement in the current month and consolidates or groups them by escrow payment receiver/beneficiary. As shown by
Once a list is prepared and approved by the servicer, the correct amount may be transferred to the disbursement clearing account (DCA), at 544.
Payments, such as checks, may be issued to “escrow payment receiver/beneficiary” from the DCA, at 546.
As in Activity 1, the system may generate transactions for any process that changes balances or has money moving from one account to another. Transactions may be created at 548 where GL transaction process may be applied at 550. The transactions are first recorded into a traditional database at 518 and then sent to the blockchain processing unit at 522.
In this case, the loan escrow balance will decrease, the TI custodial account will be debited, the disbursement clearing account will be credited, and so on.
An embodiment of the present invention may support General Ledger Transactions. General Ledger (“GL”) transactions are important to any organization that wants to be on top of the financial health of the business. Many organizations hire accounting firms to input all business transactions to their accounting software to generate financial statements and reports, such as trial balance, income statements, etc., which helps them understand the health of the business and allows them to take corrective/preventive actions to maintain margins and profitability.
In the mortgage servicing world, both the servicer and the investor may require general ledger transaction data to generate financial statements to know and understand the health of the business. Investors, in particular, need to monitor loan pools to stay on top of pool performance and returns on investment. Investors often need to review the activities performed on loans that may affect their investments.
This data is often fed into the investors' accounting and modelling software. The timely reporting of data to investors is key to enabling them to stay ahead of macroeconomic and microeconomic events that may affect the value of a loan portfolio. The availability of real-time data in these instances will help the servicer keep borrowers abreast of changes affecting their payments and the resulting effects on the investor portfolio.
As shown in
Each activity may generate a loan transaction that may be further processed and then stored to the blockchain. In the same or similar manner, these loan transactions may be used to generate GL entries. At 524, a loan transaction may be received as an input. Once loan transactions have been prepared, a new function may be triggered in parallel to generate GL transactions. This may queue to the processing unit (PU).
The PU may look up the accounting rules (e.g., GL rules), as shown by 528, which may be prepared by the servicer and entered into the system against the type of transaction, as shown by 526. There may be one or more rules pre-configured in the system by the servicer for each transaction.
The PU may run the rules against that particular transaction and generate a GL transaction. These may be saved into the database, at 530, and queued to the blockchain network 534, as shown by 532.
As discussed in connection with
A request to add the asset (or loan object) may be transmitted to the blockchain. According to an embodiment of the present invention, the request may be submitted to the blockchain, which in turn redirects the request to a smart contract, a component that validates the transaction, applies business logic and rules, and pushes the transaction request to the processing engine.
The transaction may then be committed to the Blockchain Ledger. Here, an embodiment of the present invention may wait for consensus (e.g., sign-off) from all participants available in the blockchain networks. The transaction object may be given to each participant to validate and run through a smart contract at their end. Upon passing validation, the participant may provide a sign-off to commit to the transaction.
The client application, which initiated the request, may check for confirmation from all parties and if it is a “Yes” from all, then it initiates the commit process.
For example, a hash value may be created for the loan transaction and the GL transaction separately and then added to the blockchain ledger as a key-value pair. In this example, the key must be unique in order to add a new transaction object to the blockchain ledger. Here, the string <LoanNumber>TX<TranId> may be created as a unique key for the loan transaction and <LoanNumber>GL<GL_TranId> for the GL transaction.
An embodiment of the present invention may realize various benefits from using blockchain technology for Loan Transactions and General Ledger Transactions. Because the system records transactions in real-time on the blockchain, investors and auditors may cross-check transactions in real-time against bank transactions and reconcile data at any time.
In the example of “Activity 1” (as shown by 510 in
In the case of an investor who is participating on the blockchain, transactions may be available to view on their channel (e.g., distributed ledger technology). The investor may match and verify custodial bank account balances with actual bank account balances in real-time. The investor no longer needs to wait for end-of-day or end-of-month files from the servicer to perform account reconciliation.
Similarly, an embodiment of the present invention records GL transactions on the blockchain in real-time. Investors may utilize these GL transactions and generate GL entries for their own accounting software. Once entries are posted to the accounting software, the investor may generate financial statements instantly and perform an immediate analysis at the portfolio and pool levels.
For example, during “Activity 1” (payment received from borrower), as shown by 510 in
Because GL entries are generated in real-time and posted to the blockchain at the same time, an investor may utilize this data and post the same on an internal accounting software to check the trial balance or other financial statements at any time.
Transactions stored on the blockchain may be based on cryptographic proof, meaning all the parties involved, including investors and auditors, may trust the data as it is recorded and be further assured in the knowledge that it cannot be tampered with.
An embodiment of the present invention is directed to building a Data Dictionary and a Knowledge Graph specific to a particular domain, such as a mortgage domain. In this example, the knowledge graph may be highly unique to the mortgage industry. The NLP Engine may use mortgage specific data to ingest/read the mortgage specific guidelines.
The NLP Engine of an embodiment of the present invention is directed to maintaining an innovative hierarchy that specifies conditions and actions. Data extraction may be performed in a manner that keeps the hierarchy intact. By keeping the hierarchy intact, an output from a data-extraction process ensures that the vertical top-down placement of sentences (e.g., plain, bulleted, indented) is consistent with the input documents. An embodiment of the present invention may apply a weight-based approach which considers any two immediate sentences/paragraphs to be in the form of: Parent-Child or Sibling. Accordingly, an embodiment of the present invention understands the hierarchy for further pipeline to function, as shown in
An embodiment of the present invention is directed to a unique node tree structure with a weight-based approach. An embodiment of the present invention applies a mathematical logic that distributes weights to a term, phrase, sentence structure, etc. The weight-based approach considers a particular domain, such as a mortgage domain knowledge. In this context, the term “loan date” or the term “payment” have unique significance. In addition, when considering regulations, certain terms, conditions, phrases, and/or sentences may have more significance under the weight-based approach. An embodiment of the present invention may then learn from a particular knowledge base which includes guidelines from government-sponsored enterprise (GSE), such as Fannie Mae and Freddie Mac, and federal government agencies, such as Ginnie Mae that guarantees timely payments on mortgage-backed securities. For consistency, terminology across various guidelines in the same domain would have consistent meanings and definitions. Accordingly, the weight assigned to a term such as “first lien mortgage” may be consistently applied in any guideline within a particular domain.
An embodiment of the present invention is directed to a Domain Specific Weight Assignment Tool that may be applied to various domains including mortgage domain, automobile domain, pharmaceutical domain, etc.
An embodiment of the present invention is directed to data extraction as applied to unstructured text (into PDFs) while keeping a hierarchy intact.
Data preparation as in data-readiness is a time-consuming and exhaustive process, and if not done effectively, will impact further pipelines. An embodiment of the present invention is directed to a unique node-tree structure that clearly clubbed every sentence/paragraph with its associated topic/section. Basis for document structure and formatting, a metric may be defined in a configuration-file which identifies the level metric for a topic/section. The node-tree structure may establish a relationship, such as “parent-child,” “sibling-sibling.” An embodiment of the present invention may apply an algorithm that determines up-to what depth the sentences/paragraphs are to be clubbed under each topic/section.
Each sentence may be labeled with a depth-parameter number based on a uniquely developed weight-base-approach. The depth-parameter number may represent a derived basis a calculation which uses a set of defined weights. These weights may be altered in a configuration file as per the document format and structure.
The NLP of an embodiment of the present invention also defines scope which later helps in running the business-rules falling under this scope, thereby executing independent business use-cases. For example, the metric may represent a level that determines/identifies a topic/section as well as scope. This may be further combined with business-rules.
An embodiment of the present invention may apply techniques such as Name-Entity-Recognition (NER), Part-Of-Speech (POS), to extract grammatical knowledge from textual data. An embodiment of the present invention may be applied to unearth existing conditions and actions, for which an independent unique framework may be developed by applying a “Verb” base approach. The Verb/Verbs approach forms the root of any sentence while enabling context to be built around.
NLP applied to determine underlying condition-action concepts generally gravitates towards SVO (Subject-Verb-Object) orientation. This may be used to develop a pipeline where multiple features are derived by disintegrating each sentence into its multiple atomic levels which have their own SVO understanding (e.g., towards sentence simplification). With features based out of SVOs generated on each sentence, an embodiment of the present invention may be applied to discover underlying condition-actions as well as how they are inter-linked.
An embodiment of the present invention may integrate Human-In-Loop (HIL) features to improve the NLP system through auto-learning with consistent feedback cycle.
An embodiment of the present invention uniquely applies NLP to the mortgage domain by transforming auto-extracted business rules and developing a Data-Dictionary and a Knowledge-Graph specific to the mortgage domain. While the mortgage domain is described for illustration purposes, the NLP of an embodiment of the present invention is not limited to a particular domain and may be applied to various other types of loans, industries, uses, applications, etc.
According to an embodiment of the present invention, the Data-Dictionary may capture mortgage subject matter experts (SMEs) human intelligence by mapping the extracted rules with relevant column-names in database(s). This improves NLP models by retraining on this new mapped information such that the business rules make more sense as the variables are now predicted with an inclination towards mortgage-specific columns and may be run directly on a pool of loans.
An embodiment of the present invention is directed to a Knowledge-Graph (KG). With such magnitude of unstructured data being available in the form of guidelines/directions/instructions pertaining to mortgage domain, there is a vital need to have an advanced way to not only store this data but to also connect the information in a meaningful way. As recognized by an embodiment of the present invention, the Knowledge-Graph may represent a database to store and manage textual data with underlying information. An embodiment of the present invention is directed to developing a network of interlinked information in the form of Node-Edge-Node where each Node acts as an entity being connected with some relation as marked by the Edge. Mortgage domain related entities benefit from a comprehensive view about how they are related to other entities.
An embodiment of the present invention applies a weight-based approach in preparing the node-tree structure and keeping the hierarchy intact. For example, the weight may be calculated using various factors or coefficients. Specifics may vary depending on the document and document format.
“significance_score_factor_textsize” may represent how font-size of each sentence/text defines overall hierarchy.
“significance_score_factor_weight” may represent how weight (e.g., being bold) of the text matters in getting the hierarchical understanding.
“significance_score_factor_indent” may represent how ‘X coordinates’ of a surrounding box around the text configures out.
“significance_score_factor_sentence_is_uppercase” may represent whether text turns out to be all in UPPERCASE format and whether this alters the placement of text. Other variations of factors/coefficients may be applied based on specific formatting or structuring. For example, a factor/coefficient may be removed and others may be added or otherwise modified.
According to an exemplary illustration, the values may be configured as shown below:
For Section 144, as shown in
An exemplary calculation may be represented as:
As recognized by an embodiment of the present invention, servicers may be required to adhere to established policies, guidelines, and requirements set out by investors, GSEs, and/or other regulatory agencies. These guidelines act as the legal and definitive sources of servicing rules with regard to certain conditions and subsequent actions. The sheer volume and ever-changing nature of these regulations as well as the requirements to adhere to federal and agency state regulations make it difficult for mortgage servicers to design their servicing systems to ensure that they remain compliant without the help of subject matter experts.
An embodiment of the present invention is directed to leveraging natural language processing (NLP) technology based in advanced machine learning/deep learning technologies to create an improved method and system for extracting rules from guidelines.
Mortgage servicing documents contain certain rules in the form of mandates, guidelines, notifications, and requirements from various agencies including GSEs, such as Fannie Mae, Freddie Mac, and Ginnie Mae, and federal regulatory bodies, e.g., the Federal Housing Administration and the CFPB. Servicers may be required to follow these rules in their day-to-day management of mortgage loans.
Though the guidelines are written in plain English in an unambiguous form, it is often difficult for employees with limited debt servicing experience to unearth the information and rules.
An embodiment of the present invention is directed to leveraging NLP methods to auto-extract the information and rules from relevant sections and topics. For example, information from each document may be parsed into these three formats at their atomic level: (1) simplified version of original text into a more readable format; (2) process flow charts for enhanced visualization; and (3) prepared code snippets in the Python programming language as functions.
At step 710, input source text may be received. An embodiment of the present invention may receive input as printed source text (PDF) that may be stored while caching the versioning of each of these investor specific guidelines. Data extraction process commences here with below levels.
For example, Level-0 may be responsible for parsing out this raw information in JSON structure which remains consistent in structure irrespective that these PDFs are from multiple vendors. Highlights of this structure may include: customizing this extracted data per pre-developed configuration(s) into proper hierarchy while mapping them as per their individual: main heading/topics; sub-heading/sub-topics; page number; HTML base tagging, etc.
Each line/sentence in these documents may be extracted along with various tags and configurations as: bounding boxes; fonts; path, etc.
Should any information appear in tables in these PDFs (as is evidently present in servicing guidelines), then these may be dedicatedly extracted in individual ‘.xlsx’ format. The path of this may be embedded into the JSON output at level-0 at correct level.
Level-1 may cover transforming the JSON data into multiple dataframes (e.g., tabular structure) based on a final atomic detail which may be generally the sub-topics accessible with a hyperlink. These dataframes hold each extracted input at sentence level in each row. Other related details, such as the page number, heading, sub-topics, path, tags, etc., may also be captured in other columns.
At step 712, input data may be preprocessed. Post raw data extraction in step 710, the unstructured text in multiple dataframes gets pre-processed for each sentence to be refined and cleaned with methods including: grammar/spelling/punctuation correctness; sentence syntactical improvement (such as pronoun replacement with its proper noun); sentence simplification (e.g., breaking complex and compound sentences into simpler sentences that stand in totality as independent clauses thereby holding the “subject-predicate-object” formation). The pre-processed data may then be stored in a new folder as output of this step.
At step 714, semantic sentence processing may be applied. Once sentences are pre-processed and cleaned, the semantic relationships may be retrieved from these sentences as various features. Sentences may be semantically disintegrated from their root (which generally is the main verb) into a tree structure as shown in
At step 716, simplified document generation may be performed. With the help of semantics as established in step 714, an embodiment of the present invention may generate simplified documents. For example, simplified documents may capture the required highlights in a simplified and easy-to-comprehend manner: Large complex sentences may be formulated into simpler points with proper indentations and bullets. Certain phrases and entities may be highlighted (e.g., bold) based on context and impact. Such phrases may include: first lien mortgage loan. mortgage loan delinquency. unpaid principal balance/UPB. late fees and charges. current interest accrual rate, etc.
Words/phrases with higher attention may be highlighted (e.g., capitalized representation, color, font, etc.) and may include terms such as: must/must not if/then when and/or, etc.
At step 718, flow chart generation may be performed. This step may generate a descriptive process flow-chart of each of these sub-sections/sub-topics. These process flow-charts may be used to transform the raw English text into visually rich comprehendible illustrative representations. They also capture the flow of: required condition-action along with: proper entity recognition and thoroughly ordered instructions.
At step 720, code base generation may be performed. This may include a Code Base Snippet Generation. For example, Python Language base code snippets may be generated as individual functions for these sub-sections/sub-topics. Other programming languages may be applied. With the help of semantics and contextualized vectorization techniques, this step may involve: auto declaration/definition of variables basis context and phrases as found in each of these broken segments within sentences; auto recognition of values (e.g., numbers or strings or dates) that gets assigned to these variables; auto development of IF and THEN statements basis condition-action as seen in these segments of sentences and auto handling of proper indentation for basic code syntax.
The foregoing examples show the various embodiments of the invention in one physical configuration; however, it is to be appreciated that the various components may be located at distant portions of a distributed network, such as a local area network, a wide area network, a telecommunications network, an intranet and/or the Internet. Thus, it should be appreciated that the components of the various embodiments may be combined into one or more devices, collocated on a particular node of a distributed network, or distributed at various locations in a network, for example. As will be appreciated by those skilled in the art, the components of the various embodiments may be arranged at any location or locations within a distributed network without affecting the operation of the respective system.
As described above, the various embodiments of the present invention support a number of communication devices and components, each of which may include at least one programmed processor and at least one memory or storage device. The memory may store a set of instructions. The instructions may be either permanently or temporarily stored in the memory or memories of the processor. The set of instructions may include various instructions that perform a particular task or tasks, such as those tasks described above. Such a set of instructions for performing a particular task may be characterized as a program, software program, software application, app, or software.
It is appreciated that in order to practice the methods of the embodiments as described above, it is not necessary that the processors and/or the memories be physically located in the same geographical place. That is, each of the processors and the memories used in exemplary embodiments of the invention may be located in geographically distinct locations and connected so as to communicate in any suitable manner. Additionally, it is appreciated that each of the processor and/or the memory may be composed of different physical pieces of equipment. Accordingly, it is not necessary that the processor be one single piece of equipment in one location and that the memory be another single piece of equipment in another location. That is, it is contemplated that the processor may be two or more pieces of equipment in two or more different physical locations. The two distinct pieces of equipment may be connected in any suitable manner. Additionally, the memory may include two or more portions of memory in two or more physical locations.
As described above, a set of instructions is used in the processing of various embodiments of the invention. The servers may include software or computer programs stored in the memory (e.g., non-transitory computer readable medium containing program code instructions executed by the processor) for executing the methods described herein. The set of instructions may be in the form of a program or software or app. The software may be in the form of system software or application software, for example. The software might also be in the form of a collection of separate programs, a program module within a larger program, or a portion of a program module, for example. The software used might also include modular programming in the form of object oriented programming. The software tells the processor what to do with the data being processed.
Further, it is appreciated that the instructions or set of instructions used in the implementation and operation of the invention may be in a suitable form such that the processor may read the instructions. For example, the instructions that form a program may be in the form of a suitable programming language, which is converted to machine language or object code to allow the processor or processors to read the instructions. That is, written lines of programming code or source code, in a particular programming language, are converted to machine language using a compiler, assembler or interpreter. The machine language is binary coded machine instructions that are specific to a particular type of processor, i.e., to a particular type of computer, for example. Any suitable programming language may be used in accordance with the various embodiments of the invention. For example, the programming language used may include assembly language, Ada, APL, Basic, C, C++, COBOL, dBase, Forth, Fortran, Java, Modula-2, Pascal, Prolog, REXX, Visual Basic, JavaScript and/or Python. Further, it is not necessary that a single type of instructions or single programming language be utilized in conjunction with the operation of the system and method of the invention. Rather, any number of different programming languages may be utilized as is necessary or desirable.
Also, the instructions and/or data used in the practice of various embodiments of the invention may utilize any compression or encryption technique or algorithm, as may be desired. An encryption module might be used to encrypt data. Further, files or other data may be decrypted using a suitable decryption module, for example.
In the system and method of exemplary embodiments of the invention, a variety of “user interfaces” may be utilized to allow a user to interface with the mobile devices or other personal computing device. As used herein, a user interface may include any hardware, software, or combination of hardware and software used by the processor that allows a user to interact with the processor of the communication device. A user interface may be in the form of a dialogue screen provided by an app, for example. A user interface may also include any of touch screen, keyboard, voice reader, voice recognizer, dialogue screen, menu box, list, checkbox, toggle switch, a pushbutton, a virtual environment (e.g., Virtual Machine (VM)/cloud), or any other device that allows a user to receive information regarding the operation of the processor as it processes a set of instructions and/or provide the processor with information. Accordingly, the user interface may be any system that provides communication between a user and a processor. The information provided by the user to the processor through the user interface may be in the form of a command, a selection of data, or some other input, for example.
The software, hardware and services described herein may be provided utilizing one or more cloud service models, such as Software-as-a-Service (SaaS), Platform-as-a-Service (PaaS), and Infrastructure-as-a-Service (IaaS), and/or using one or more deployment models such as public cloud, private cloud, hybrid cloud, and/or community cloud models.
Although the embodiments of the present invention have been described herein in the context of a particular implementation in a particular environment for a particular purpose, those skilled in the art will recognize that its usefulness is not limited thereto and that the embodiments of the present invention can be beneficially implemented in other related environments for similar purposes.