Computing devices, such as mobile phones, tablet computers, laptop computers, or wearable devices, allow users to access sensitive content such as, for example, account information. Account information may include banking information, rewards/loyalty information, historic information (e.g., purchases, browsing information, offers, and information generated therefrom), utility account information, medical information, and other nonpublic information accessible by the user via, for instance, a password or personal identification number (“PIN”). Generally, users access account information via an application, installed on a computing device, that is associated with the account information. Alternatively, users can often access account information via a website associated with the account information via a web browser executing on the computing device. Often, users experience difficulty or frustration accessing account information because associated applications or websites typically require users to manually enter user names, passwords, and other account-related information, which can be cumbersome to input, particularly on devices that do not utilize a traditional keyboard. Further, once a user is able to access his account, the user often experiences further difficulty in completing the tasks he set out to accomplish by accessing the account.
Aspects of existing speech recognition technology and, in particular, internet-enabled voice command devices, allow users to utilize voice commands to, for example, control smart devices or ask questions that can be answered based on an internet query. Such technology, however, may not enable users to access sensitive content such as account information.
Accordingly, a need exists for systems and methods that allow users an improved experience when accessing sensitive content such as account information and completing tasks associated with the account. In particular, a need exists for such systems and methods that utilize voice-recognition technology and allow users to interact with the account using natural language.
Disclosed implementations provide systems and methods for providing users access to sensitive content such as account information, such systems and methods utilizing voice-recognition technology that allows users to interact with the systems and methods using natural language.
Consistent with the disclosed implementations, the system may include one or more processors and a memory coupled to the one or more processors and storing instructions that, when executed by the one or more processors, cause the system to receive, from a computing device, a data file that includes data representative of a voice command received at the computing device from a user. The one or more processors may further execute instructions that cause the system to transmit a request for user authentication information responsive to determining that the voice command is directed to a banking-related inquiry, and verify the user authentication information once it is received. Additionally, the one or more processors may execute instructions that cause the system to query a banking account for requested information in response to determining that the voice command includes a request relating to the banking account. Finally, the one or more processors may execute instructions that cause the system to output data indicative of the requested information.
Consistent with the disclosed implementations, methods for providing users access to sensitive content such as account information using voice-recognition technology that allows users to interact with the systems and methods using natural language.
Further features of the disclosed design, and the advantages offered thereby, are explained in greater detail hereinafter with reference to specific embodiments illustrated in the accompanying drawings, wherein like elements are indicated be like reference designators.
Reference will now be made to the accompanying FIGS., which are not necessarily drawn to scale, and which are incorporated into and constitute a portion of this disclosure, illustrate various implementations and aspects of the disclosed technology and, together with the description, serve to explain the principles of the disclosed technology. In the FIGS.:
Some implementations of the disclosed technology will be described more fully with reference to the accompanying drawings. This disclosed technology may, however, be embodied in many different forms and should not be construed as limited to the implementations set forth herein.
Example implementations of the disclosed technology can provide systems and methods for voice-controlled account servicing. For example, some implementations utilize speech recognition technology and thus allow a user to access and interact with sensitive information such as account information. According to example implementations, a computing device (e.g., a user device) receives a user's voice command, which can be a natural-language voice command or request. The user device can create a capture of the voice command, such as an audio file, which the user device can process and convert to a data file, which may be a text string representing the user's voice command. Based on the data file, the user device can set about determining the intent of the user's voice command. Upon determining the voice command was intended to access or interact with an account associated with the user, the user device can transmit the data file to a remote server associated with the user's account. The server can further process the data file to determine the exact nature of the user's command. For example, if the voice command is directed to a user's financial account (e.g., bank account, credit card account, money market account, or other type of financial account), the command could relate to the account's balance, recent transactions, account rewards balance or redemption, budgeting questions, or bill payment questions. After determining the nature of the nature of the request, the server can request additional account authentication information or access the user's account for the requested information. Depending on the nature of the request, the server can output a response to the user device, which the user device can provide to the user as, for example, a verbal response or on a display associated with the user device. Alternatively, in some implementations, if the user request relates to a payment, the server can initiate a transaction with a designated payee on behalf of the user/payor.
Example implementations may include a method that comprises receiving, at a processor and from a computing device, a data file that comprises data representative of a voice command, which can be a natural-language voice command, received at the computing device from a user. The data file can include a text string that represents the text of the voice command. After determining that the voice command is directed to a banking-related inquiry (e.g., a request for an account balance or for an itemized list of purchases made during a particular time period), the method can include transmitting, to the computing device, a request for user authentication information. Additionally, the method can include receipt and authentication of the user authentication information in addition to querying a banking account for the requested information in response to determining that the voice command includes a request for information relating to the banking account. Finally, the method can include outputting data indicative of the requested information.
The method, in some example implementations, may further include initiating an electronic payment to a third-party account from the banking account or, conversely, initiating payment from a third-party account to the banking account. Further, in example implementations, the method can include initiating payment from a first bank account to a second bank account, both held by the user, and both provided by the same financial institution.
Example implementations of the disclosed technology will now be described with reference to the accompanying figures.
As desired, implementations of the disclosed technology include a computing device with more or fewer of the components illustrated in
The computing device architecture 100 of
In an example implementation, the network connection interface 112 is configured as a wired or wireless communication interface and provides functions for rendering video, graphics, images, text, other information, or any combination thereof on the display. In one example, a communication interface includes a serial port, a parallel port, a general purpose input and output (GPIO) port, a game port, a universal serial bus (USB), a micro-USB port, a high definition multimedia (HDMI) port, a video port, an audio port, a Bluetooth port, a near-field communication (NFC) port, another like communication interface, or any combination thereof.
The computing device architecture 100 may include a keyboard interface 106 that provides a communication interface to a physical or virtual keyboard. In one example implementation, the computing device architecture 100 includes a presence-sensitive display interface 108 for connecting to a presence-sensitive display 107. According to certain example implementations of the disclosed technology, the presence-sensitive input interface 108 provides a communication interface to various devices such as a pointing device, a capacitive touch screen, a resistive touch screen, a touchpad, a depth camera, etc. which may or may not be integrated with a display.
The computing device architecture 100 may be configured to use one or more input components via one or more of input/output interfaces (for example, the keyboard interface 106, the display interface 104, the presence sensitive input interface 108, network connection interface 112, camera interface 114, sound interface 116, etc.) to allow the computing device architecture 100 to present information to a user and capture information from a device's environment including instructions from the device's user. The input components may include a mouse, a trackball, a directional pad, a track pad, a touch-verified track pad, a presence-sensitive track pad, a presence-sensitive display, a scroll wheel, a digital camera, a digital video camera, a web camera, a microphone, a sensor, a smartcard, and the like. Additionally, an input component may be integrated with the computing device architecture 100 or may be a separate device. As additional examples, input components may include an accelerometer (e.g., for movement detection), a magnetometer, a digital camera, a microphone (e.g., for sound detection), an infrared sensor, and an optical sensor.
Example implementations of the computing device architecture 100 include an antenna interface 110 that provides a communication interface to an antenna; a network connection interface 112 may support a wireless communication interface to a network. As mentioned above, the display interface 104 may be in communication with the network connection interface 112, for example, to provide information for display on a remote display that is not directly connected or attached to the system. In certain implementations, a camera interface 114 is provided that acts as a communication interface and provides functions for capturing digital images from a camera. In certain implementations, a sound interface 116 is provided as a communication interface for converting sound into electrical signals using a microphone and for converting electrical signals into sound using a speaker. According to example implementations, a random access memory (RAM) 118 is provided, where computer instructions and data may be stored in a volatile memory device for processing by the CPU 102.
According to example implementations, the computing device architecture 100 includes a read-only memory (ROM) 120 where invariant low-level system code or data for basic system functions such as basic input and output (I/O), startup, or reception of keystrokes from a keyboard are stored in a non-volatile memory device. According to example implementations, the computing device architecture 100 includes a storage medium 122 or other suitable type of memory (e.g. such as RAM, ROM, programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), magnetic disks, optical disks, floppy disks, hard disks, removable cartridges, flash drives), for storing files including an operating system 124, application programs 126 (including, for example, a web browser application, a widget or gadget engine, and or other applications, as necessary), and data files 128, which can include audio files representative of received voice commands. According to example implementations, the computing device architecture 100 includes a power source 130 that provides an appropriate alternating current (AC) or direct current (DC) to power components.
According to an example implementation, the computing device architecture 100 includes a telephony subsystem 132 that allows the device 100 to transmit and receive audio and data information over a telephone network. Although shown as a separate subsystem, the telephony subsystem 132 may be implemented as part of the network connection interface 112. The constituent components and the CPU 102 communicate with each other over a bus 134.
According to an example implementation, the CPU 102 has appropriate structure to be a computer processor. In one arrangement, the CPU 102 includes more than one processing unit. The RAM 118 interfaces with the computer bus 134 to provide quick RAM storage to the CPU 102 during the execution of software programs such as the operating system, application programs, and device drivers. More specifically, the CPU 102 loads computer-executable process steps from the storage medium 122 or other media into a field of the RAM 118 to execute software programs. Data may be stored in the RAM 118, where the computer CPU 102 can access data during execution. In one example configuration, and as will be understood by one of skill in the art, the device architecture 100 includes sufficient RAM and flash memory for carrying out processes relating to the disclosed technology.
The storage medium 122 itself may include a number of physical drive units, such as a redundant array of independent disks (RAID), a floppy disk drive, a flash memory, a USB flash drive, an external hard disk drive, thumb drive, pen drive, key drive, a High-Density Digital Versatile Disc (HD-DVD) optical disc drive, an internal hard disk drive, a Blu-Ray optical disc drive, or a Holographic Digital Data Storage (HDDS) optical disc drive, an external mini-dual in-line memory module (DIMM) synchronous dynamic random access memory (SDRAM), or an external micro-DIMM SDRAM. Such computer readable storage media allow a computing device to access computer-executable process steps, application programs and the like, stored on removable and non-removable memory media, to off-load data from the device or to upload data onto the device. A computer program product, such as one utilizing a communication system, may be tangibly embodied in storage medium 122, which may include a non-transitory, machine-readable storage medium.
According to example implementations, the term “computing device,” as used herein, may be a CPU, or conceptualized as a CPU (for example, the CPU 102 of
In example implementations of the disclosed technology, a computing device includes any number of hardware and/or software applications that are executed to facilitate any of the operations. In example implementations, one or more I/O interfaces facilitate communication between the computing device and one or more input/output devices. For example, a universal serial bus port, a serial port, a disk drive, a CD-ROM drive, and/or one or more user interface devices, such as a display, keyboard, keypad, mouse, control panel, touch screen display, microphone, etc., may facilitate user interaction with the computing device. The one or more I/O interfaces may be utilized to receive or collect data and/or user instructions from a wide variety of input devices. Received data may be processed by one or more computer processors as desired in various implementations of the disclosed technology and/or stored in one or more memory devices.
One or more network interfaces may facilitate connection of the computing device inputs and outputs to one or more suitable networks and/or connections. For example, the connections that facilitate communication with any number of sensors associated with the system. The one or more network interfaces may further facilitate connection to one or more suitable networks; for example, a local area network, a wide area network, the Internet, a cellular network, a radio frequency network, a Bluetooth enabled network, a Wi-Fi enabled network, a satellite-based network, any wired network, any wireless network, etc., for communication with external devices and/or systems.
In some implementations, after receiving a voice command from computing device user 205 (e.g., via sound interface 116), computing device 210 can create a digital audio data file that represents the received voice command using, for example, an application program 126. Accordingly, in some implementations, the computing device 210 can create a waveform audio (“.WAV”) file, a free lossless audio codec (“FLAC”) file, or other suitable digital audio data file. According to some implementations, voice recognition application server 215 can be configured to receive audio files from computing device 210, process the received audio file, and convert the audio file into a separate data file such as, for example, a text file. In some implementations, application server 220 can be configured to receive the data file (e.g., from computing device 210 or voice recognition application server 215), and process the data file to determine the substance or nature of the voice command. Further, in some implementations, and depending on the nature of the voice command, application server 220 can be configured to output appropriate responses to computing device 210, initiate an account management action, or initiate a transaction or other communication with third-party server 225.
Though not shown, it will be understood by one of skill in the art that many remote servers can be operatively connected through a network 201. Generally, such operative connections involve a secure connection or communications protocol (i.e., a trusted connection), and communications over a network typically involve the use of one or more services such as a Web-deployed service with client/server architecture, a corporate Local Area Network (“LAN”) or Wide Area Network (“WAN”), or through a cloud-based system. According to some implementations, servers (e.g., voice recognition application server 215, application server 220, and third-party server 225) can comprise at least one database (e.g., 212, 216, and 222, respectively) and one or more processors (e.g., 214, 218, and 224, respectively) for carrying out various computer-implemented processes, including computer-implemented processes associated with a voice-controlled account servicing system. Further, though shown independently, according to some implementations, voice recognition application server 215 and application server 220 can be co-located. Likewise, as will be understood, an environment 200 for utilizing a voice-controlled account servicing system can comprise more or less components than shown in
Accordingly, as shown in
In some implementations, after receiving the data file back from voice recognition application server 215, VR APP 304 may process 311 the data file to determine the nature of the voice command and/or determine an appropriate application for further processing the command (e.g., application 306). For example, in some implementations, VR APP 304 may parse a text file and identify certain key words to determine the nature of the voice command and/or an appropriate application to further process the command. So, in the foregoing example, computing device user 205 may provide a voice command that relates to a financial account associated with computing device user 205. Accordingly, in some implementations, processing 311 may include determining the nature of the voice command (e.g., after determining the voice command is related to computing device user's 205 financial account). Further, as shown in
As shown in
As noted previously, in some implementations, application server 220 may be associated with various financial accounts including, for example, a banking account associated with computing device user 205. Accordingly, in some implementations, database 216 can store customer information (e.g., customer account information, which can include various account details such as name and contact information, account balance information, transaction information, other relevant account details, and any other non-public personal information or personally identifiable financial information provided by a customer to a financial institution, or resulting from a transaction with the customer or otherwise obtained by the financial institution). Further, in some implementations, database 216 can store various voice commands that are related to a user's banking account, or associated with the type of banking account maintained by the user at a financial institution, and that are recognizable by application server 220. Additionally, processor 218 may be configured for generating banking accounts, managing and servicing banking accounts, and processing information relating to banking accounts. Further, processor 218 may be configured to execute instructions relating to voice recognition technology that can process received data files relating to voice commands. Moreover, processor 218 may be configured to execute instructions for generating responses to voice commands and inquiries, or to follow a series of actions in response to a received voice command or inquiry.
In some implementations, application server 220 may determine that based on the nature of the voice command (e.g., that the voice command relates to sensitive financial information), additional security information is necessary. Accordingly, application server 220 may optionally transmit 319 a request to application 306 to obtain additional security information from computing device user 205, which computing device user 205 can provide verbally or manually. For example, computing device user 205 could be prompted to verbally provide an answer to a security question or provide a PIN number, Social Security Number, or various other account-verifying information via, for example, sound interface 116. Likewise, computing device user 205 could be prompted to manually provide account verification information (e.g., biometric information such as a fingerprint scan, one or more pattern scans or swipe gestures, or other account verification information) at, for example, presence-sensitive display 107. Further, in some implementations, a request for additional security information may comprise a multi-factor authentication. Thus, for example, application server 220 may generate a passcode and transmit the passcode to computing device 210 such that computing device user 205 can provide the passcode as a voice command that can be received and verified by, for example, VR APP 304 or application 306. Additionally, in some implementations, application 306 may utilize or incorporate voice recognition technology (e.g., voice biometrics) to further verify the identity of computing device user 205 based on, for example, received voice commands.
In some implementations, however, computing device user 205 can pre-register computing device 210 with application 306 and/or application server 220 such that it is not necessary to obtain additional security information. Put differently, computing device user 205 can pre-authorize his financial account for such voice commands. Thus, for example, an account holder can access a website provided by the financial institution associated with the financial account and preauthorize computing device 210 for utilizing voice commands in conjunction with the financial account. In some implementations, an identifier associated with a pre-registered computing device 210, such as smartphone device ID, serial number of the like, may be delivered with a data file or as part of the data file information, such as in data file header information or metadata. Further, in some implementations, the initial voice command can include account-verifying information (or user-verifying information) that gets converted as part of the digital audio and data file and propagated to application server 220.
Further, in some embodiments, application server 220 can determine whether additional security information is required based on the nature of the received voice command and the sensitivity of the requested information. Thus, for example, if the voice command relates to a request for account balance information, no additional security information may be required. If, on the other hand, the voice command relates to a request for application server 220 to take certain actions (e.g., pay a bill to an associated third-party account), additional security information may be required.
As shown in
As further shown in
In some implementations, initiating 325 a transaction with a third-party server (e.g., third-party server 225, which can be associated with a third-party bank, utility company, credit card provider, or other third party) can include authenticating computing device user (e.g., transmitting 319 a request for security information or via a pre-registration of computing device 210). Additionally, initiating 325 a transaction can include securely connecting to a server associated with the third party (e.g., third-party server 225) and validating third-party accounts associated with mobile device user 205. Further, initiating 325 a transaction can include authorizing the requested transaction. In some implementations, after the third party completes the requested transaction, application server 220 may receive a confirmation of the completed transaction from third-party server 225.
As shown in
In some implementations, a voice command from computing device user 205 may initiate a dialog between computing device user 205 and computing device 210, VR APP 304, and/or application 306. Thus, for example, computing device user 205 may provide a voice command relating to account rewards (e.g., “What is my rewards balance?”). Application server 220 may determine the rewards balance according to the disclosure provided, and computing device user 205 may provide a related follow-up voice command (e.g., “What can I spend my rewards points on?”). Again, application server 220 may determine an appropriate response to provide to computing device user 205. In response, computing device user 205 may provide an additional voice command to redeem certain rewards points on an identified item, and application server 220 may initiate the transaction as described above.
Though not shown in
For convenience and ease of discussion, implementations of the disclosed technology are described above in connection with a financial or banking account associated with a user. But it is to be understood that the disclosed implementations are not limited to financial or banking accounts and are applicable to various other accounts associated with a user's sensitive information (e.g., utility/service accounts, medical information, and various other sensitive information).
Certain implementations of the disclosed technology are described above with reference to block and flow diagrams of systems and methods and/or computer program products according to example implementations of the disclosed technology. It will be understood that one or more blocks of the block diagrams and flow diagrams, and combinations of blocks in the block diagrams and flow diagrams, respectively, can be implemented by computer-executable program instructions. Likewise, some blocks of the block diagrams and flow diagrams may not necessarily need to be performed in the order presented, may be repeated, or may not necessarily need to be performed at all, according to some implementations of the disclosed technology.
These computer-executable program instructions may be loaded onto a general-purpose computer, a special-purpose computer, a processor, or other programmable data processing apparatus to produce a particular machine, such that the instructions that execute on the computer, processor, or other programmable data processing apparatus create means for implementing one or more functions specified in the flow diagram block or blocks. These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement one or more functions specified in the flow diagram block or blocks. As an example, implementations of the disclosed technology may provide for a computer program product, including a computer-usable medium having a computer-readable program code or program instructions embodied therein, said computer-readable program code adapted to be executed to implement one or more functions specified in the flow diagram block or blocks. Likewise, the computer program instructions may be loaded onto a computer or other programmable data processing apparatus to cause a series of operational elements or steps to be performed on the computer or other programmable apparatus to produce a computer-implemented process such that the instructions that execute on the computer or other programmable apparatus provide elements or steps for implementing the functions specified in the flow diagram block or blocks.
Accordingly, blocks of the block diagrams and flow diagrams support combinations of means for performing the specified functions, combinations of elements or steps for performing the specified functions, and program instruction means for performing the specified functions. It will also be understood that each block of the block diagrams and flow diagrams, and combinations of blocks in the block diagrams and flow diagrams, can be implemented by special-purpose, hardware-based computer systems that perform the specified functions, elements or steps, or combinations of special-purpose hardware and computer instructions.
Certain implementations of the disclosed technology are described above with reference to mobile computing devices. Those skilled in the art recognize that there are several categories of mobile devices, generally known as portable computing devices that can run on batteries but are not usually classified as laptops. For example, mobile devices can include, but are not limited to portable computers, tablet PCs, internet tablets, PDAs, ultra mobile PCs (UMPCs), wearable devices, and smartphones. Additionally, implementations of the disclosed technology can be utilized with internet of things (IoT) devices, smart televisions and media devices, appliances, automobiles, toys, and voice command devices, as well as peripherals configured for use with such devices.
In this description, numerous specific details have been set forth. It is to be understood, however, that implementations of the disclosed technology may be practiced without these specific details. In other instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description. References to “one implementation,” “an implementation,” “example implementation,” “various implementations,” “some implementations,” etc., indicate that the implementation(s) of the disclosed technology so described may include a particular feature, structure, or characteristic, but not every implementation necessarily includes the particular feature, structure, or characteristic. Further, repeated use of the phrase “in one implementation” does not necessarily refer to the same implementation, although it may.
Throughout the specification and the claims, the following terms take at least the meanings explicitly associated herein, unless the context clearly dictates otherwise. The term “connected” means that one function, feature, structure, or characteristic is directly joined to or in communication with another function, feature, structure, or characteristic. The term “coupled” means that one function, feature, structure, or characteristic is directly or indirectly joined to or in communication with another function, feature, structure, or characteristic. The term “or” is intended to mean an inclusive “or.” Further, the terms “a,” “an,” and “the” are intended to mean one or more unless specified otherwise or clear from the context to be directed to a singular form.
As used herein, unless otherwise specified the use of the ordinal adjectives “first,” “second,” “third,” etc., to describe a common object, merely indicate that different instances of like objects are being referred to, and are not intended to imply that the objects so described must be in a given sequence, either temporally, spatially, in ranking, or in any other manner.
While certain implementations of the disclosed technology have been described in connection with what is presently considered to be the most practical and various implementations, it is to be understood that the disclosed technology is not to be limited to the disclosed implementations, but on the contrary, is intended to cover various modifications and equivalent arrangements included within the scope of the appended claims. Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.
This written description uses examples to disclose certain implementations of the disclosed technology, including the best mode, and also to enable any person skilled in the art to practice certain implementations of the disclosed technology, including making and using any devices or systems and performing any incorporated methods. The patentable scope of certain implementations of the disclosed technology is defined in the claims, and may include other examples that occur to those skilled in the art. Such other examples are intended to be within the scope of the claims if they have structural elements that do not differ from the literal language of the claims, or if they include equivalent structural elements with insubstantial differences from the literal language of the claims.
This application is a continuation of U.S. patent application Ser. No. 15/375,038, filed 9 Dec. 2016, entitled “Systems and Methods for Voice-Controlled Account Servicing”, which claims the benefit of U.S. Provisional Application No. 62/266,266, filed 11 Dec. 2015, the entire contents and substance of which are hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
62266266 | Dec 2015 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15375038 | Dec 2016 | US |
Child | 15667071 | US |