The present invention generally relates to the field of telecommunications. In particular, the present invention relates to a system and method for transcribing telephone conversations.
In today's technological environment, information is readily available through the use of many different types of communication devices now commonly used by virtually everyone in our society. Smart phones, tablets, portable laptops and other such devices are ubiquitous and have become part of our everyday landscape. Surprisingly, the manner in which much of the information conveyed over communication networks (such as the Internet) with the use of these communication devices is lacking in that the delivery of the information typically does not take into account the physical capabilities or disabilities of users of such information.
For example, many users are hearing impaired or have difficulty understanding voice and/or visual communications due to physical impairments. Moreover, there may be a language issue whereby the language used to convey the information may not be a user's natural language; this makes the user's understanding of the conveyed information an important issue in terms of the effectiveness of the communication. Further, our society is becoming increasingly more reliant on individuals being able to communicate with anyone within their circle of friends, family, acquaintances, and business/professional contacts at any time. Much of the available communication devices are portable, and more importantly, can convey voice, video, text and email virtually from anywhere at any time. Currently available portable devices are able to use the ultimately accessible World Wide Web (i.e., the Internet) to transmit and receive information in the various forms already mentioned. Consequently, those in our society who cannot readily understand the conveyed information because of the manner of delivery are inherently at a disadvantage.
Even though our communication infrastructure, which includes universally accessible communication networks, has significantly facilitated our ability to disseminate information, these disadvantaged members of our society are increasingly becoming marginalized with respect to their participation in our communication systems. In many forums, the broadcast of information is often accompanied with captioned text. Although, the broadcast of text along with voice/audio may be helpful to those who are hearing impaired, such technology loses its effectiveness when used improperly. The broadcast text is generated either by a human listening to the voice communications between two or more individuals or listening to a broadcast of information in a public forum. The broadcast text can also be generated automatically with the use of voice recognition software. The voice or audio that is being converted to text many times is often polluted or adversely affected by noise or the undesired acoustic coupling of voices of two or more speakers speaking at the same time. In many scenarios involving human speech, it is the case that many times individuals engaged in conversation will talk over one another rendering the conversation unintelligible for periods of time. During these periods of unintelligibility, it becomes extremely difficult or nearly impossible for anyone involved and/or listening to the conversation to understand what is being said. Also, currently available voice recognition software, which are used to transcribe telephone conversations at the user locations, will suffer from the same periods of unintelligibility; this is because the voice recognition software equipment are typically positioned to hear the same acoustic signals as their human counterparts. Predictably, available voice recognition software lose much of their effectiveness in such cases especially when the conversation occurs over a telephone network. Such situations cannot be avoided, however, because of the very nature of human conversations. Therefore, ironically, the transcribing of telephone conversations intended to enhance the understanding of speech by the participating parties will have this flaw of periods of unintelligibility because of interfering acoustics when two or more parties to a telephone call speak at the same time.
The method, device and system of the present invention provide an Automatic Channel Selective Transcription Engine (ACSTE) that allows subscribers of the ACSTE to participate in a telephone call having two or more parties. The ACSTE is capable of generating associated text transcribed automatically from isolated transmit voice signals where said isolated transmit voice signals are derived from transmitted voice signals of each of the parties to the telephone call.
The ACSTE comprises a signal buffer coupled to a signal processor, which is coupled to a transcription processor. Each party to one or more telephone calls established by the ACSTE has a transmit voice channel and the signal buffer has access to each such voice channel. The signal buffer is capable of obtaining access to the transmit voice signals and replicating the transmit voice signals without any adverse effects on such signals as they traverse through their respective transmit voice channels. The signal buffer transfers the replicated signals to the signal processor, which processes the signals and isolates them from each other. The signal processor then inputs each processed, isolated and replicated transmit voice signal to the transcription processor, which transcribes automatically each of said signals into associated text signals and inputs them to a transmitter which formats the text signals for transmission over a text channel allocated by the ACSTE to a corresponding party (or parties) to the established telephone call.
Therefore, in addition to receiving the voice signals from another party to the telephone call, each party to each telephone call established by the ACSTE is also able to receive associated text of the received voice signals in real time where said text is not at all affected when two or more parties to an established telephone call speak simultaneously. This is because each party's voice is converted to a transmit signal that is transmitted through its own transmit voice channel. Likewise, the other party or parties to the telephone call each has their own transmit voice channel. Thus, regardless of whether two or more parties to the telephone call speak simultaneously, their respective transmit voice channels will contain their respective transmit voice signals and no other voice signals from the telephone call.
The method of the present invention comprises an ACSTE that performs the steps of replicating each transmitted voice signal of each party to a telephone call established by the ACSTE. The ACSTE then processes and isolates each replicated and processed transmit voice signals from each other. The transmit voice signals are then and transcribed automatically; that is, each processed, replicated and isolated transmit voice signal is transcribed into associated text formatted for transmission—over a text channel—to a corresponding party to the telephone call.
Further features and advantages of the present invention, as well as the structure and operation of various aspects of the present invention, are described in detail below with reference to the accompanying drawings.
The features and advantages of the present invention will become more apparent from the detailed description set forth below when taken in conjunction with the drawings in which like reference numbers indicate identical or functionally similar elements.
The method, device and system of the present invention provide an Automatic Channel Selective Transcription Engine (ACSTE) that allows subscribers of the ACSTE to participate in a telephone call having two or more parties participating in the call. The ACSTE is capable of generating associated text transcribed automatically from isolated transmit voice signals where said isolated transmit voice signals are derived from transmitted voice signals of each of the parties to the telephone call. Thus, even when two or more parties to a telephone call speak at the same time, their transmit voice channels contain their respective transmit voice signals, which are not adversely affected when two or more parties to the telephone call speaking simultaneously.
The ACSTE comprises a signal buffer coupled to a signal processor, which is coupled to a transcription processor. Each party to one or more telephone calls established by the ACSTE has a transmit voice channel and the signal buffer has access to each such channel. The signal buffer is capable of obtaining access to the transmit voice signals and replicating the transmit voice signals without any adverse effects on such signals as they traverse through their respective transmit voice channels. The signal buffer transfers the replicated signals to the signal processor, which processes the signals and isolates them from each other. The signal processor then inputs each processed, isolated and replicated transmit voice signal to the transcription processor, which transcribes automatically each of said signals into associated text signals and inputs them to a transmitter which formats the text signals for transmission over a text channel allocated by the ACSTE to a corresponding party (or parties) to the established telephone call.
Therefore, in addition to receiving the voice signals from another party to the telephone call, each party to each telephone call established by the ACSTE is also able to receive associated text of the received voice signals in real time where said text is not at all affected when two or more parties to an established telephone call speak simultaneously. This is because each party's voice is converted to a transmit signal that is transmitted through its own transmit voice channel. Likewise, the other party or parties to the telephone call each has their own transmit voice channel. Thus, regardless of whether two or more parties to the telephone call speak simultaneously, their respective transmit voice channels will contain their respective transmit voice signals and no other voice signals from the telephone call.
The method of the present invention comprises an ACSTE that performs the steps of replicating each transmitted voice signal of each party to a telephone call established by the ACSTE. The ACSTE then processes and isolates each replicated and processed transmit voice signals from each other. The transmit voice signals are then and transcribed automatically; that is each processed, replicated and isolated transmit voice signal is transcribed into associated text formatted for transmission—over a text channel—to a corresponding party to the telephone call.
The description of the system, device and method of the Automatic Channel Selective Transcription Engine of the present invention is disclosed herein using the following terms, terminology, definitions and abbreviations:
The associated text refers to readable text resulting from a voice recognition system (using voice recognition software and speech tuning algorithms) that analyzes and processes acoustics generated from voice signals. Alternatively, electrical signals representing voice signals can also be processed by such a voice recognition system to transcribe automatically electric signals representing words spoken by a person or words generated by a speaker or other voice-broadcasting device. In some instances, the associated text generated by a voice recognition system is reviewed by a human transcriber to correct for any errors.
The term “automatic” or “automatically” refers to a process of steps and/or act(s) performed by electrical, electronic or electromechanical devices, mechanical devices, machines or systems (including the ACSTE) in response to information inputted into such machines, devices or systems.
The communication network is any digital or analog network or any combination of such networks whereby transmission and reception of associated text, voice, video, and graphics can be achieved.
An owner/controlling entity of communication equipment is able to completely operate and/or control the operation of the equipment in any appropriate manner as deemed warranted by such entity or its agent. A legal entity such as a person or a corporation or any defined entity can own the equipment or control the equipment at any time or only during certain time periods.
The Registration server refers to a computer or computer system that allows subscribers or persons desiring to subscribe to the services being provided by the owner/operator of the ACSTE to register for provided services, which includes entering their profile information and selecting the services they desire. The registration server also authenticates a subscriber desiring services to confirm that the subscriber has already registered and has paid for the use of the services. In an Internet setting, the registration server can be accessible to subscribers and anyone having access to the Internet. A registered subscriber desiring to use the transcription services of the ACSTE can initially make the request via the Internet by identifying the parties (i.e., phone numbers) to the call and the type of services the call requires. The ACSTE can also provide regular telephone services (e.g., Internet services, telephone services with or without transcription services) in addition to text services to subscribers who have paid and registered for such services.
The profile information (inputted by a subscriber) is identification information and relevant personal information of a subscriber. The inputting of information is mostly implemented with GUI (Graphical User Interface) interfaces that are used to prompt a registering end user to input particular profile information. The profile information may include information that is unique to the subscriber to allow the registration server to authenticate the subscriber at a later time.
The authentication of a subscriber confirms the identity of a registered subscriber. The authentication is the act precedent to giving a registered subscriber or end user access to the ACSTE based on at least some of the profile information inputted during the registration process (or at a later time) of the registered subscriber or potential subscriber (i.e. an end user). In addition to profile information, unique information (e.g., password, answers to specific personal questions), of the end user or registered subscriber is typically used to allow the ACSTE to provide access to a registered subscriber and provide agreed upon services to the registered subscriber.
An end user is a person or entity capable of subscribing to the ACSTE and select from the types of services being provided by the ACSTE. Also a person or entity that is using the services of ACSTE via a communication network(s) accessible to the ACSTE and to the person or entity. An end user may be a person who has not completed registration or a person who has not registered at all.
Call setup information is a set of features selected by an end user during registration. The end user selects the features from an available set of features for voice and text services being provided by the ACSTE. In establishing a telephone call (incoming or outgoing) for the end user, ACSTE automatically applies the call setup information. A registered subscriber may edit, modify or adjust the call setup information at any time.
Call instruction information—in addition to call setup information, an end user also inputs instructions to the ACSTE on how telephone calls are to be handled during different circumstances. For example, an end user may want different call setup information applied to two-party calls and conference calls. The instructions direct the ACSTE on how to use the different types of call setup information and also may include additional requests by the end user that anticipate the occurrence certain circumstances during call establishment or during the telephone call itself that has already been established. A registered subscriber may edit, modify or adjust the call instruction information at any time.
A Server is a computer or computer system comprising of one or more processors, various blocks of memory, and supporting circuitry to process information and to interface with users or other servers.
A Registered Subscriber is an end user who has provided personal information, profile information and unique information in response to prompts from the Registration server during a registration process. The end user is given the opportunity to input additional appropriate information related to the services for which he/she desires. Also, the end user enters into some type of payment agreement with the ACSTE about the services he/she desires. The end user may also input call setup information and call instruction information during this registration process.
Telephone call—a communication link established between at least two parties each having a communication equipment (e.g., cell phone, telephone) allowing each of the equipment to transmit and/or receive voice, text, video, graphics and various other forms of information through operation of said equipment by an entity (e.g., one or more persons, communication equipment) where the information is transmitted over one or more communication networks in accordance with the standards and protocol of such networks.
Establishing a telephone call refers to the provision of various communication infrastructure equipment, communication channels, communication links and other resources owned and/or controlled (at least during a telephone call) by a service provider to effectuate communications between the parties to a telephone call as per the standards and protocols of the one or more networks through which the signals of the telephone call will traverse.
Party to a Telephone call—communication equipment used to transmit and receive signals (e.g., voice, video, text, graphic) over one or more communication networks. Also, a person or machine/device operating said communication equipment and using resources being provided by a service provider (e.g., an Internet Service Provider (ISP)) to effectuate a telephone call over one or more communication networks.
The term “couple” or “couple(d) to” as used herein refers to a path or a series of connected paths (permanent or temporary) that allows information (in one or more formats) or signals to flow from one point or equipment in a communication network to another point within the same equipment or another equipment in the same or different communication network in accordance with the protocol(s) of the communication network(s).
The discussion that follows will at times use specific examples of various aspects of the present invention for facilitating a clear explanation of the method, device and system of the present invention. The present invention is not all limited to the particular examples discussed herein. Some examples will be discussed in the context of communication network 100 being the Internet, and the equipment being used by the end users or registered subscribers are Internet enabled devices such as laptops, telephones, cellular phones (e.g., smart phones), tablets, desktop computers and other communication devices capable of gaining access to the Internet through the use of Internet browsers such as Explorer, Safari, Firefox, Windows Mobile, Netscape Navigator, Lynx, Symbian, and receive information not only in their original formats, but also in Java, Flash, HTTP/S, TEXT and XML formats which are typically used by the Internet and Internet enabled devices. The communication links coupling the end user equipment (i.e., Internet enabled devices) to the Internet and ultimately to ACSTE 108 include connections to the particular ISP's (Internet Service Provider) to which the end users subscribe. It will be readily understood to those of ordinary skill in the art to which this invention belongs that the present invention is not limited to the use of the Internet as a communication network and to the use of certain of the devices mentioned herein. Any communication network capable of conveying voice and text signals and any device (portable or non-portable) capable of transmitting and receiving text, voice, graphics, video and other types of signals can be used to practice the method, device and system of the present invention.
Referring to
Service provider 106, as well as ACSTE 108, provides telephone services to end users who are their respective subscribers. In addition to regular telephone services, however, ACSTE 108 also provides text services to its registered subscribers if they subscribe to and pay for such services. As such, whenever registered subscriber 104 desires to make a phone call, ACSTE 108 provides registered subscriber 104 a transmit voice channel (e.g., channel 110) and a text channel (e.g., channel 112). The transmit voice channel 110 operates in the same manner as any transmit voice channel from any service provider operating within communication network 100. The text channel provides text transmitted to the subscriber during a telephone call and also enables the subscriber to transmit text. The text is the transcription of the words spoken by the other party (party 102, for example) during a telephone call between parties 102 and 104. Registered subscriber 104 has access to ACSTE 108 because the entity that owns, controls and/or operates ACSTE 108 has advertised itself as a telephone service provider and registered subscriber 104 has agreed to pay for telephone and text services. ACSTE 108 provides its text and voice services to its subscribers for incoming calls and outgoing calls.
Continuing with the scenario shown in
In establishing the call, ACSTE 108 will first determine whether or not the calling party is a subscriber of ACSTE service, and if the calling party objects to the call being transcribed automatically and recorded. ACSTE 108 has several messages stored in its equipment where upon establishing a call originating from a calling party, it transmits one of these pre-stored messages informing the calling party that he/she is calling someone who is hearing impaired (or not hearing impaired, but who wants the call transcribed anyway) and thus the ensuing telephone conversation will be transcribed automatically in real time and recorded. ACSTE 108 may transcribe automatically the phone conversation and may not record the telephone conversation depending on call setup information and call instruction information entered by registered subscriber 104 during registration to ACSTE 108. In such a case, the pre-stored message to subscriber 102 may transmit the appropriate message to calling party 102. After the transmission of the pre-stored message, the calling party will be given a prompt to agree or not agree to continue with the call. If the calling party agrees then ACSTE 108 completes the establishment of the call. If the calling party disagrees, then ACSTE 108 informs the registered subscriber 104 (i.e., informs the registered subscriber for whom the incoming call was intended) that a calling party attempted to call but refused to continue with the call upon being informed that the call will be transcribed automatically and recorded. The subscriber will at least know that a certain person or entity attempted to call him/her but was not willing to continue with the call upon hearing the pre-stored message. In such a case ACSTE 108 may or may not identify the calling party in name or phone number depending on call instruction information and call setup information of registered subscriber 104. ACSTE 108 may allow a calling party unwilling to continue with a telephone call an opportunity to leave a brief voice message to registered subscriber 104.
Depending on whether the calling party is a registered subscriber, or whether the calling party is a first time caller, or whether the calling party is willing to continue with the transcribed call but does not agree to the recording, ACSTE 108 will handle the telephone call accordingly based on the call setup information and call instruction information of the respective parties (in this case only party 104 is a subscriber) to a telephone call. During registration when an end user desires to become a subscriber to ACSTE 108, the computer equipment (registration server to be discussed herein) of ACSTE 108 will ask questions and record answers regarding circumstances under which a subscriber will allow or not allow a call to be transcribed automatically and/or recorded. Not all subscribers to ACSTE 108 are necessarily hearing impaired or have speech impediments. Certain subscribers (having no hearing impairment) for whatever reason may feel that they want the option to have a call transcribed automatically and/or recorded.
In some cases, the registered subscriber may not be deaf, but may have a severe difficulty hearing and thus needs to see the text to help bolster his/her comprehension of what is being said by the other party. Also, the registered subscriber may have a speech impediment to such an extent that his/her speech is unintelligible. In such a circumstance, presumably the telephone or communication equipment being used by the registered subscriber has the capacity to allow him/her to type his/her responses during a telephone conversation. Further, assuming that the calling party has agreed to continue with the telephone call and has a communication device (e.g., an IP phone) for typing text and for receiving streaming text during a telephone call, then the conversation can still occur between the calling party and a deaf or hearing impaired person with a severe speech impediment. Also there may be cases where neither party has a speech impediment or a hearing impairment, but both parties still desire equipment that can send and receive streaming text.
ACSTE 108 has a translation feature (to be discussed herein), which the two parties can invoke to have a conversation wherein both parties are speaking in their respective languages and can still understand each other because their respective received text have been translated. Furthermore, the ACSTE 108 may have a speech synthesizer feature (not shown) wherein the translated streamed text is fed into a voice synthesizer, which converts the translated text into synthesized transmit voice signals. Each party to the telephone call can listen to voice spoken words from the other party or parties to the telephone call. In such a case, the ACSTE 108 would turn off the actual voice of the other party being received over the telephone voice channels and ACSTE 108 would route instead the synthesized transmit voice signals over the transmit voice channel of each of the parties to the telephone call. In such a scenario, each of the parties to a two-party call or to a conference call (involving two or more parties) can listen to the words (via a voice synthesizer) spoken by the other parties (and also see the text) in a language they can understand. As such, two or more parties speaking different languages can effectively communicate with each other in real time.
Still referring to
Referring now to
Referring now to
In another embodiment, the transmit voice signals may be optical signals, electromagnetic signals or other types of signals that can be converted to electrical signals by signal buffer 302 and isolated from each other by signal processor 304 which processes them for input to transcription processor 306. Transcription processor 306 may be implemented as a computer having a processor and memory and programmed with various speech analysis and voice/speech recognition software and additional circuitry that converts the signals from the signal processor 302 to electrical signals. Transcription processor 306 may also be implemented totally in hardware or a combination of hardware, software (e.g., voice/speech recognition software and speech tuning algorithms) and firmware that can process analog signals, digital signals, optical signals or electromagnetic signals to generate text signals.
In yet another embodiment the output of signal buffer 302 may be acoustic signals generated by signal buffer 300 and positioned so that such signals can be ‘heard’ by transcription processor 306. In particular, the acoustic signals are the replicated signals that have been amplified and/or processed and applied to some type of transducer (not shown) that converts them to acoustic waves. The transducer can be a speaker or other sound generating transducer device that converts the replicated electrical signals to acoustic waves. The replicated signals may have been digital signals in which case they are first converted to analog signals by signal buffer 300, before they are processed and applied to transducers (e.g., speakers) to generate acoustic signals, which are acoustically isolated from each other by signal processor 304. The generation of the acoustic signals may be performed by circuitry included in signal buffer 300. The acoustic signals are physically isolated from each other (this is done by signal processor 304) and any other acoustic wave, so that acoustic waves generated by the transducer are not affected in any way, or are not coupled to any other acoustic wave. The acoustic signals are then acoustically coupled to transcription processor 306 via a medium (e.g., air) through which they can travel without any appreciable distortion. Transcription processor 306 may, inter alia, contain its own set of transducers that allows it to ‘hear’ the acoustic signals and convert them to electrical signals. Further, the transcription processor 306 may contain speech processors and other processors that allow it to interpret the sounds so as to generate text signals in accordance with one or more algorithms and/or speech analysis/processing software residing in the transcription processor 306.
Transcription processor 306, after having converted the replicated signals to text signals, transfers such text signals to transmitter 310 which places the text signals in their corresponding text channels T1, . . . , TM (where M is an integer equal to 1 or greater) to a corresponding party or parties to a telephone call established by ACSTE 108 based on call instruction information and/or call setup information from registration server 308. It should be noted that the number of text channels M and the number of transmit voice channels N are not necessarily equal to each other and neither one is necessarily greater than or less than the other. It should also be noted that the text channels may be full duplex channels. The information from registration server 308 may be call instruction information that can be received by transcription processor 306 via signal processor 304. The information from registration server 308 may also be any type of information pertinent to establishing a telephone call including information directing the transmitter 310 on how many channels are to be allocated to a particular telephone call. Alternatively, information (e.g., call instruction information, call setup information) can be inputted into transcription processor 306 by registration server 308 via a direct path (not shown). The call setup information and call instruction information are also provided to transmitter 310. The text channels, T1, . . . , TM are generated and allocated by transmitter 310 in accordance with call setup information and/or call instruction information received from registration server 308.
Transmitter 310 is also coupled to translator 312, which takes text as its input text in one language and outputs text in another language as requested by transmitter 310 based on information (i.e., call setup and/or call instruction information) received from registration server 308. Translator 312 may be implemented as server capable of receiving text information in one language and process the received text using language translation software. Translator 312 then transfers the translated text back to transmitter 310, which formats the translated text in accordance with communication standards and protocols of communication network 100. Transmitter 310 allocates the requisite number of text channels for the call as per instruction information received from registration server 308 and arranges and places the translated text in a corresponding text channel or channels for transmission to a corresponding party or parties to a telephone call established by ACSTE 108. Transmitter 310 is able to send and receive information to and from registration server 308 and translator 312 to implement proper processing of the text signals to be transmitted over communication network 100. It should be noted that the transmit voice signals associated with the telephone call established by ACSTE 108 are undisturbed as they flow through their respective transmit voice channels C1, . . . , CN unless one of the parties to a telephone call established by ACSTE 108 desires to receive synthesized transmit voice signals instead of the actual voice signals. In such a case, ACSTE 108 will insert synthesized transmit voice signals (e.g., voice signals synthesized from text translated to another language) into the respective transmit voice channel for that particular party.
It should be noted that ACSTE 108 preferably comprises signal buffer 302, signal processor 304 and transcription processor 306 coupled to each other as shown in
Referring now to
In steps 402 and 404, the ACSTE 108 is continually monitoring its site and equipment to detect new calls, new subscribers and/or subscribers wishing to place a call and/or update their information. In particular, in step 402, registration server is monitoring for any request from an end user to register or a request from a subscriber to update its profile information. If there is a request to register, the method of the present invention moves to step 404 where the end user enters its profile information (e.g., name, email address, credit card information, type of service(s) sought—e.g., voice, translation, conference call), instructions for any special services, call setup information, call instruction information or other pertinent information associated with identifying specific services. The end user also provides unique information (e.g., passwords, answers to security questions) to ACSTE 108 to allow the ACSTE 108 to authenticate (i.e., confirm identity of registered subscriber) the subscriber in future logins to the ACSTE website. As part of the subscriber's call setup information, the subscriber may also want a message to be sent out to all incoming and outgoing calls that their telephone conversation will be transcribed automatically and recorded. Also the ACSTE may give incoming callers the option to decline the recording of their conversation with a registered subscriber. Further, the registered subscriber may desire to have the ability to set up conference calls with participants who can use the translation feature of ACSTE 108. The call (incoming or outgoing) may be between two subscribers of ACSTE 108, but with conflicting call setup information. In such cases, the ACSTE will give both subscribers an opportunity to adjust or temporarily change their call setup information to allow their telephone call to occur.
In step 406, the ACSTE 108 monitors if it has received a request to make a telephone call. If it has not, it then continues to monitor for call requests, requests for registration and requests for updating profiles and other data from already registered subscribers. A call request may be in the form of an end user dialing the telephone number of a subscriber and the call is routed to the ACSTE 108 (similar to the scenario discussed with respect to
In step 408, the ACSTE 108 determines if at least one of the end users is a registered subscriber. In the case of a conference call (i.e., more than two parties participating in the call) the ACSTE 108 determines if the party originating the call is a subscriber. For a telephone call with two parties and at least one of the parties is a registered subscriber, then the method of the present invention moves to step 416. For a conference call where the party originating the call is a registered subscriber, the method of the present invention also moves to step 416. If, however, both parties to a telephone call are not registered subscribers, or the end user originating a conference call is not a registered subscriber, the method of the present invention moves to step 410 where it offers the non-subscriber party (or parties) to a telephone call an opportunity to become a subscriber. The opportunity may be in the form of a message followed by, for example, a yes/no question where the end user(s) (or end user originating a conference call) is prompted to press or touch a number (on his/her keypad) for ‘yes’ and another number for ‘no.’ In step 412 the ACSTE determines if at least one end user has indicated an interest in becoming a registered subscriber. If so, the method of the present invention moves to step 414 where an end user(s) who answered in the affirmative will be provided with information (e.g., toll free telephone number and/or a website), specifying where the end user can visit on the Internet to have the opportunity to register and become a registered subscriber. The end user may be referred to an ACSTE 108 website, or a toll free number or both. The method of the present invention then moves to step 416. Returning back to step 412, if none of the end users have accepted the offer, the method of the present invention also moves to step 416.
In step 416, ACSTE 108 finishes its establishment of the telephone and proceeds to determine whether it will process the call for registered subscribers or nonsubscribers. In particular, ACSTE 108 determines whether at least one of the parties to a telephone call is a registered subscriber or the party originating a conference call is a registered subscriber. If so, the method of the present invention moves to steps 420 to 424. Otherwise the method of the present invention moves to step 416.
In step 418, the method of the present invention as performed by ACSTE 108 provides regular telephone service (without the texting features) to the participating parties to the telephone call. However, for a conference call in which the end user originating the call is not a registered subscriber, any one or more of the other participants in the conference call who are registered subscribers will receive text services as per their call instruction information, call setup information and any other pertinent information inputted by such registered subscribers into the registration server.
In step 420, ACSTE 108 begins providing telephone service with the text features to the parties to a telephone call established by ACSTE 108. Specifically, in step 420, signal buffer 302 replicates the transmit voice signals of the transmit channels associated with the telephone call being processed. In step 422, the replicated transmit voice signals are transferred to signal processor 304, which processes the signals and isolates them from each other and any source of noise. The isolated processed replicated transmit voice signals are then inputted to transcription processor 306. In step 424, transcription processor 306 transcribes automatically the signals into associated text signals and transfers the text signals to transmitter 310. Transmitter 310 generates the requisite number of transmit text channels for transmission of the text signals to the proper parties to the telephone call based on call setup information, call instruction information and other pertinent information from registration server 308. The text signals can be translated first to a language or languages specified by various parties to the telephone call. Further, the translated text signals can be applied to a voice synthesizer and the resulting voice signals are transmitted through the corresponding transmit voice channels of the telephone call.
While various aspects of the present invention have been described above, it should be understood that they have been presented by way of example and not limitation. It will be apparent to persons skilled in the relevant art(s) that various changes in form and detail can be made therein without departing from the spirit and scope of the present invention. Thus, the present invention should not be limited by any of the above described exemplary aspects, but should be defined only in accordance with the following claims and their equivalents.
In addition, it should be understood that the figures in the attachments, which highlight the structure, methodology, functionality and advantages of the present invention, are presented for example purposes only. The present invention is sufficiently flexible and configurable, such that it may be implemented in ways other than that shown in the accompanying figures.