This relates generally to electronic devices, including but not limited to electronic devices with speech-to-text (STT) processing capabilities.
Computers and other electronic devices have become increasingly important tools in education today. Electronic versions of reading materials, such as textbooks, articles, compositions, stories, reading assignments, lecture notes, etc., are frequently used in class for reading and discussion purposes. Some electronic reading devices display reading materials in a way that gives the electronic reading material the look and feel of a real paper book (e.g., an eBook with “flip-able” pages). Some electronic reading devices also provide additional functionalities that allow the reader to interact with the reading materials, such as marking and annotating the reading materials electronically. Some electronic reading devices have text-to-speech (TTS) functionalities that can “speak” the text of the reading materials aloud to the user; for example, a child can have a story read to him or her by such a device.
Conventional electronic reading devices are suitable for readers who are capable of and/or prefer to read independently of others. However, in some environments, collaborative or group reading may be more beneficial to a reader than solo reading. For example, in a classroom environment, a group of children may participate in collaborative reading of a single story, with each child reading only a portion of the whole story. In another example, at home, a parent may read part of a story to a child, while allowing the child to participate in reading the remainder of the story. Existing electronic reading devices are inadequate for providing an easy, intuitive, fun, interactive, versatile, and/or educational way of organizing the group or collaborative reading of multiple readers in the same group reading session.
Accordingly, there is a need for electronic devices with faster, more intuitive, and more efficient methods and interfaces for facilitating collaborative reading in a group reading environment. Such methods and interfaces may complement or replace conventional methods for displaying electronic reading materials on user devices. Such devices, methods, and interfaces increase the efficiency, organization, and interactivity of the group reading session, and enhance the learning experience and enjoyment of the users during group reading.
In some embodiments, the device is a desktop computer. In some embodiments, the device is a portable computing device (e.g., a notebook computer, tablet computer, or handheld device). In some embodiments, the device has a touchpad. In some embodiments, the device has a touch-sensitive display (also known as a “touch screen” or “touch screen display”). In some embodiments, the device has a graphical user interface (GUI), one or more processors, memory and one or more modules, programs or sets of instructions stored in the memory for performing multiple functions. In some embodiments, the user interacts with the GUI primarily through finger contacts and gestures on the touch-sensitive surface. In some embodiments, the user interacts with the device primarily through a voice interface.
In some embodiments, the functions provided by the device optionally include one or more of designing a group reading plan, establishing a collaborative reading group comprising multiple user devices, handing off reading control to another device, taking over reading control from another device, displaying reading prompts, providing reading aids, evaluating reading quality, providing annotation tools, generating additional reading exercises, changing the plot and/or other aspects of the reading material, displaying reading material and graphical illustrations associated with the reading materials, and so on. Executable instructions for performing these functions are optionally included in a non-transitory computer readable storage medium or other computer program product configured for execution by one or more processors.
In accordance with some embodiments, a method is performed at an electronic device having one or more processors, memory, and a display. The method includes receiving a selection of text to be read in a group reading session; identifying a plurality of participants for the group reading session; and upon receiving the selection of the text and the identification of the plurality of participants, automatically, without user intervention, generating a reading plan for the group reading session, wherein the reading plan divides the text into a plurality of reading units and assigns at least one reading unit to each of the plurality of participants in accordance with a comparison between a respective difficulty level of the at least one reading unit and a respective reading ability level of the participant.
In accordance with some embodiments, a method is performed at a first client device associated with a first user, the first client device having one or more processors and memory. The method includes: registering with a server of the group reading session to participate in the group reading session; upon successful registration, receiving at least a partial reading plan from the server, wherein the partial reading plan divides text to be read in the reading session into a plurality of reading units and assigns at least a first reading unit of a pair of consecutive reading units to the first user, and a second reading unit of the pair of consecutive reading units to a second user; upon receiving a first start signal for the reading of the first reading unit, displaying a first reading prompt at a respective start location of the first reading unit currently displayed at the first client device; monitoring progress of the reading of the first reading unit based on a speech signal received from the first user; and in response to detecting that the reading of the first reading unit has been completed: ceasing to display the first reading prompt at the first client device; and sending a second start signal to a second client device associated with the second user, the second start signal causing a second reading prompt to be displayed at a respective start location of the second reading unit currently displayed at the second client device.
In accordance with some embodiments, a method is performed at a device having one or more processors, memory, and a display. The method includes: receiving a first reading assignment comprising text to be read or recited aloud by a user; receiving a first speech signal from the user reading or reciting the text of the first reading assignment; evaluating the first speech signal against the text to identify one or more areas for improvement; and based on the evaluating, generating a second reading assignment providing additional practice opportunities tailored to the identified one or more areas for improvement.
In accordance with some embodiments, a method is performed at a first device having one or more processors, memory, and a display. The method includes: displaying text of a first segment of a multi-segment textual document on the first device, the text including one or more keywords each associated with a respective portion of a first graphical illustration for the first segment of the multi-segment textual document; detecting a first speech signal reading the first segment of the multi-segment textual document; upon detecting each of the one or more keywords in the first speech signal, sending a respective first illustration signal to a second device, wherein the respective first illustration signal causes the respective portion of the graphical illustration associated with the keyword to be displayed on the second device.
The embodiments described in this specification may realize one or more of the following advantages. In some embodiments, text for reading in a group reading session is automatically divided and assigned to the anticipated participants of the group reading session. The text division and assignment are customized based on the difficulty of the text and the reading abilities of the participants. The instructor of the group reading session optionally selects different assignment modes (e.g., challenge mode, encouragement mode, and reinforcement mode) based on the particular temperament and performance of individual students, making the automatic division and assignment of the reading units better suited to a real teaching environment. During a group reading session, a reading prompt is automatically provided on a particular user's device, saving valuable class time that would otherwise be spent picking a student to participate in the reading. In addition, the reading prompt is only displayed on a particular student's device when it is that student's turn to read, saving valuable class time that would otherwise be spent by the student looking for the correct section to read when he or she is called on. Various visual aids and real-time feedback are provided to both the listening participants and the reading participant of the group reading session. A customized reading assignment is automatically generated for each student, such that the student can practice the weaker areas identified during the group reading. Each individual device can partially take over the teacher's role in evaluating the student's performance in completing the customized reading assignment, saving the instructor valuable time. Various study aids and annotation tools can be provided to the user during the user's completion of the customized homework assignment. The embodiments described in this specification can be used in many settings outside of the classroom or school environment as well. In professional and private sessions, the embodiments described in this specification provide a better learning experience, and allow the user to better enjoy reading on an electronic device.
The details of one or more embodiments of the subject matter described in this specification are set forth in the accompanying drawings and the description below. Other features, aspects, and advantages of the subject matter will become apparent from the description, the drawings, and the claims.
For a better understanding of the aforementioned embodiments of the invention as well as additional embodiments thereof, reference should be made to the Description of Embodiments below, in conjunction with the drawings in which like reference numerals refer to corresponding parts throughout the figures.
Embodiments of electronic devices, user interfaces for such devices, and associated processes for using such devices are described. In some embodiments, the device is a portable communications device, such as a mobile telephone, that also contains other functions, such as PDA and/or music player functions. Exemplary embodiments of portable multifunction devices include, without limitation, the iPhone®, iPod Touch®, and iPad® devices from Apple Inc. of Cupertino, Calif. Other portable electronic devices, such as laptops or tablet computers with touch-sensitive surfaces (e.g., touch screen displays and/or touch pads), may also be used. It should also be understood that, in some embodiments, the device is not a portable communications device, but is a desktop computer with a touch-sensitive surface (e.g., a touch screen display and/or a touch pad).
In the discussion that follows, an electronic device that includes a display (e.g., a touch-sensitive display screen) is described. It should be understood, however, that the electronic device may include one or more other physical user-interface devices, such as a physical keyboard, a mouse and/or a joystick.
The device typically supports a variety of applications, such as one or more of the following: a drawing application, a presentation application, a word processing application, a spreadsheet application, a gaming application, a telephone application, a video conferencing application, an e-mail application, an instant messaging application, a photo management application, a digital camera application, a digital video camera application, a web browsing application, a digital music player application, and/or a digital video player application. The device particularly supports an application, such as an eBook reader application, a Portable Document Format (PDF) reader application, and other electronic book reader applications, that is capable of displaying an electronic textual document in one or more formats (e.g., *.txt, *.pdf, *.rar, *.zip, *.tar, *.aeh, *.html, *.djvu, *.epub, *.pdb, *.fb2, *.xeb, *.ceb, *.ibooks, *.exe, BBeB, and so on). In some embodiments, the device also supports display of one or more graphical illustrations, animations, sounds, and widgets associated with the electronic textual document.
Attention is now directed toward embodiments of portable devices with touch-sensitive displays.
It should be appreciated that device 100 is only one example of a portable multifunction device, and that device 100 may have more or fewer components than shown, may combine two or more components, or may have a different configuration or arrangement of the components. The various components shown in
Memory 102 optionally includes high-speed random access memory and/or non-volatile memory, such as one or more magnetic disk storage devices, flash memory devices, or other non-volatile solid-state memory devices. Access to memory 102 by other components of device 100, such as CPU 120 and the peripherals interface 118, is optionally controlled by memory controller 122.
Peripherals interface 118 can be used to couple input and output peripherals of the device to CPU 120 and memory 102. The one or more processors 120 run or execute various software programs and/or sets of instructions stored in memory 102 to perform various functions for device 100 and to process data.
In some embodiments, peripherals interface 118, CPU 120, and memory controller 122 are optionally implemented on a single chip, such as chip 104. In some other embodiments, they may be implemented on separate chips.
RF (radio frequency) circuitry 108 receives and sends RF signals, also called electromagnetic signals. RF circuitry 108 converts electrical signals to/from electromagnetic signals and communicates with communications networks and other communications devices via the electromagnetic signals. RF circuitry 108, optionally, includes well-known circuitry for performing these functions, including but not limited to an antenna system, an RF transceiver, one or more amplifiers, a tuner, one or more oscillators, a digital signal processor, a CODEC chipset, a subscriber identity module (SIM) card, memory, and so forth. RF circuitry 108 may communicate with networks, such as the Internet, also referred to as the World Wide Web (WWW), an intranet and/or a wireless network, such as a cellular telephone network, a wireless local area network (LAN) and/or a metropolitan area network (MAN), and other devices by wireless communication. The wireless communication may use any of a plurality of communications standards, protocols and technologies, including but not limited to Global System for Mobile Communications (GSM), Enhanced Data GSM Environment (EDGE), high-speed downlink packet access (HSDPA), high-speed uplink packet access (HSUPA), wideband code division multiple access (W-CDMA), code division multiple access (CDMA), time division multiple access (TDMA), Bluetooth, Wireless Fidelity (Wi-Fi) (e.g., IEEE 802.11a, IEEE 802.11b, IEEE 802.11g and/or IEEE 802.11n), voice over Internet Protocol (VoIP), Wi-MAX, a protocol for e-mail (e.g., Internet message access protocol (IMAP) and/or post office protocol (POP)), instant messaging (e.g., extensible messaging and presence protocol (XMPP), Session Initiation Protocol for Instant Messaging and Presence Leveraging Extensions (SIMPLE), Instant Messaging and Presence Service (IMPS)), and/or Short Message Service (SMS), or any other suitable communication protocol.
Audio circuitry 110, speaker 111, and microphone 113 provide an audio interface between a user and device 100. Audio circuitry 110 receives audio data from peripherals interface 118, converts the audio data to an electrical signal, and transmits the electrical signal to speaker 111. Speaker 111 converts the electrical signal to human-audible sound waves. Audio circuitry 110 also receives electrical signals converted by microphone 113 from sound waves. Audio circuitry 110 converts the electrical signal to audio data and transmits the audio data to peripherals interface 118 for processing. Audio data is optionally retrieved from and/or transmitted to memory 102 and/or RF circuitry 108 by peripherals interface 118. In some embodiments, audio circuitry 110 also includes a headset jack (e.g., 212,
I/O subsystem 106 couples input/output peripherals on device 100, such as touch screen 112 and other input control devices 116, to peripherals interface 118. I/O subsystem 106 optionally includes display controller 156 and one or more input controllers 160 for other input or control devices. The one or more input controllers 160 receive/send electrical signals from/to other input or control devices 116. The other input control devices 116 optionally include physical buttons (e.g., push buttons, rocker buttons, etc.), dials, slider switches, joysticks, click wheels, and so forth. In some alternate embodiments, input controller(s) 160 may be coupled to any (or none) of the following: a keyboard, infrared port, USB port, and a pointer device such as a mouse. The one or more buttons (e.g., 208,
Touch-sensitive display 112 provides an input interface and an output interface between the device and a user. Display controller 156 receives and/or sends electrical signals from/to touch screen 112. Touch screen 112 displays visual output to the user. The visual output may include graphics, text, icons, video, and any combination thereof (collectively termed “graphics”). In some embodiments, some or all of the visual output may correspond to user-interface objects.
Touch screen 112 has a touch-sensitive surface, sensor or set of sensors that accepts input from the user based on haptic and/or tactile contact. Touch screen 112 and display controller 156 (along with any associated modules and/or sets of instructions in memory 102) detect contact (and any movement or breaking of the contact) on touch screen 112 and convert the detected contact into interaction with user-interface objects (e.g., one or more soft keys, icons, web pages or images) that are displayed on touch screen 112. In an exemplary embodiment, a point of contact between touch screen 112 and the user corresponds to a finger of the user.
Touch screen 112 may use LCD (liquid crystal display) technology, LPD (light emitting polymer display) technology, or LED (light emitting diode) technology, although other display technologies may be used in other embodiments. Touch screen 112 and display controller 156 may detect contact and any movement or breaking thereof using any of a plurality of touch sensing technologies now known or later developed, including but not limited to capacitive, resistive, infrared, and surface acoustic wave technologies, as well as other proximity sensor arrays or other elements for determining one or more points of contact with touch screen 112. In an exemplary embodiment, projected mutual capacitance sensing technology is used, such as that found in the iPhone®, iPod Touch®, and iPad® from Apple Inc. of Cupertino, Calif.
Touch screen 112 may have a video resolution in excess of 100 dpi. In some embodiments, the touch screen has a video resolution of approximately 160 dpi. The user may make contact with touch screen 112 using any suitable object or appendage, such as a stylus, a finger, and so forth. In some embodiments, the user interface is designed to work primarily with finger-based contacts and gestures, which can be less precise than stylus-based input due to the larger area of contact of a finger on the touch screen. In some embodiments, the device translates the rough finger-based input into a precise pointer/cursor position or command for performing the actions desired by the user.
In some embodiments, in addition to the touch screen, device 100 may include a touchpad (not shown) for activating or deactivating particular functions. In some embodiments, the touchpad is a touch-sensitive area of the device that, unlike the touch screen, does not display visual output. The touchpad may be a touch-sensitive surface that is separate from touch screen 112 or an extension of the touch-sensitive surface formed by the touch screen.
Device 100 also includes power system 162 for powering the various components. Power system 162 may include a power management system, one or more power sources (e.g., battery, alternating current (AC)), a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator (e.g., a light-emitting diode (LED)) and any other components associated with the generation, management and distribution of power in portable devices.
Device 100 may also include one or more optical sensors 164.
Device 100 may also include one or more proximity sensors 166.
Device 100 may also include one or more accelerometers 168.
In some embodiments, the software components stored in memory 102 include operating system 126, communication module (or set of instructions) 128, contact/motion module (or set of instructions) 130, graphics module (or set of instructions) 132, text input module (or set of instructions) 134, Global Positioning System (GPS) module (or set of instructions) 135, speech-to-text (STT) module 136 (or set of instructions), text-to-speech (TTS) module (or set of instructions) 137, and applications (or sets of instructions) 138.
Operating system 126 (e.g., Darwin, RTXC, LINUX, UNIX, OS X, WINDOWS, or an embedded operating system such as VxWorks) includes various software components and/or drivers for controlling and managing general system tasks (e.g., memory management, storage device control, power management, etc.) and facilitates communication between various hardware and software components.
Communication module 128 facilitates communication with other devices over one or more external ports 124 and also includes various software components for handling data received by RF circuitry 108 and/or external port 124. External port 124 (e.g., Universal Serial Bus (USB), FIREWIRE, etc.) is adapted for coupling directly to other devices or indirectly over a network (e.g., the Internet, wireless LAN, etc.). In some embodiments, the external port is a multi-pin (e.g., 30-pin) connector that is the same as, or similar to and/or compatible with the 30-pin connector used on iPod (trademark of Apple Inc.) devices.
Contact/motion module 130 may detect contact with touch screen 112 (in conjunction with display controller 156) and other touch sensitive devices (e.g., a touchpad or physical click wheel). Contact/motion module 130 includes various software components for performing various operations related to detection of contact, determining if there is movement of the contact and tracking the movement across the touch-sensitive surface, and determining if the contact has ceased. Contact/motion module 130 receives contact data from the touch-sensitive surface. Determining movement of the point of contact, which is represented by a series of contact data, may include determining speed (magnitude), velocity (magnitude and direction), and/or an acceleration (a change in magnitude and/or direction) of the point of contact. These operations may be applied to single contacts (e.g., one finger contacts) or to multiple simultaneous contacts (e.g., “multi-touch”/multiple finger contacts). In some embodiments, contact/motion module 130 and display controller 156 detect contact on a touchpad. Contact/motion module 130 may detect a gesture input by a user.
Graphics module 132 includes various known software components for rendering and displaying graphics on touch screen 112 or other display, including components for changing the intensity of graphics that are displayed. As used herein, the term “graphics” includes any object other than raw text that can be displayed to a user, including without limitation stylized text, web pages, icons (such as user-interface objects including soft keys), digital images, videos, animations and the like.
In some embodiments, graphics module 132 stores data representing graphics to be used. Each graphic may be assigned a corresponding code. Graphics module 132 receives, from applications etc., one or more codes specifying graphics to be displayed along with, if necessary, coordinate data and other graphic property data, and then generates screen image data to output to display controller 156.
Text input module 134, which may be a component of graphics module 132, provides soft keyboards for entering text in various applications (e.g., contacts 139, e-mail 142, IM 143, browser 148, and any other application that needs text input).
GPS module 135 determines the location of the device and provides this information for use in various applications (e.g., to telephone 138 for use in location-based dialing, to camera 143 as picture/video metadata, and to applications that provide location-based services such as weather widgets, local yellow page widgets, and map/navigation widgets).
Speech-to-Text (STT) module 136 converts (or employs a remote service to convert) speech signals captured by the microphone 113 into text. In some embodiments, the speech-to-text module 136 processes the speech signal in light of acoustic and/or language models built on a limited corpus of text, such as the text within a textbook or storybook stored on the device 100. With a limited corpus of text, the speech-to-text conversion or recognition can be performed with less processing power and memory at the device 100, and without employing a remote service. The speech-to-text (STT) module 136 is optionally used by any of the applications 138 supporting speech-based inputs. In particular, the group reading applications 149 and various components thereof use the STT module to process the user's speech signals, and trigger various functions and outputs based on the result of the STT processing.
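As a rough illustration of the limited-corpus idea, the following Python sketch restricts hypothesis selection to the vocabulary of a stored book; the function names and the rescoring heuristic are illustrative assumptions, not the actual implementation of STT module 136:

```python
import re

def build_vocabulary(book_text):
    """Collect the distinct words of the stored book (the limited corpus)."""
    return set(re.findall(r"[a-z']+", book_text.lower()))

def pick_hypothesis(hypotheses, vocabulary):
    """Prefer the recognition hypothesis whose words best match the corpus;
    a crude stand-in for rescoring with a corpus-specific language model."""
    def in_corpus_ratio(text):
        words = re.findall(r"[a-z']+", text.lower())
        return sum(w in vocabulary for w in words) / max(len(words), 1)
    return max(hypotheses, key=in_corpus_ratio)

vocab = build_vocabulary("The quick brown fox jumps over the lazy dog.")
# Two competing acoustic hypotheses; corpus-aware rescoring picks the second.
print(pick_hypothesis(["the quick brawn sox", "the quick brown fox"], vocab))
```

Because the search space is limited to the book's vocabulary, such rescoring can run on-device without a remote service.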
Text-to-Speech (TTS) module 137 converts (or employs a remote service to convert) text (e.g., text of an electronic story book, text extracted from a webpage, text of a textual document, text associated with a user interface element, text associated with a system notification event, etc.) into speech signals. In some embodiments, the text-to-speech module 137 provides the speech signal to the audio circuitry 110, and the speech signal is output through the speaker 111 to the user. In some embodiments, the text-to-speech module 137 is used to generate a sample reading, or to support a virtual reader that participates in the group reading along with the human participants.
Applications 138 may include the following modules (or sets of instructions), or a subset or superset thereof: contacts module 139; telephone module 140; video conferencing module 141; e-mail client module 142; instant messaging (IM) module 143; camera module 144 for still and/or video images; image management module 145; video and music player module 146; notes module 147; and browser module 148.
In some embodiments, applications 138 stored in memory 102 also include one or more group reading applications 149. The group reading applications 149 include various modules to facilitate various functions useful in a group reading session. In some embodiments, the group reading applications 149 include one or more of: a group reading organizer module 150, a group reading participant module 151, a reading plan generator module 152, an assignment receiver module 153, an assignment checker module 154, a text displayer module 155, an illustration displayer module 156, a reader switching module 157, a reading material selection module 158, and a reading material storing module 159. Not all of the modules 150-159 need to be included in a particular embodiment. Some functions of one or more modules 150-159 may be combined into the same module or divided among several modules. More details of the various group reading applications 149 are described with respect to
Each of the above identified modules and applications correspond to a set of executable instructions for performing one or more functions described above and the methods described in this application (e.g., the computer-implemented methods and other information processing methods described herein). These modules (i.e., sets of instructions) need not be implemented as separate software programs, procedures or modules, and thus various subsets of these modules may be combined or otherwise re-arranged in various embodiments. In some embodiments, memory 102 may store a subset of the modules and data structures identified above. Furthermore, memory 102 may store additional modules and data structures not described above.
Device 100 may also include one or more physical buttons, such as “home” or menu button 204. As described previously, menu button 204 may be used to navigate to any application 138 in a set of applications that may be executed on device 100. Alternatively, in some embodiments, the menu button is implemented as a soft key in a GUI displayed on touch screen 112.
In one embodiment, device 100 includes touch screen 112, menu button 204, push button 206 for powering the device on/off and locking the device, volume adjustment button(s) 208, Subscriber Identity Module (SIM) card slot 210, head set jack 212, and docking/charging external port 124. Push button 206 may be used to turn the power on/off on the device by depressing the button and holding the button in the depressed state for a predefined time interval; to lock the device by depressing the button and releasing the button before the predefined time interval has elapsed; and/or to unlock the device or initiate an unlock process. In an alternative embodiment, device 100 also may accept verbal input for activation or deactivation of some functions through microphone 113.
Each of the above identified elements in
Attention is now directed towards embodiments of user interfaces (“UI”) and associated processes that may be implemented on an electronic device, such as device 300 or portable multifunction device 100.
In the process 400, a group reading plan is generated for a group reading session before the start of the group reading session. For example, an instructor optionally invokes the process 400 before a class, and generates a text reading plan for use during the class. In another example, a parent optionally invokes the process 400 before a story session with his/her children, and generates a story reading plan for the story session with his/her children. In another example, a director of a school play optionally generates a script reading plan for later use during a rehearsal. In another example, a book club organizer optionally invokes the process 400 before a book club meeting to generate a book reading plan for use during the club meeting. The process 400 may also be used for other group reading settings, such as bible studies, study groups, and foreign language training, etc.
Referring to
In some embodiments, the primary user device provides a reading plan generator interface (e.g., UI 502 shown in
In some embodiments, the reading plan generator interface 502 is provided over a network, and through a web interface. In some embodiments, the web interface provides a log-in process, and the text selection input is automatically populated for the user based on the login information entered by the user. For example, if a reading material has been assigned to a particular reading group associated with the user, the text selection input area provided by the UI element 504 is automatically populated for the user, when the user provides the proper login information to access the reading plan generator interface 502.
In some embodiments, the text to be read during a particular reading session is predetermined based on the current date. For example, in some embodiments, a front page news article of the current day is automatically selected as the text for reading in a group reading session that is to occur on the current day or the next day.
Referring back to
In this particular example, as shown in
Referring back to
In some embodiments, the reading plan divides the selected text and assigns the resulting reading units in accordance with a comparison between a respective difficulty level of the reading unit(s) and a respective reading ability level(s) of the participant(s). For example, for a group of children with lower reading ability levels, the reading plan optionally divides the text into a number of reading units such that each child is assigned several shorter and easier segments of text to read during the group reading session. In contrast, for a group of older students, the reading plan divides the text into a different number of reading units such that each student receives one or two long passages of text to read during the group reading session. In some embodiments, the number of reading units generated by the device depends on the number of participants identified for the group reading session. For example, the number of reading units is optionally a multiple of the number of participants.
In some embodiments, the reading ability level is measured by a combination of several different scores, each measuring a respective aspect of a user's reading ability, such as vocabulary, pronunciation, comprehension, emotion, speed, fluency, prosody, etc. In some embodiments, the difficulty level of the text and/or the difficulty of the reading units are also measured by a combination of several different scores, each measuring a respective aspect of the reading unit's reading accessibility, such as length, vocabulary, structural complexity, grammar complexity, emotion, pronunciation, etc. In some embodiments, the reading ability level of the user and the reading difficulty level of the reading unit are measured by a matching set of measures (e.g., vocabulary, grammar, and complexity).
In some embodiments, automatically generating the reading plan further includes the following operations (408-414, and 416-424).
In some embodiments, the primary user device determines (408) one or more respective reading assessment scores for each of the plurality of participants. For example, the reading assessment scores are optionally the grades for each participant for a class. In another example, the reading assessment scores are optionally generated based on an age, class year, or education level of each participant. In another example, the reading assessment scores are optionally generated based on evaluation of past performances in prior group reading sessions. In some embodiments, the reading assessment scores for each participant are provided to the user device in the form of a file.
In some embodiments, the primary user device divides (410) the text into a plurality of contiguous portions according to the respective reading assessment scores of the plurality of participants. For example, if a majority of participants have low reading assessment scores, the primary user device optionally divides the text into portions that are relatively easy for the majority of participants, and leaves only one or more difficult portions for the few participants that have relatively high reading assessment scores.
In some embodiments, the primary user device analyzes (412) each of the plurality of portions to determine one or more respective readability scores for the portion. In some embodiments, the primary user device assigns (414) each of the plurality of portions to a respective one of the plurality of participants according to the respective readability scores for the portion and the respective reading assessment scores of the participant.
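The following Python sketch walks through operations 408-414 end to end under stated assumptions: the assessment and readability scores are toy single-number placeholders (the description contemplates richer, multi-aspect scores), and the division is a simple split into contiguous sentence runs:

```python
def readability_score(portion):
    # Placeholder readability measure (step 412): average word length
    # stands in for the multi-aspect difficulty scores described above.
    words = portion.split()
    return sum(len(w) for w in words) / max(len(words), 1)

def divide_text(sentences, n_units):
    # Step 410: contiguous division into n_units roughly equal runs.
    per_unit, extra = divmod(len(sentences), n_units)
    units, start = [], 0
    for i in range(n_units):
        end = start + per_unit + (1 if i < extra else 0)
        units.append(" ".join(sentences[start:end]))
        start = end
    return units

def assign_units(units, abilities):
    # Step 414: give the hardest unit to the strongest reader, and so on.
    ranked_units = sorted(units, key=readability_score, reverse=True)
    ranked_readers = sorted(abilities, key=abilities.get, reverse=True)
    return dict(zip(ranked_readers, ranked_units))

abilities = {"Alice": 0.9, "Max": 0.5, "John": 0.7}   # step 408 scores
sentences = ["A cat sat.", "It was a very windy day.",
             "Photosynthesis converts sunlight into chemical energy."]
print(assign_units(divide_text(sentences, n_units=3), abilities))
```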
In some embodiments, the primary user device provides several reading assignment modes for selection by the user for each participant. In some embodiments, the primary user device provides (416) at least two of a challenge mode, a reinforcement mode, and an encouragement mode for selection by the user for each participant. For example, as shown in
Referring now to
In some embodiments, a single mode selection is optionally applied to all or multiple participants in the group reading session. In some embodiments, the assignment of reading units in the challenge mode aims to be somewhat challenging to a participant in at least one aspect measured by the primary user device, while the assignment of reading units in the encouragement mode aims to be somewhat easy or accessible to a participant in all aspects measured by the primary user device. In some embodiments, the assignment of reading units in the reinforcement mode aims to provide reinforcement in at least one aspect measured by the primary user device in which the participant has shown recent improvement. In some embodiments, more or fewer assignment modes are provided by the primary user device. In some embodiments, a respective assignment mode need not be specified for all participants of the group reading session.
In some embodiments, in accordance with a user selection of the challenge mode for the respective one of the plurality of participants, the primary user device selects (420) a reading unit that has a respective difficulty level higher than the respective reading ability level of the respective participant. In some embodiments, in accordance with a user selection of the reinforcement mode for the respective one of the plurality of participants, the primary user device selects (422) a reading unit that has a respective difficulty level comparable or equal to the respective reading ability level of the respective participant. In some embodiments, in accordance with a user selection of the encouragement mode for the respective one of the plurality of participants, the primary user device selects (424) a reading unit that has a respective difficulty level lower than the respective reading ability level of the respective participant.
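A minimal sketch of this mode-dependent selection (steps 420-424), assuming each reading unit has already been reduced to a single difficulty number comparable with the participant's ability level; the fallback to the closest unit when nothing strictly qualifies is an added assumption:

```python
def pick_unit(unit_difficulties, ability, mode):
    """Select a reading unit for one participant by assignment mode."""
    if mode == "challenge":        # step 420: harder than the reader
        pool = {u: d for u, d in unit_difficulties.items() if d > ability}
    elif mode == "reinforcement":  # step 422: comparable to the reader
        pool = dict(unit_difficulties)
    elif mode == "encouragement":  # step 424: easier than the reader
        pool = {u: d for u, d in unit_difficulties.items() if d < ability}
    else:
        raise ValueError(f"unknown assignment mode: {mode}")
    pool = pool or dict(unit_difficulties)  # fall back if nothing qualifies
    return min(pool, key=lambda u: abs(pool[u] - ability))

units = {"unit-518": 0.3, "unit-520": 0.6, "unit-522": 0.9}
print(pick_unit(units, ability=0.6, mode="challenge"))      # unit-522
print(pick_unit(units, ability=0.6, mode="encouragement"))  # unit-518
print(pick_unit(units, ability=0.6, mode="reinforcement"))  # unit-520
```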
In some embodiments, additional modes are provided for selection by the user to influence the division of the selected text into appropriate reading units, and the assignments of the reading units to the plurality of participants. For example, as shown in
In some embodiments, the user is allowed to select more than one division mode for a particular group reading session, and the primary user device divides the text in accordance with all of the selected division modes. In some embodiments, a priority order is used to break the tie if a conflict arises due to the concurrent selection of multiple division modes.
As shown in
In some embodiments, the reading plan review interface 514 includes the participant information of the group reading session. In some embodiments, the reading plan review interface 514 optionally presents the reading assessment scores for each participant. In some embodiments, the reading plan review interface 514 optionally includes the division and/or assignment modes used to divide and assign the reading units for the group reading session (not shown).
As shown in
In some embodiments, the user is optionally allowed to move the beginning and/or end points of each reading unit, and/or to change the assignment of the reading unit manually. As shown in
In some embodiments, the group reading plan is stored as an index file specifying the respective beginning and end points of the reading units, and the assigned participant for each reading unit. In some embodiments, the primary user device generates the reading plan review interface 514 based on the index file, and revises the index file based on input received in the reading plan review interface 514. In some embodiments, the reading plan review interface 514 optionally includes a user interface element for sending the reading assignments to the participants before the group reading session. In some embodiments, to ensure that each participant prepares for reading the entire text, the assignment is not made known to the participant until the beginning of the group reading session.
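One plausible shape for such an index file is sketched below; the JSON schema, field names, and offsets are illustrative assumptions, since the description specifies only that the file records the beginning and end points of the reading units and their assigned participants:

```python
import json

reading_plan = {
    "text_id": "story-123",
    "units": [
        # Character offsets delimiting each reading unit, plus its reader.
        {"unit": 518, "begin": 0,   "end": 412,  "reader": "Alice"},
        {"unit": 520, "begin": 412, "end": 797,  "reader": "Max"},
        {"unit": 522, "begin": 797, "end": 1088, "reader": "John"},
    ],
}

with open("reading_plan.json", "w") as f:
    json.dump(reading_plan, f, indent=2)  # the stored index file
```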
Referring back to
Referring to
In some embodiments, during the group reading session, the primary user device performs (434) the following operations to facilitate the reading transition from participant to participant during the reading.
In some embodiments, for a pair of consecutive reading units (e.g., for each pair of consecutive reading units) in the plurality of reading units, the primary user device identifies (436) a first client device corresponding to a first participant assigned to read the first reading unit of the pair of consecutive reading units, and a second client device corresponding to a second participant assigned to read a second reading unit of the pair of consecutive reading units. For example, according to the reading plan shown in
In some embodiments, the primary user device sends (438) a first start signal to the first client device, the first start signal causing a first reading prompt to be displayed at a respective start location of the first reading unit currently displayed at the first client device. As shown in
In some embodiments, at the start of the group reading session, the entirety of the text to be read in the group reading session has been displayed on each participant's respective device, so that all participants can see the text on their respective devices. When the first reading prompt is displayed on the first client device 604 and not on the client devices 606 and 608 operated by the other participants (e.g., Max and John), Alice knows that it is her turn to read the highlighted reading unit aloud, while the other participants listen to her reading.
In some embodiments, the primary user device 602 monitors (440) progress of the reading based on a speech signal received from the first participant. For example, in some embodiments, the speech signal from the first participant (e.g., Alice) can be captured by a microphone of the first client device 604, and forwarded to the primary user device 602, where the primary user device 602 processes the speech signal (e.g., using speech-to-text) to determine the progress of the reading through the first reading unit. In such embodiments, the client devices (e.g., client device 604) are not required to perform the speech-to-text processing onboard, which can require a substantial amount of memory and processing resources. In some embodiments, the primary user device 602 captures the speech signal directly from the first participant (e.g., Alice) when the participant is located sufficiently close to the primary user device 602 (e.g., in the same room).
In some embodiments, the first client device 604 captures the speech signal from the first participant, processes the speech signal against the first reading unit to determine the progress of the reading, and sends the result of the monitoring to the primary user device 602. In such embodiments, the individual client device only needs to consider the text within the reading unit when processing the speech signal. Therefore, the processing and resource requirement on the individual client device is relatively small.
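A small sketch of this client-side variant, under the assumption that the local STT function emits a stream of recognized words; only the words of the assigned reading unit need to be considered, which keeps the matching cheap:

```python
import re

def tokenize(text):
    return re.findall(r"[a-z']+", text.lower())

def reading_progress(unit_text, spoken_words):
    """Advance a cursor through the unit for each matching spoken word;
    the cursor position over the unit length approximates progress."""
    unit_words = tokenize(unit_text)
    cursor = 0
    for word in spoken_words:
        if cursor < len(unit_words) and word == unit_words[cursor]:
            cursor += 1
    return cursor / max(len(unit_words), 1)

unit = "The quick brown fox jumps over the lazy dog"
heard = tokenize("the quick brown fox jumps")  # output of the local STT
print(f"{reading_progress(unit, heard):.0%} of the unit read")  # 56%
```

The result of this monitoring, rather than the raw speech signal, is what the client device reports to the primary user device.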
In some embodiments, as the primary user device monitors the progress of the reading by the first participant, the primary user device optionally sends signals to the other client devices regarding the reading by the first participant. For example, the primary user device 602 optionally sends additional signals regarding the pronunciation, speed, and emotion detected in the speech signal reading the first reading unit to the first client device 604, and/or other client devices (e.g., devices 606 and 608) in the group reading session. In response to these additional signals, the receiving client devices optionally display pop-up notes, highlighting, hints, dictionary definitions, and other visual information (e.g., a bouncing ball) related to the text and the first participant's reading of the first reading unit.
Referring back to
For example, as shown in
As shown in
In some embodiments, the primary user device 602 treats the second client device 606 (e.g., Max's device) as the first client device for the next pair of consecutive reading units 520 and 522 in the selected text, and monitors the progress of the reading by Max. When Max has completed the reading of the second reading unit 520 (i.e., the earlier one of the pair of reading units 520 and 522), the primary user device 602 sends a second stop signal to Max's device to cause the removal of the second reading prompt from Max's device 606. The primary user device 602 further sends a third start signal to the third client device 608 operated by the next participant, John, who has been assigned the latter reading unit 522 of the next pair of consecutive reading units 520 and 522. In response to receiving the third start signal, the third client device 608 displays the reading prompt at the start of the reading unit 522 currently displayed at client device 608. This process can continue as the participants read through the reading units in the text one by one, and the reading prompt hops from one client device to the next according to the assignment specified in the group reading plan (e.g., reading plan 516). In some embodiments, one participant may be assigned multiple non-consecutive reading units, and the reading prompt returns to the device of that participant when it is his or her turn to read one of the assigned reading units.
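Viewed from the primary user device, the hand-off reduces to a loop over the ordered reading plan; the sketch below assumes hypothetical send_signal and wait_for_completion helpers standing in for the device's actual transport and speech monitoring:

```python
def send_signal(device, signal, unit):
    # Stand-in for the actual transport between devices.
    print(f"-> {device}: {signal} signal for reading unit {unit}")

def wait_for_completion(device, unit):
    pass  # stand-in for STT-based monitoring of the reader's speech

def run_session(plan):
    for unit, device in plan:
        send_signal(device, "start", unit)  # reading prompt appears
        wait_for_completion(device, unit)
        send_signal(device, "stop", unit)   # reading prompt is removed
        # The loop's next iteration makes the prompt "hop" onward.

# Assignment order taken from the example above: Alice, Max, then John.
run_session([(518, "Alice's device"), (520, "Max's device"),
             (522, "John's device")])
```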
Referring back to
For example, in
Referring back to
Referring to
In some embodiments, the primary user device sends the same first error signal to each of the client devices of the participants who are not currently reading, and causes the first visual indication of the reading error to be displayed on these client devices. For example, if the primary user device detects that the first participant (e.g., Alice) has mispronounced or misread a particular word in the first reading unit 518, the primary user device sends an error signal to the second user device (e.g., the device operated by Max) to alert the listening participant (e.g., Max) that the particular word has been mispronounced or misread. Optionally, the mispronounced/misread word is highlighted on the devices of the listening participants, and the correct pronunciation is visually indicated on those devices.
In some embodiments, during the group reading session, the primary user device, upon detecting the at least one reading error in the speech signal of the first participant (e.g., Alice), sends (460) a second error signal to the first client device (e.g., device operated by Alice), where the second error signal causes a second visual indication of the reading error to be displayed at the location of the reading error in the first reading unit (e.g., reading unit 518) currently shown at the first client device. For example, if the primary user device detects that the first participant (e.g., Alice) has mispronounced or misread a particular word in the first reading unit (e.g., reading unit 518), the primary user device sends an error signal to the first user device to alert the current reader (e.g., Alice) that the particular word has been mispronounced or misread. Optionally, the mispronounced/misread word is highlighted on the device of the current reader, and the correct pronunciation is visually indicated on the device, such that the reader is aware of the error, and may re-read the incorrect portion of the first reading unit.
In some embodiments, the primary user device provides (462) one or more hints to the first client device to help the first participant to correctly read through a respective portion of the first reading unit (e.g., a portion in which a reading error has been made and/or a portion for which reading speed has slowed down). In some embodiments, more or fewer hints are provided depending on whether the first participant is in the challenge mode, the reinforcement mode, or the encouragement mode. For example, fewer hints are provided to participants reading in the challenge mode, while more hints are provided to participants reading in the encouragement mode. In some embodiments, the primary user device dynamically adjusts the number and/or type of hints provided based on an evaluation of the reading by the current reader (e.g., Alice).
In some embodiments, the primary user device provides two different kinds of error signals (e.g., the first error signal and the second error signal) to the listening participants' devices and the current reader's device, respectively. In some embodiments, the first error signal causes (464) immediate display of the first visual indication of the reading error at the second client device (i.e., the listening participant's device), while the second error signal causes delayed display of the second visual indication of the reading error at the first client device (i.e., the current reader's device) until after the reading of the first reading unit is completed by the first participant.
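The asymmetry between the two error signals can be sketched as follows; the class and method names are hypothetical, and the point is only that the reader's device defers display until the unit is finished while a listener's device displays immediately:

```python
class ReaderDevice:
    """Receives the second error signal: display is deferred (step 464)."""

    def __init__(self):
        self.pending_errors = []

    def on_error_signal(self, word):
        self.pending_errors.append(word)  # queue; don't interrupt the reader

    def on_unit_complete(self):
        for word in self.pending_errors:
            print(f"reader sees (after finishing): '{word}' was misread")
        self.pending_errors.clear()

class ListenerDevice:
    """Receives the first error signal: display is immediate."""

    def on_error_signal(self, word):
        print(f"listener sees now: '{word}' was misread")

reader, listener = ReaderDevice(), ListenerDevice()
listener.on_error_signal("acquaintance")
reader.on_error_signal("acquaintance")
reader.on_unit_complete()
```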
In some embodiments, in addition to or in lieu of a group reading session in which the participants read aloud the text units displayed in front of them, the same process 400 or a similar process is optionally used to facilitate a group reading session in which the participants recite the text units assigned to them without seeing the text units displayed in front of them during their respective recitations. This is particularly useful for learning and reciting lines for a play or other theatrical performances. For example, in some embodiments, while text is displayed on the devices of all participants at the beginning of the reading session, as soon as a first start signal is sent to the first client device, the text of the first reading unit is obfuscated (e.g., the first reading prompt optionally blocks the text of the first reading unit) on the first client device. While the first participant recites aloud from memory the text of the first reading unit, the text of the first reading unit continues to be displayed on the devices of the listening participants. When the recitation of the first reading unit is completed by the first participant, the primary user device sends a stop signal to the first client device, and the first client device removes the reading prompt. Upon removal of the first reading prompt, the text of the first reading unit is revealed again on the first client device. At the same time, the primary user device sends a second start signal to the second client device, and the second start signal causes a second reading prompt to be displayed on the second client device and causes the text of the second reading unit to be obfuscated on the second client device. The second participant can then start reciting the second reading unit aloud, while the other participants listen with the text of the second reading unit displayed on their respective devices.
In some embodiments, recitation errors are detected by the primary user device, and error signals are sent to the listening participants' devices and/or the device of the participant who is currently reciting his/her assigned reading unit. In some embodiments, the primary user device sends an error signal to the device of the participant who is currently performing the recitation, and the device, upon receiving the error signal, displays the recitation error to that participant. For example, in some embodiments, only the words that were recited incorrectly are shown on the device of that participant.
In some embodiments, the primary user device displays the reading plan review interface 514 shown in
Referring to
In some embodiments, e.g., as shown previously in
In the exemplary process 700, a first client device associated with a first user (e.g., the client device 604 associated with Alice shown in
In some embodiments, upon successful registration, the first client device receives (704) at least a partial reading plan from the server. In some embodiments, the reading plan divides the text to be read in the reading session into a plurality of reading units and assigns at least a first reading unit (e.g., reading unit 518 in
In some embodiments, upon receiving a first start signal for the reading of the first reading unit, the first client device displays (706) a first reading prompt at a respective start location of the first reading unit currently displayed at the first client device. In some embodiments, if the first client device is operated by a participant who is assigned the very first reading unit of the text, the first client device optionally receives the start signal from the server (e.g., the primary user device). In some embodiments, if the first client device is operated by a participant who is assigned a reading unit after the very first reading unit of the text, the first client device optionally receives the start signal from the respective device of the participant who has been assigned the immediately preceding reading unit. For example, as shown in
In some embodiments, the first client device monitors (708) the progress of the reading of the first reading unit based on a speech signal received from the first user. In some embodiments, the first client device captures the speech signal directly from the first user, e.g., using a microphone coupled to the first client device. In some embodiments, the first client device converts the captured speech signal to text using a local STT function, and compares the converted text to the text of the first reading unit to determine the progress of the reading. In some embodiments, the first client device sends the speech signal to the server, and receives updates from the server regarding the progress of the reading. In some embodiments, other methods of monitoring the progress of the reading of the first reading unit are possible.
In some embodiments, in response to detecting that the reading of the first reading unit has been completed, the first client device performs (710) the following operations (712-714).
In some embodiments, in response to detecting that the reading of the first reading unit has been completed, the first client device ceases (712) to display the first reading prompt at the first client device. In some embodiments, in response to detecting that the reading of the first reading unit has been completed, the first client device further sends (714) a second start signal to a second client device associated with the second user (i.e., the user that is assigned to read the latter reading unit in the pair of consecutive reading units). The second start signal causes a second reading prompt to be displayed at a respective start location of the second reading unit currently displayed at the second client device. For example, as shown in
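The sketch below traces this peer-to-peer hand-off (steps 706-714): each client shows its prompt on a start signal, monitors the reading, then removes the prompt and forwards a start signal directly to the next client rather than through the server. The class and its methods are illustrative assumptions:

```python
class ClientDevice:
    """Toy client that forwards the start signal peer-to-peer."""

    def __init__(self, user, next_device=None):
        self.user = user
        self.next_device = next_device

    def on_start_signal(self):
        print(f"{self.user}: reading prompt shown")    # step 706
        self.monitor_reading()                         # step 708
        print(f"{self.user}: reading prompt removed")  # step 712
        if self.next_device is not None:               # step 714
            self.next_device.on_start_signal()

    def monitor_reading(self):
        pass  # stand-in for speech-based progress monitoring

max_device = ClientDevice("Max")
alice_device = ClientDevice("Alice", next_device=max_device)
alice_device.on_start_signal()  # prompt shows for Alice, then hops to Max
```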
Referring back to
In some embodiments, the first client device detects (718), based on the speech signal received from the first user, that the reading of the first reading unit is approaching completion. In response to detecting that the reading of the first reading unit is approaching completion, the first client device sends (720) a get-ready signal to the second client device, where the get-ready signal causes a get-ready prompt to be displayed at the respective start location of the second reading unit currently displayed at the second client device. For example, in
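Detecting that the reading is “approaching completion” could be as simple as a progress threshold; the 90% figure below is an assumption for illustration, as the description does not name a value:

```python
GET_READY_THRESHOLD = 0.9  # assumed; not specified in the description

def maybe_send_get_ready(progress, already_sent):
    """Send the get-ready signal once, near the end of the unit (718-720)."""
    if progress >= GET_READY_THRESHOLD and not already_sent:
        print("-> next device: show get-ready prompt at start of next unit")
        return True
    return already_sent

sent = False
for progress in (0.50, 0.85, 0.92, 1.00):  # successive progress updates
    sent = maybe_send_get_ready(progress, sent)
```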
In some embodiments, the first reading prompt moves (722) through the first reading unit currently shown at the first client device in accordance with the progress of the reading by the first user.
In some embodiments, during the group reading session, the first client device processes (724) a speech signal received from the first user and evaluates the reading of the first user based on the speech signal. In some embodiments, the first client device detects (726) at least one reading error in the speech signal of the first participant in light of the first reading unit. In some embodiments, upon detecting the at least one reading error, the first client device displays (728) a first reading aid at a location of the reading error in the first reading unit currently shown at the first client device. For example, when the first client device detects a pronunciation error, a missed word, an added word, a misread word, an incorrect segmentation of a phrase or sentence, inappropriate reading speed, and/or incorrect emotion or prosody, etc., in the speech signal received from the first participant in light of the text in the first reading unit, the first client device displays a reading aid to help the first participant correct the reading error. The reading aid includes one or more of: a phonetic spelling of the mispronounced word, highlighting of a missed or mispronounced word, and visual aids indicating the correct emotion, prosody, segmentation, and/or speed of the reading through a phrase or passage.
In some embodiments, during the group reading session, the first client device, upon detecting the at least one reading error, sends (730) an error signal to the second client device (and one or more other client devices and/or the server). The error signal causes a visual indication of the reading error to be displayed at a location of the reading error in the first reading unit currently shown at the second client device (and the one or more other client devices and/or the server). In some embodiments, the number and types of error signals generated during each participant's reading are optionally used to evaluate the participant's reading ability level in one or more aspects, and to generate various reading ability scores for the participant.
In some embodiments, the group reading session is conducted in a read-aloud mode, in which each participant reads aloud the text of his/her assigned reading unit presented in front of him/her. In some embodiments, the group reading session is conducted in a recitation mode, in which each participant recites out loud the text of his/her assigned reading unit while the text is obfuscated in front of him/her. In some embodiments, some participants read their respective assigned reading units in the read-aloud mode, while other participants read their respective assigned reading units in the recitation mode. For example, in a group rehearsal, some actors/actresses may want to read their lines aloud with their scripts open in front of them, while other actors/actresses may wish to practice reciting their lines aloud without their scripts open in front of them.
In some embodiments, each participant is allowed to turn on the read-aloud mode or the recitation mode by providing a reading mode selection input at his/her respective device. In some embodiments, the server optionally provides the respective reading mode selection input to each client device, and the user of the server device controls which participants will read in the read-aloud mode and which participants will read in the recitation mode.
In some embodiments, each reading unit is assigned a respective reading mode. If a particular reading unit is assigned a read-aloud mode, the reading prompt presented for the particular reading unit highlights the particular reading unit currently displayed at the respective client device of its assigned reader. If a particular reading unit is assigned a recitation mode, the reading prompt presented for the particular reading unit visually obfuscates the particular reading unit currently displayed at the respective client device of the assigned reader. In some embodiments, the mode of the reading is defined in the reading plan, e.g., by the creator of the reading plan using the reading plan generator interface.
In some embodiments, the first client device determines (736) a respective reading mode assigned to the reading of the first reading unit, the reading mode being one of a read-aloud mode and a recitation mode. In some embodiments, in accordance with a determination that the first reading unit is assigned the read-aloud mode, the first client device provides (738) the first reading prompt to highlight the first reading unit currently displayed at the first client device. In accordance with a determination that the first reading unit is assigned the recitation mode, the first client device provides (740) the first reading prompt to visually obfuscate the first reading unit currently displayed at the first client device.
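A compact sketch of operations 736-740 follows; the highlight and obfuscation renderings are stand-ins (bracketing and block characters) for whatever visual treatment a real display layer would apply.

```python
READ_ALOUD, RECITATION = "read-aloud", "recitation"

def render_prompt(unit_text: str, mode: str) -> str:
    """Render the reading prompt per the unit's assigned mode."""
    if mode == READ_ALOUD:
        return f"[{unit_text}]"            # stand-in for highlighting
    if mode == RECITATION:
        return "\u2588" * len(unit_text)   # visually obfuscate the text
    raise ValueError(f"unknown reading mode: {mode}")

print(render_prompt("To be, or not to be", RECITATION))
```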
In some embodiments, after the reading of the first reading unit has been completed by the first user, the first client device receives (742) a reading assessment summary for the reading of the first reading unit, the reading assessment summary identifying one or more areas needing improvement for the first user. In some embodiments, the first client device receives (744) a customized reading assignment for the first user according to the identified one or more areas needing improvement.
In some embodiments, the reading assessment summary also identifies one or more areas in which the first user has performed well and is worthy of encouragement or commendation. In some embodiments, the reading assessment summary is provided by the server device. In some embodiments, for each participant, the server device optionally receives comments from respective devices of other participants during the group reading session, and the server device optionally incorporates these comments into the reading assessment summary of the participant.
In some embodiments, after the group reading session, each client device is optionally used to monitor reading of the customized assignments by its respective user. In some embodiments, after the group reading session, the first client device receives (746) additional speech signals from the first user reading the customized reading assignment. In some embodiments, the first client device processes (e.g., using speech-to-text conversion and/or other means) (748) the additional speech signals to determine if the reading of the customized reading assignment is satisfactory. In some embodiments, the first client device sends (750) a report to the server regarding the reading of the customized reading assignment by the first user.
In the process 800, the user device receives (802) a first reading assignment comprising text to be read or recited aloud by a user. In some embodiments, the first reading assignment is a reading assignment received from a server device (e.g., an instructor's device) after a group reading session. In some embodiments, the first reading assignment is received from a server device without the user having participated in a group reading session. In some embodiments, the first reading assignment is selected by the user on the user device, e.g., according to his/her own interest or at the instruction of his/her instructor.
In some embodiments, if the first reading assignment is to be read aloud by the user, the user device displays the text of the first reading assignment to the user. In some embodiments, if the first reading assignment is to be recited by the user, the user device displays the text of the first reading assignment during a preparation period, and obfuscates the text or at least portions of the text after the preparation period has ended. In some embodiments, the user device selectively displays some text, and obfuscates other text in accordance with input received from the user.
In some embodiments, the user device receives (804) a first speech signal from the user reading or reciting the text of the first reading assignment. For example, in some embodiments, the user device captures the speech uttered directly by the user using a microphone coupled to the user device. In some embodiments, the device performing the process (e.g., an instructor's device or a server device) receives the first speech signal from another device (e.g., a device operated by the user) that directly captures the speech uttered by the user. In some embodiments, the speech signal is a recording of the speech uttered by the user, and is sent to the user device at a later time. In some embodiments, the speech signal is received by the user device in real-time as the user is speaking.
In some embodiments, if the first reading assignment is for the user to read aloud, the user device highlights each respective portion or word in the text at the moment that the user reads that portion or word in the text. In some embodiments, the visual indication (e.g., underline, a bouncing ball icon) moves synchronously through the displayed text, as the user reads through the text aloud. In some embodiments, if the reading stops at a particular location in the text for more than a predetermined amount of time (e.g., 2 seconds), the user device automatically enters a bookmark at that location in the text. In some embodiments, the user device optionally receives and stores textual input or other annotative inputs (e.g., drag and drop of photos, documents, notes, web pages, hyperlinks, etc.) in association with the bookmark inserted at that particular location.
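The automatic pause bookmark could be driven by a simple timer keyed to the last matched word, as in the following sketch; the event hooks are assumptions, and only the 2-second threshold comes from the example above.

```python
import time

PAUSE_THRESHOLD = 2.0  # seconds, per the example above

class PauseBookmarker:
    """Insert a bookmark when reading stalls at one text position."""
    def __init__(self):
        self.position = 0
        self.last_word_time = time.monotonic()
        self.bookmarks = []

    def on_word_matched(self, position: int) -> None:
        self.position = position
        self.last_word_time = time.monotonic()

    def poll(self) -> None:
        """Call periodically from the UI loop."""
        if time.monotonic() - self.last_word_time > PAUSE_THRESHOLD:
            self.bookmarks.append(self.position)
            self.last_word_time = time.monotonic()  # avoid duplicates
```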
In some embodiments, the user device processes the speech signal against the text in the first reading assignment, and provides a first type of visual enhancement (e.g., highlighting, bolding, or changing text or background color) for correctly pronounced words. In some embodiments, the user device processes the speech signal against the text in the first reading assignment, and provides a second type of visual enhancement (e.g., highlighting, bolding, or changing text or background color) for incorrectly pronounced words. In some embodiments, the user device detects one or more missed words in the speech signal, and provides a third type of visual enhancement for the missed words. In some embodiments, the user device detects one or more added words in the speech signal, and provides a fourth type of visual enhancement for the portion of text in which the extraneous words have been added.
In some embodiments, the user device detects extraneous fillers (e.g., empty, extraneous sounds or words that pad a sentence without adding any additional meaning, such as “I mean,” “sort of,” “ya know?” “well,” “umm,” “uh,” “like,” and equivalents in other languages) in the speech signal, and displays a visual alert for the user each time a filler is detected in real-time as the user is speaking. In some embodiments, the user device monitors the speed at which the text is read aloud, and displays a visual indicator for the user to slow down or speed up based on the actual speed at which the text is being read aloud by the user.
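Filler detection over a live transcript might look like the sketch below, which checks single words and two-word fillers against small illustrative lists; a real detector would need context (e.g., “like” is not always a filler).

```python
FILLER_UNIGRAMS = {"um", "umm", "uh", "like", "well"}
FILLER_BIGRAMS = {("i", "mean"), ("ya", "know"), ("sort", "of")}

def filler_alerts(tokens):
    """Yield (index, filler) for each filler found in the transcript."""
    i = 0
    while i < len(tokens):
        pair = tuple(tokens[i:i + 2])
        if pair in FILLER_BIGRAMS:
            yield i, " ".join(pair)
            i += 2
        elif tokens[i] in FILLER_UNIGRAMS:
            yield i, tokens[i]
            i += 1
        else:
            i += 1

print(list(filler_alerts("well i mean the um story".split())))
# [(0, 'well'), (1, 'i mean'), (4, 'um')]
```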
In some embodiments, the user device monitors the progress of the reading, and provides visual prompts to the user to change the intonation and/or emotion of the reading in real-time. For example, if the reading assignment is a script for a play, the reading assignment optionally associates respective predetermined emotions, accents, voice quality, and/or intonations with different portions of the text. In some embodiments, the user device optionally provides prompts (e.g., visual indicators or pop-up notes) for the desired emotions, accents, voice quality, and/or intonations associated with a particular portion of text, e.g., at a location proximate to the particular portion of text, and/or as the reading has almost reached that portion of the text.
In some embodiments, the user device evaluates (806) the first speech signal against the text of the first reading assignment to identify one or more areas for improvement. For example, the first reading assignment is optionally associated with various standards for pronunciation, accents, speed, intonation, emotion, voice quality, fidelity to the text, loudness, and/or pitch, etc., for various portions or the entirety of the text in the first reading assignment. In some embodiments, the user device evaluates the first speech signal against the text for one or more of these various standards.
In some embodiments, if the user's speech signal meets or exceeds the standards established for a number of required aspects, the user device displays a visual indication of successful completion of the reading assignment by the user. In some embodiments, if the user's speech signal does not meet the standard(s) established for one or more required aspects, the user device identifies these aspects as respective areas for improvement. For example, if the user has mispronounced a particular word, or a particular category of words (e.g., words containing the letters “th,” words containing a silent “e” or “p,” or words containing accented letters, etc.), the user device identifies pronunciation of the particular word or particular category of words as an area for improvement. In another example, if the user has read one or more portions of the first reading assignment faster or slower than the standard established for all or some portions of the text, the user device identifies the reading speed or familiarity with the text as an area for improvement. In some embodiments, if the user has read one or more portions of the first reading assignment with an emotion or voice quality different from the standard established for those portions of the text, the user device identifies the emotion or voice quality as an area for improvement for those portions. In some embodiments, if the user has spoken more filler words or made more inappropriate pauses during the reading or recitation of the first reading assignment than the standard established for fillers and pauses allows, the user device identifies the use of fillers and pauses as an area for improvement. Other examples of areas for improvement are possible.
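The per-aspect comparison could reduce to checking measured metrics against the standards associated with the assignment, as in this sketch; the aspect names, metric units, and thresholds are all illustrative.

```python
# Hypothetical standards for one assignment.
STANDARDS = {"pronunciation": 0.95, "speed_wpm": (110, 160), "fillers": 3}

def areas_for_improvement(metrics: dict) -> list:
    """Compare measured reading metrics against the standards."""
    areas = []
    if metrics["pronunciation"] < STANDARDS["pronunciation"]:
        areas.append("pronunciation")
    lo, hi = STANDARDS["speed_wpm"]
    if not lo <= metrics["speed_wpm"] <= hi:
        areas.append("reading speed")
    if metrics["fillers"] > STANDARDS["fillers"]:
        areas.append("use of fillers and pauses")
    return areas

print(areas_for_improvement(
    {"pronunciation": 0.90, "speed_wpm": 180, "fillers": 1}))
# ['pronunciation', 'reading speed']
```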
In some embodiments, based on the evaluating, the user device generates (808) a second reading assignment providing additional practice opportunities tailored to the identified one or more areas for improvement. For example, if the identified area for improvement is the pronunciation of a particular word or category of words, the user device optionally generates a second reading assignment comprising drills and reading exercises that contain that particular word or category of words but differ from the text in the first reading assignment. For example, if the user has trouble pronouncing the words “these” and “those” in the first reading assignment, the user device generates more textual drills containing the words “these” and “those” in different sentences. In another example, the user device generates more textual drills containing other words that contain the letters “th.”
In some embodiments, if the identified area for improvement is the speed at which the user has read one or more portions of the first reading assignment, the user device optionally generates a second reading assignment that is longer or shorter than the first assignment. For example, if the user is practicing a timed public speech based on the first reading assignment and is speaking too fast, the user device optionally generates a second reading assignment that removes some non-essential content of the first reading assignment. When the user reads the second reading assignment under timed conditions, the user would feel the pressure to slow down for fear of the awkward silence at the end. Once the user has gained a feel for the slower reading speed, the user can practice reading the first reading assignment again at the newly achieved slower speed. In contrast, if the user is practicing a timed public speech based on the first reading assignment and is speaking too slowly, the user device optionally generates a second reading assignment that expands the content of the first reading assignment. When the user reads the second reading assignment under timed conditions, the user would feel the pressure to speed up for fear of not finishing on time. Once the user has gained a feel for the faster reading speed, the user can practice reading the first reading assignment again at the newly achieved faster speed. In some embodiments, the user device evaluates the user's reading speed of the second reading assignment as the user practices reading the second reading assignment one or more times, and determines when it is appropriate to have the user read the first assignment again.
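The sizing of the second assignment can be reasoned about arithmetically: scaling the word count by the ratio of target pace to actual pace shrinks the text for a fast reader and expands it for a slow one. The formula below is a plausible sketch, not the described method.

```python
def second_assignment_words(first_words: int, limit_s: float,
                            actual_wpm: float) -> int:
    """Word count for the second assignment: shorter when the user
    reads too fast, longer when too slow, so the timed slot pressures
    a corrected pace."""
    target_wpm = first_words / (limit_s / 60)
    return round(first_words * target_wpm / actual_wpm)

# A 600-word speech in a 5-minute slot implies 120 wpm:
print(second_assignment_words(600, 300, 150))  # 480 (trim content)
print(second_assignment_words(600, 300, 100))  # 720 (expand content)
```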
In some embodiments, if the user device identifies the emotion conveyed in the user's voice as an area for improvement for particular portions of the first reading assignment, the user device generates a second reading assignment containing one or more additional passages having emotional content or voice-quality requirements similar to those of the portions of the first reading assignment for which the emotion was inappropriate or lacking.
In some embodiments, if the user device identifies the accent of the user's reading as an area for improvement, the user device generates a second reading assignment containing one or more additional passages having words reflective of the required accent and/or passages conveying a stereotypical impression of the required accent. For example, if the first assignment is to be read with an Italian accent and a tough edge, the second reading assignment is optionally the transcript of a dialogue from a famous movie (e.g., The Godfather) depicting tough Italian mafia characters.
In some embodiments, if the user device has identified the excessive use of filler words and pauses in the reading of the first reading assignment as an area for improvement, the user device optionally identifies a pattern in the occurrence of the fillers and pauses in the reading or recitation, and generates a second reading assignment that provides visual aids to help the user read through the text without conforming to that pattern. For example, at locations where the user is likely to insert a filler word, the user device optionally inserts a visual aid (e.g., visually reducing the spacing between two consecutive words in the text) encouraging the user to speak continuously without using a filler word or pause. In some embodiments, the user device optionally displays the filler word in the text of the second assignment at the location where the user is likely to insert the filler word, such that the user can consciously replace the filler word in his/her reading of the text with a short pause instead. In some embodiments, the user device determines that the presence of filler words or pauses indicates unfamiliarity with the text of the first reading assignment, and generates a second reading assignment that provides additional notes regarding the portions of text at which the filler words and pauses were spoken by the user.
In some embodiments, the user device provides (810) two or more practice modes for the second reading assignment, including at least two of a challenge mode, an encouragement mode, and a reinforcement mode. In some embodiments, the user device selects (812) reading materials of different levels of difficulty as the second reading assignment based on a respective practice mode selected for the second reading assignment.
In some embodiments, in accordance with a selection of the challenge mode for the second reading assignment, the user device selects reading materials that are more difficult than the first reading assignment in the identified one or more areas for improvement. In some embodiments, in accordance with a selection of the encouragement mode for the second reading assignment, the user device selects reading materials that are easier than the first reading assignment in the identified one or more areas for improvement. In some embodiments, in accordance with a selection of the reinforcement mode for the second reading assignment, the user device selects reading materials that are of similar difficulty as the first reading assignment in the identified one or more areas of improvement.
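Reduced to a sketch, the mode-dependent selection amounts to shifting a difficulty target relative to the first assignment; the 1-10 scale and unit offsets below are assumptions.

```python
CHALLENGE, ENCOURAGEMENT, REINFORCEMENT = (
    "challenge", "encouragement", "reinforcement")

def target_difficulty(first_difficulty: int, mode: str) -> int:
    """Difficulty level (1-10) for the second reading assignment."""
    offset = {CHALLENGE: 1, ENCOURAGEMENT: -1, REINFORCEMENT: 0}[mode]
    return max(1, min(10, first_difficulty + offset))

print(target_difficulty(5, CHALLENGE))       # 6: harder materials
print(target_difficulty(5, ENCOURAGEMENT))   # 4: easier materials
```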
In some embodiments, the instructor of the user optionally pre-selects the practice mode for the second reading assignment based on the identity of the user. In some embodiments, the user device automatically chooses the practice mode based on the user's performance in reading the first assignment. In some embodiments, if the user has performed fairly well in all aspects (though not perfectly), the user device automatically uses the challenge mode for the second reading assignment. In some embodiments, if the user has performed poorly in all aspects, the user device automatically uses the encouragement mode for the second reading assignment. In some embodiments, if the user has shown mixed performance in some aspects, the user device automatically selects the reinforcement mode for the second reading assignment.
In some embodiments, the user device detects (820) a reading error in the first speech signal reading or reciting the text of the first reading assignment. In some embodiments, in response to detecting the reading error, the user device automatically inserts (822) a bookmark at a location of the reading error in the text of the first reading assignment. In some embodiments, the user device displays all the bookmarks inserted into the text in the same user interface (e.g., a bookmark page) after the user's reading of the first reading assignment, such that the user can selectively review one or more of the reading errors at a later time.
In some embodiments, in response to detecting subsequent user selection of the bookmark (e.g., from the text of the first reading assignment currently shown on the user device, or from a bookmark page showing multiple reading error bookmarks), the user device presents (824) one or more study aids related to the reading error. In some embodiments, the study aids include one or more flash cards and/or notes showing the definitions, pronunciations, emotions, speed, accents, and/or prosody, etc. required for reading the portion of text at which the reading error had previously occurred. In some embodiments, the study aids include one or more recordings or demos of the correct reading.
In some embodiments, in response to detecting subsequent user selection of the bookmark, the user device presents (826) one or more additional reading exercises related to the reading error. For example, if the reading error is a pronunciation error of a particular word, selection of the bookmark optionally causes the correct pronunciation to be presented (e.g., played back as an audio clip, or shown as phoneme symbols) to the user.
In some embodiments, in response to detecting subsequent user selection of the bookmark, the user device visually enhances (e.g., highlights, bolds, animates, etc.) (828) a portion of the text in the first reading assignment that is related to the reading error. For example, if the user selects the bookmark for a particular reading error in the bookmark page, the user device displays a portion of the text from the first reading assignment that contains the location of the reading error, and visually highlights the text involved in the reading error.
In some embodiments, the user device receives (830) a second speech signal from the user. The user device stores (832) a recording of the second speech signal in association with the reading error. In response to detecting subsequent user selection of the bookmark, the user device plays back (834) the recording of the second speech signal. For example, after detecting a particular reading error in the user's reading of the first reading assignment, the user device generates a bookmark for the reading error, and allows the user to record a personal note for the reading error. Sometimes, the user may wish to record a personal note that is tailored to the user's particular pronunciation habit, or understanding of the text. Sometimes, the user may simply wish to record her best attempt at producing a satisfactory reading for this portion of the text after practicing and reading the study aids. This is a useful option for the user to distill the study aid information that has been shown to the user, and the multiple practices the user has performed, into a few key points in the user's own words, so that the user does not have to review all of the study aid information again in the future. In some embodiments, the user device presents a user interface element (e.g., a record-personal-note button) in the bookmark interface to start the recording of the second speech signal (e.g., a personal note).
In some embodiments, the user device sends (836) a report containing the one or more areas for improvement to a device operated by an instructor of the user. In some embodiments, the report optionally also contains the one or more reading errors made by the user.
In some embodiments, in addition to providing reading assignments to the user and evaluating the reading/recitation by the user, the user device also presents other types of assignments and questions to the user, and allows the user to provide answers in speech form. For example, after the reading or recitation of the first reading assignment, the user device optionally presents questions about the text in the first reading assignment, and checks on the user's comprehension of the text.
In some embodiments, the user device incorporates one or more multiple choice or short answer questions in an assignment, and the user device captures speech input from the user answering the multiple choice or short answer questions. Based on the speech input received from the user, the user device optionally determines whether the user has provided the correct answers to the multiple choice or short answer questions. Speech-to-text processing in these embodiments is relatively easy, since only a limited corpus of text (e.g., a corpus containing the letter choices for the multiple choice questions and/or correct answers to the short answer questions) needs to be used to perform the speech recognition. In some embodiments, the user device automatically grades the user's answers, and sends the grade report to the instructor. In some embodiments, the user device also stores and sends a recording of the user's answers to the instructor, e.g., for future evaluation and/or verification purposes.
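For instance, grading against a small answer corpus could be as simple as fuzzy-matching the transcript to the nearest choice, as in this sketch; the questions, choices, and matching behavior are illustrative stand-ins for the constrained recognition described above.

```python
import difflib

CHOICES = {"a": "the princess", "b": "the bear", "c": "the castle"}
ANSWER_KEY = {"question 1": "b"}

def grade(question: str, transcript: str) -> bool:
    """True if the spoken answer names the correct letter or its text."""
    t = transcript.lower().strip()
    correct = ANSWER_KEY[question]
    if t == correct:  # the user spoke the letter, e.g., "b"
        return True
    match = difflib.get_close_matches(t, list(CHOICES.values()), n=1)
    return bool(match) and CHOICES[correct] == match[0]

print(grade("question 1", "the bear"))  # True
```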
In some embodiments, the user device provides additional notes, links, and annotations in the answers, and the user can review these additional notes, links, and annotations when reviewing the answers to the multiple choice and/or short answer questions. In some embodiments, selection of the links by the user causes the user device to display a portion of a textbook or a portion of the first reading assignment that shows the correct answer.
Sometimes, a collaborative reading environment includes two or more participants in which a first participant has a more active role as compared to a second participant. For example, sometimes, a parent may read the text of a story to a child, while the child looks at a graphical illustration of the text that the parent is reading. Sometimes, the parent may read a more difficult portion of the story to the child, and let the child read a short and simple portion of the story back to the parent. Sometimes, two children may take turns reading aloud parts of a story, while each child is given an opportunity to change one or more aspects of the story (e.g., plot, characters, objects, location, time, etc.) while reading his/her part.
In the exemplary process 900, at a first device having one or more processors, memory, and a display, the first device displays (902) text of a first segment of a multi-segment textual document on the display of the first device. In some embodiments, the multi-segment textual document is one of a story, an article, a chapter in a textbook, a news article, the script of a play, and/or other document comprising passages of text that can be read aloud by a user. In some embodiments, the multiple segments of the textual document are based on natural divisions (e.g., sentences, chapters, sections, roles, sub-headings, etc.) that are present in the textual document. In some embodiments, the multiple segments are generated manually by a user, an editor or publisher of the textual document, or automatically by a software segmentation process.
In some embodiments, each segment of the multi-segment textual document is associated with one or more graphical illustrations. For example, in some embodiments, each scene of a story is associated with a respective graphical illustration depicting that scene. In another example, each section of an article is optionally associated with a respective diagram or figure illustrating the key content of the section. In yet another example, an article describing a process (e.g., an oil refining process) optionally includes respective text segments describing each of multiple stages of the process, and each stage is associated with a respective step shown in a flow diagram of the process.
In some embodiments, the text of the first segment of the multi-segment textual document includes one or more keywords each associated with a respective portion of a first graphical illustration for the first segment of the multi-segment textual document.
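Such a keyword-to-portion association could be stored as a simple mapping from keywords to illustration layers, with layers revealed in the order their keywords are detected; the keywords and file names below are purely illustrative.

```python
# Hypothetical keyword-to-layer mapping for one segment.
KEYWORD_LAYERS = {
    "princess": "princess.png",
    "castle": "castle.png",
    "forest": "forest_background.png",
}

def layers_to_reveal(recognized_words):
    """Illustration layers to draw, in keyword-detection order."""
    revealed, seen = [], set()
    for w in (w.lower().strip(".,!?") for w in recognized_words):
        if w in KEYWORD_LAYERS and w not in seen:
            seen.add(w)
            revealed.append(KEYWORD_LAYERS[w])
    return revealed

print(layers_to_reveal("the princess walked toward the castle".split()))
# ['princess.png', 'castle.png']
```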
In some embodiments, the first device displays (908) the first graphical illustration on the first device concurrently with the display of the text of the first segment of the multi-segment textual document; in other words, the first device shows the complete graphical illustration for the first segment of the textual document while the text is displayed on the first device. In some embodiments, the first device instead displays each portion of the first graphical illustration upon detecting the keyword associated with that portion of the first graphical illustration in the speech signal; that is, the first device gradually completes the graphical illustration for the first segment of the textual document as the user reads through the text of the first segment.
In some embodiments, the device 1004a and the device 1004b are not located in the vicinity of each other, and the device 1004a and the device 1004b communicate with each other remotely through one or more networks (e.g., the Internet). In some embodiments, when the device 1004a and the device 1004b are located in the vicinity of each other, the second device 1004b optionally captures and processes the speech signal from the first user directly. In such embodiments, when the second device 1004b detects, in the speech signal from the first user, each of the one or more keywords in the text 1006 of the first segment, the second device 1004b displays the corresponding portion of the first graphical illustration 1008 on the second device 1004b without requiring the illustration signal to be sent from the first user device 1004a.
In some embodiments, after the reading of the first segment has been completed by the first user (e.g., Alice), the first device continues to display text of a second segment of the multi-segment textual document that follows the first segment. For example, in some collaborative reading sessions, the first participant (e.g., Alice) optionally reads all or multiple consecutive portions of the textual document before passing the reading control to another participant (e.g., Max). In some embodiments, the display of the second segment optionally replaces the display of the first segment on the first device, when the text of the second segment is displayed on the first device. In some embodiments, when the second segment is displayed on the first device, a second graphical illustration associated with the second segment is displayed on the first device. In some embodiments, the second graphical illustration replaces the first graphical illustration on the first device. In some embodiments, an animation is presented on the first device showing the transformation from the first graphical illustration into the second graphical illustration, when the text of the second segment is displayed on the first device.
In some embodiments, after reading of one or more segments (including the first segment) is completed by the first user 1002a, the first user 1002a optionally passes the reading control to the second user 1002b. In some embodiments, the first user 1002a decides when to pass the reading control to the second user 1002b, e.g., by providing a manual switching input to the first device 1004a. For example, a manual switching input includes a user selection of a predetermined user interface element (e.g., a “switch” button) provided on the first device 1004a. In some embodiments, the first user 1002a optionally brings the first device 1004a close to or in contact with the second device 1004b to cause a switch input to be entered at both the first device 1004a and the second device 1004b. The switch input entered at the first device 1004a causes the first device to relinquish the reading control to the second device, and the switch input entered at the second device 1004b causes the second device to accept the reading control from the first device.
In some embodiments, locations for switching reading control have been predetermined and specified in the first user device (e.g., in a predetermined reading plan). In such embodiments, when the first device processes the speech signal from the first user and determines that the reading has reached a switching location (e.g., the end of the first segment) in the textual document, the first device automatically generates the switch signal and sends the switch signal to the second device to pass the reading control to the second device.
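In sketch form, the automatic hand-off reduces to firing the switch signal once the progress scan reaches the predetermined location; the message shape and the callable transport below are assumptions.

```python
def check_for_switch(words_read: int, switch_word_index: int,
                     send) -> bool:
    """Fire the switch signal once reading reaches the planned
    switching location; returns True when the switch was sent."""
    if words_read >= switch_word_index:
        send({"signal": "switch", "to": "second-device"})
        return True
    return False

sent = []                       # stub transport for demonstration
check_for_switch(42, 40, sent.append)
print(sent)  # [{'signal': 'switch', 'to': 'second-device'}]
```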
In some embodiments, after the first device has sent the switching signal to the second device, the first device assumes a passive role in the collaborative reading session, and waits for illustration signals from the second device. In some embodiments, the first device receives (914) respective second illustration signals from the second device, where each of the respective second illustration signals has been sent by the second device upon the second device detecting a second speech signal reading a respective second keyword in the second segment of the multi-segment textual document. In some embodiments, upon receiving each of the respective second illustration signals, the first device displays (916) a respective portion of a second graphical illustration for the second segment of the multi-segment textual document on the display of the first device. In some embodiments, the first device displays (918) the second segment of the multi-segment textual document on the first device when the second graphical illustration is completely displayed on the first device.
In some embodiments, the first device displays (920) at least one variable field in the text of the first segment (or any segment) of the multi-segment textual document currently displayed on the first device. In some embodiments, the first device also displays (922) two or more alternative selections for each of the at least one variable field on the first device. In some embodiments, the first device also allows freeform input from the user regarding the value of at least one of the variable fields. In some embodiments, the first device detects (924) user selection of a respective one of the two or more alternative selections in the first speech signal reading the first segment of the multi-segment textual document.
In some embodiments, the first device dynamically changes (926) the first graphical illustration of the first segment in accordance with the user selection of the respective one of the alternative selections. For example, in some embodiments, the first device stores a respective graphical illustration for the first segment in association with each alternative selection of the variable field. Before determining which portion of the first graphical illustration is displayed on the second device upon detection of a keyword, the first device generates or selects a particular graphical illustration that is associated with the selected alternative as the first graphical illustration for the first segment. In some embodiments, the first device stores a template illustration for the first segment, and upon selection of a particular alternative for the variable field, the first device dynamically generates the first graphical illustration for the first segment based on the template illustration and the selected alternative for the variable field.
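A variable field with per-selection illustrations could be represented as plain data, as in the sketch below; the field name, options, and file names are illustrative (the options echo the bear example discussed next).

```python
SEGMENT = {
    "text": "Suddenly, out of the forest came {animal}.",
    "options": ["a white-bearded bear", "a brown bear", "a giant bear"],
    "illustrations": {
        "a white-bearded bear": "bear_white_beard.png",
        "a brown bear": "bear_brown.png",
        "a giant bear": "bear_giant.png",
    },
}

def resolve_segment(segment, spoken_choice):
    """Fill the variable field and pick the matching illustration."""
    if spoken_choice not in segment["options"]:
        raise ValueError("selection not among the alternatives")
    text = segment["text"].format(animal=spoken_choice)
    return text, segment["illustrations"][spoken_choice]

print(resolve_segment(SEGMENT, "a giant bear"))
# ('Suddenly, out of the forest came a giant bear.', 'bear_giant.png')
```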
In some embodiments, instead of choosing the plot option herself, the first user (e.g., Alice) optionally allows the second user (e.g., Max) to choose the plot options. For example, the first user optionally enters a switching input after the first user's reading has reached the plot point (e.g., after the words “the bear”) in the text 1026. In some embodiments, the switching input causes the options to be presented on the second user device 1004b. Once the second user has selected the plot option by reading aloud one of the options presented or by making another type of selection input (e.g., a touch or mouse input), the second user device 1004b returns the reading control back to the first user device 1004a, e.g., in response to another switching input entered by the second user.
In some embodiments, the two or more alternative selections for a first variable field in the text of the first segment include (930) two or more alternative plot points, each plot point being associated with a respective alternative subsequent segment (e.g., second segment) of the multi-segment textual document following the first segment.
In some embodiments, the two or more alternative selections for a first variable field in the text of the first segment include (932) two or more alternative descriptions for an object or character mentioned in the first segment of the multi-segment textual document. For example, instead of a “white-bearded bear,” the first segment may include options such as “brown bear” or “giant bear” in addition to the “white-bearded bear” option for user selection. Selection of the different options would cause the graphical illustration to change accordingly as well.
In some embodiments, the two or more alternative selections for the first variable field in the text of the first segment include (934) two or more alternative positions, colors, shapes, sizes, textures, quantities, transparencies, material states, physical properties, and/or emotional states, etc., for a respective object or character mentioned in the first segment of the multi-segment textual document. Other alternative options and combinations thereof are also possible.
The foregoing description, for purpose of explanation, has been described with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated.
The present application claims priority to U.S. Provisional Application Ser. No. 61/785,361, filed Mar. 14, 2013, which is incorporated by reference herein in its entirety.