Annotating Notes From Passive Recording With User Data

Information

  • Patent Application
  • 20160293166
  • Publication Number
    20160293166
  • Date Filed
    August 28, 2015
    9 years ago
  • Date Published
    October 06, 2016
    8 years ago
Abstract
Systems and methods, and computer-readable media bearing instructions for carrying out methods of capturing notes from passive recording of an ongoing content stream and annotating the note with the identity of one or more persons are presented. Passive recording comprises temporarily recording the most recent content of the ongoing content stream. An ongoing content stream is passively recorded in a passive recording buffer. The passive recording buffer is configured to store a limited amount of recorded content corresponding to the most recently recorded content of the ongoing content stream. Upon indication by the user, a note is generated from the recorded content in the passive recording, annotated with the identity of one or more persons, and stored in a note file for the user.
Description
BACKGROUND

As most everyone will appreciate, it is very difficult to take handwritten notes while actively participating in an on-going conversation or lecture, whether or not one is simply listening or activity conversing with others. At best, the conversation becomes choppy as the note-taker must pause in the conversation (or in listening to the conversation) to commit salient points of the conversation to notes. Quite often, the note taker misses information (which may or may not be important) while writing down notes of a previous point. Typing one's notes does not change the fact that the conversation becomes choppy or the note taker (in typing the notes) will miss a portion of the conversation.


Recording an entire conversation and subsequently replaying and capturing notes during the replay, with the ability to pause the replay while the note taker captures information to notes, is one alternative. Unfortunately, this requires that the note taker invests the time to re-listen to the entire conversation to capture relevant points to notes.


Most people don't have an audio recorder per se, but often possess a mobile device that has the capability to record audio. While new mobile devices are constantly updated with more computing capability and storage, creating an audio recording of a typical lecture would consume significant storage resources.


SUMMARY

The following Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. The Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.


Systems and methods, and computer-readable media bearing instructions for carrying out methods of capturing notes from passive recording of an ongoing content stream and annotating the note with the identity of one or more persons are presented. Passive recording comprises temporarily recording the most recent content of the ongoing content stream. An ongoing content stream is passively recorded in a passive recording buffer. The passive recording buffer is configured to store a limited amount of recorded content corresponding to the most recently recorded content of the ongoing content stream. Upon indication by the user, a note is generated from the recorded content in the passive recording, annotated with the identity of one or more persons, and stored in a note file for the user.


According to additional aspects of the disclosed subject matter, a computer-implemented method conducted on a user's computing device for generating notes from an ongoing content stream is presented. The method comprises initiating passive recording of an ongoing content stream, where the passive recording stores recorded content of the ongoing content stream in a passive recording buffer. A user indication to generate a note based on the recorded content of the passive recording of the ongoing content stream is then received. In response to receiving the user indication to generate a note, a note is generated from the recorded content of the passive recording and the note is the stored in a note file. An indication to associate the identity of one or more persons with the generated note is received from the user and the generated note is annotated with the identities in the note file.


According to further aspects of the disclosed subject matter, a computer-readable medium bearing computer-executable instructions is presented. When the computer-executable instructions are executed on a computing system comprising at least a processor, the execution carries out a method for generating notes annotated with the identify of a person from an ongoing content stream. The method comprises at least initiating passive recording of the ongoing content stream, where the passive recording is stored in a passive recording buffer. A user indication to generate a note based on the recorded content of the passive recording of the ongoing content stream is received. A user indication to associate the identity of one or more persons with the generated note is also received. A note is generated from the ongoing content stream from the recorded content of the passive recording. The generated note is then annotated with the identities in the note file and the annotated note is stored in a note file in association with an event corresponding to the ongoing content stream.


According to still further aspects of the disclosed subject matter, a computing device for generating annotated notes from an ongoing content stream is presented. The computing device comprises a processor and a memory, where the processor executes instructions stored in the memory as part of or in conjunction with additional components to generate notes from an ongoing content stream. These additional components include at least a passive recording buffer, an audio recording component, a passive recording component, an annotation component, and a note generator component. In operation, the audio recording component records content of the ongoing content stream and the passive recording component obtains the recorded content of the ongoing content stream from the audio recording component and stores the recorded content to the passive recording buffer. The annotation component is configured to annotate a generated note with the identity of a person in conjunction with the note generator component. The note generator component initiates a passive recording process via the passive recording component. Further, the note generator component receives an indication from the user, via a user interface component, to capture the recorded content of the ongoing content stream, generate a note from the captured recorded content, annotate the generated note with the identity of a person via the annotation component, and store generated note in a note file in a data store.





BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing aspects and many of the attendant advantages of the disclosed subject matter will become more readily appreciated as they are better understood by reference to the following description when taken in conjunction with the following drawings, wherein:



FIG. 1A illustrate an exemplary audio stream (i.e., ongoing audio conditions) with regard to a time line, and further illustrates various the ongoing passive recording of the audio stream into an exemplary passive recording buffer;



FIG. 1B illustrates component with regard to an alternative implementation (to that of FIG. 1A) in conducting the ongoing passive recording of an audio stream into a passive recording buffer;



FIG. 2 is a flow diagram illustrating an exemplary routine for generating notes of the most recently portion of the ongoing content stream;



FIG. 3 is a flow diagram illustrating an exemplary routine for generating notes of the most recently portion of the ongoing content stream, and for continued capture until indicated by a user;



FIG. 4 is a block diagram illustrating exemplary components of a suitably configured computing device for implementing aspects of the disclosed subject matter;



FIG. 5 is a pictorial diagram illustrating an exemplary network environment suitable for implementing aspects of the disclosed subject matter;



FIG. 6 illustrates a typical home screen as presented by an app (or application) executing on a suitably configured computing device;



FIG. 7 illustrates the exemplary computing device of FIG. 6 after the user has interacted with the “add meeting” control;



FIG. 8 illustrates the exemplary computing device of FIG. 6 after the user has switched to the presentation of user-actionable controls, each control associated with the identity of a different person associated with the meeting, and each control configure to cause the generation of a note from the content of the passive recording buffer and to associate the identity of the corresponding user-actionable control with the note as an annotation;



FIG. 9 illustrates the exemplary computing device showing the notes associated with “Meeting 4”;



FIG. 10 illustrates an exemplary routine for generating notes of the most recently portion of the ongoing content stream, for continued capture until indicated by a user, and for annotating the captured note with the identity of one or more persons; and



FIG. 11 illustrates an exemplary routine for identifying and populating a people list corresponding to a current meeting.





DETAILED DESCRIPTION

For purposes of clarity, the term “exemplary,” as used in this document, should be interpreted as serving as an illustration or example of something, and it should not be interpreted as an ideal and/or a leading illustration of that thing.


For purposes of clarity and definition, the term “content stream” or “ongoing content stream” should be interpreted as being an ongoing occasion in which audio and/or audio visual content can be sensed and recorded. Examples of an ongoing content stream include, by way of illustration and not limitation: a conversation; a lecture; a monologue; a presentation of a recorded occasion; and the like. In addition to detecting a content stream via audio and/or audio/visual sensors or components, according to various embodiments the ongoing content stream may correspond to a digitized content stream which is being receives, as a digital stream, by the user's computing device.


The term “passive recording” refers to an ongoing recording of a content stream. Typically, the content stream corresponds to ongoing, current audio or audio/visual conditions as may be detected by condition sensing device such as, by way of illustration, a microphone. For purposed of simplicity of this disclosure, the description will generally be made in regard to passively recording audio content. However, in various embodiments, the ongoing recording may also include both visual content with the audio content, as may be detected by an audio/video capture device (or devices) such as, by way of illustration, a video camera with a microphone, or by both a video camera and a microphone. The ongoing recording is “passive” in that a recording of the content stream is only temporarily made; any passively recorded content is overwritten with more recent content of the content stream after a predetermined amount of time. In this regard, the purpose of the passive recording is not to generate an audio or audio/visual recording of the content stream for the user, but to temporarily store the most recently recorded content in the event that, upon direction by a person, a transcription to text of the most recently recorded content may be made and stored as a note for the user.


In passively recording the current conditions, e.g., the audio and/or audio/visual conditions, the recently recorded content is placed in a “passive recording buffer.” In operation, the passive recording buffer is a memory buffer in a host computing device configured to hold a limited, predetermined amount of recently recorded content. For example, in operation the passive recording buffer may be configured to store a recording of the most recent minute of the ongoing audio (or audio/visual) conditions as captured by the recording components of the host computing device. To further illustrate aspects of the disclosed subject matter, particularly in regard to passive recording and the passive recording buffer, reference is made to FIG. 1.



FIG. 1 illustrate an exemplary audio stream 102 (i.e., ongoing audio conditions) with regard to a time line 100, and further illustrates various the ongoing passive recording of the audio stream into an exemplary passive recording buffer. According to various embodiments of the disclosed subject matter and shown in FIG. 1, the time (as indicated by time line 100) corresponding to the ongoing audio stream 102 may be broken up according to time segments, as illustrated by time segments ts0-ts8. While the time segments may be determined according to implementation details, in one non-limiting example the time segment corresponds to 15 seconds. Correspondingly, the passive recording buffer, such as passive recording buffer 102, may configured such that it can store a predetermined amount of recently recorded content, where the predetermined amount corresponds to a multiple of the amount of recently recorded content that is recorded during a single time segment. As illustratively shown in FIG. 1, the passive recording buffer 102 is configured to hold an amount of the most recently recorded content corresponding to 4 time segments though, as indicted about, this number may be determined according to implementation details and/or according to user preferences.


Conceptually, and by way of illustration and example, with the passive recording buffer 102 configure to temporarily store recently recorded content corresponding to 4 time segments, the passive recording buffer 102 at the beginning of time segment ts4 will include the recently recorded content from time segments ts0-ts3, as illustrated by passive recording buffer 104. Similarly, the passive recording buffer 102, at the start of time period ts5, will include the recently recorded content from time segments ts1-ts4, and so forth as illustrated in passive recording buffers 106-112.


In regard to implementation details, when the recently recorded content is managed according to time segments of content, as described above, the passive recording buffer can implemented as a circular queue in which the oldest time segment of recorded content is overwritten as a new time segment begins. Of course, when the passive recording buffer 102 is implemented as a collection of segments of content (corresponding to time segments), the point at which a user provides an instruction to transcribe the contents of the passive recording buffer will not always coincide with a time segment. Accordingly, an implementation detail, or a user configuration detail, can be made such that recently recorded content of at least a predetermined amount of time is always captured. In this embodiment, if the user (or implementer) wishes to record at least 4 time segments of content, the passive recording buffer may be configured to hold 5 time segments worth of recently recorded content.


While the discussion above in regard to FIG. 1A is made in regard to capturing recently recording content along time segments, it should be appreciated that this is one manner in which the content may be passively recorded. Those skilled in the art will appreciate that there are other implementation methods in this an audio or audio/visual stream may be passively recorded. Indeed, in an alternative embodiment as shown in FIG. 1B, the passive recording buffer is configured to a size sufficient to contain a predetermined maximum amount of passively recorded content (as recorded in various frames) according to time. For example, if the maximum amount (in time) of passively recorded content is 2 minutes, then the passive recording buffer is configured to retain a sufficient number of frames, such as frames 160-164, which collectively correspond to 2 minutes. Thus, as new frames are received (in the on-going passive recording), older frames whose content falls outside of the preceding amount of time for passive recording will be discarded. In reference to passive buffer T0, assuming that the preceding amount of time to passively record is captured in 9 frames (as shown in passive buffer T0), when a new frame 165 is received, it is stored in the passive buffer and the oldest frame 160 is discarded, as shown in passive buffer T1.


While the passive recording buffer may be configured to hold a predetermined maximum amount of recorded content, independent of the maximum amount that a passive recording buffer can contain and according to various embodiments of the disclosed subject matter, a computer user may configure the amount of recent captured content to be transcribed and placed as a note in a note file—of course, constrained by the maximum amount of content (in regard to time) that the passive recording buffer can contain. For example, while the maximum amount (according to time) of passively recorded content that passive recording buffer may contain may be 2 minutes, in various embodiments the user is permitted to configure the length (in time) of passive recorded content to be converted to a note, such as the prior 60 seconds of content, the prior 2 minutes, etc. In this regard, the user configuration as to the length of the audio or audio/visual content stream to be transcribed and stored as a note in a note file (upon user instruction), is independent of the passive recording buffer size (except for the upper limit of content that can be stored in the buffer.) Further, while the example above suggests that the passive recording buffer may contain up to 2 minutes of content, this is merely illustrative and should not be construed as limiting upon the disclosed subject matter. Indeed, in various alternative, non-limiting embodiments, the passive recording buffer may be configured to hold up to any one of 5 minutes of recorded content, 3 minutes of recorded content, 90 seconds of recorded content, etc. Further, the size of the passive recording buffer may be dynamically determined, adjusted as needed according to user configuration as to the length of audio content to be converted to a note in a note file.


Rather than converting the frames (160-165) into an audio stream at the time that the frames are received and stored in the passive buffer, the frames are simply stored in the passive buffer according to their sequence in time. By not processing the frames as they are received but, instead, processing the frames into an audio stream suitable for transcription (as will be described below), significant processing resources may be conserved. However, upon receiving an indication that the content in the passive buffer is to be transcribed into a note, the frames are merged together into an audio (or audio/visual) stream that may be processed by a transcription component or service.


As shown with regard to FIGS. 1A and 1B, there may be any number of implementations of a passive buffer, and the disclosed subject matter should be viewed as being equally applicable to these implementations. Indeed, irrespective of the manner in which a passive buffer is implemented, what is important is that a predetermined period of preceding content is retained and available for transcription at the direction of the person using the system.


As briefly discussed above, with an ongoing audio stream (or audio/visual stream) being passively recorded, a person (i.e., user of the disclosed subject matter on a computing device) can cause that the most recently recorded content of the ongoing stream be transcribed to text and the transcription recorded in a notes file. FIG. 2 is a flow diagram illustrating an exemplary routine 200 for generating notes, i.e., a textual transcription of the recently recorded content, of the most recently portion of the ongoing audio stream. Beginning at block 202, a passive recording process of the ongoing audio stream is commenced. As should be understood, this passive recording is an ongoing process and continues recording the ongoing audio (or audio/visual) stream (i.e., the content stream) until specifically terminated at the direction of the user, irrespective of other steps/activities that are taken with regard to routine 200. With regard to the format of the recorded content by the passive recording process, it should be appreciated that any suitable format may be used including, by way of illustration and not limitation, MP3 (MPEG-2 audio layer III), AVI (Audio Video Interleave), AAC (Advanced Audio Coding), WMA (Windows Media Audio), WAV (Waveform Audio File Format), and the like. Typically though not exclusively, the format of the recently recorded content is a function of the codec (coder/decoder) that is used to convert the audio content to a file format.


At block 204, with the passive recording of the content stream ongoing, the routine 200 awaits a user instruction. After receiving a user instruction, at decision block 206 a determination is made as to whether the user instruction is in regard to generating notes (from the recorded content in the passive recording buffer 102) or in regard to terminating the routine 200. If the instruction is in regard to generating a note, at block 208 the recently recorded content in the passive recording buffer is captured. In implementation, typically capturing the recently recorded content in the passive recording buffer comprises copying the recently recorded content from the passive recording buffer into another temporary buffer. Also, to the extent that the content in the passive recording buffer is maintained as frames, the frames are merged into an audio stream (or audio/visual stream) into the temporary buffer. This copying is done such that the recently recorded content can be transcribed without impacting the passive recording of the ongoing audio stream such that information/content of the ongoing content stream is continuously recorded.


At block 210, after capturing the recently recorded content in the passive recording buffer, the captured recorded content is transcribed to text. According to aspects of the disclosed subject matter, the captured recorded content may be transcribed by executable transcription components (comprising hardware and/or software components) on the user's computing device (i.e., the same device implementing routine 200). Alternatively, a transcription component may transmit the captured recorded content to an online transcription service and, in return, receive a textual transcription of the captured recorded content. As additional alternatives, the captured recorded content may be temporarily stored for future transcription, e.g., storing the captured recorded content for subsequent uploading to a computing device with sufficient capability to transcribe the content, or storing the captured recorded content until a network communication can be established to obtain a transcription from an online transcription service.


At block 212, the transcription is saved as a note in a note file. In addition to the text transcription of the captured recorded content, additional information may be stored with the note in the note file. Information such as the date and time of the captured recorded content may be stored with or as part of the note in the note file. A relative time (relative to the start of routine 200) may be stored with or as part of the note in the note file. Contextual information, such as meeting information, GPS location data, user information, and the like can be stored with or as part of the note in the note file. After generating the note and storing it in the note file, the routine 200 returns to block 204 to await additional instructions.


At some point, at decision block 206, the user instruction/action may be in regard to terminating the routine 200. Correspondingly, the routine 200 proceeds to block 214 where the passive recording of the ongoing audio (or audio/visual) stream is terminated, and the routine 200 terminates.


Often, an interesting portion of an ongoing conversation/stream may be detected and the user will wish to not only capture notes regarding the most recent time period, but continue to capture the content in an ongoing manner. The disclosed subject matter may be suitably and advantageously implemented to continue capturing the content (for transcription to a text-based note) as described in regard to FIG. 3. FIG. 3 is a flow diagram illustrating an exemplary routine 300 for generating notes of the most recently portion of the ongoing content stream, and for continued capture until indicated by a user. As will be seen, many aspects of routine 200 and routine 300 are the same.


Beginning at block 302, a passive recording process of the ongoing audio stream is commenced. As indicated above in regard to routine 200, this passive recording process is an ongoing process and continues recording the ongoing content stream until specifically terminated, irrespective of other steps/activities that are taken with regard to routine 300. With regard to the format of the recently recorded content, it should be appreciated that any suitable format may be used including, by way of illustration and not limitation, MP3 (MPEG-2 audio layer III), AVI (Audio Video Interleave), AAC (Advanced Audio Coding), WMA (Windows Media Audio), WAV (Waveform Audio File Format), and the like.


At block 304, with the passive recording ongoing, the routine 300 awaits a user instruction. After receiving a user instruction, at decision block 306 a determination is made as to whether the user instruction is in regard to generating a note (from the recorded content in the passive recording buffer 102) or in regard to ending the routine 300. If the user instruction is in regard to generating a note, at block 308 the recently recorded content in the passive recording buffer is captured. In addition to capturing the recorded content from the passive recording buffer, at decision block 310 a determination is made in regard to whether the user has indicated that the routine 300 should continue capturing the ongoing audio stream for transcription as an expanded note. If the determination is made that the user has not indicated that the routine 300 should continue capturing the ongoing audio stream, the routine proceeds to block 316 as described below. However, if the user has indicated that the routine 300 should continue capturing the ongoing audio stream as part of an expanded note, the routine proceeds to block 312.


At block 312, without interrupting the passive recording process, the ongoing recording of the ongoing content stream to the passive recording buffer is continually captured as part of expanded captured recorded content, where the expanded captured recorded content is, thus, greater than the amount of recorded content that can be stored in the passive recording buffer. At block 314, this continued capture of the content stream continues until an indication from the user is received to release or terminate the continued capture. At block 316, after capturing the recently recorded content in the passive recording buffer and any additional content as indicated by the user, the captured recorded content is transcribed to text. As mentioned above in regard to routine 200 of FIG. 2, the captured recorded content may be transcribed by executable transcription components (comprising hardware and/or software components) on the user's computing device. Alternatively, a transcription component may transmit the captured recorded content to an online transcription service and, in return, receive a textual transcription of the captured recorded content. As additional alternatives, the captured recorded content may be temporarily stored for future transcription, e.g., storing the captured recorded content for subsequent uploading to a computing device with sufficient capability to transcribe the content, or storing the captured recorded content until a network communication can be established to obtain a transcription from an online transcription service.


At block 318, the transcription is saved as a note in a note file, i.e., a data file comprising at least one or more text notes. In addition to the text transcription of the captured recorded content, additional information may be stored with the note in the note file. Information such as the date and time of the captured recorded content may be stored with or as part of the note in the note file. A relative time (relative to the start of routine 200) may be stored with or as part of the note in the note file. Contextual information, such as meeting information, GPS location data, user information, and the like can be stored with or as part of the note in the note file. After generating the note and storing it in the note file, the routine 300 returns to block 304 to await additional instructions.


As mentioned above, at decision block 306, the user instruction/action may be in regard to terminating the routine 300. In this condition, the routine 300 proceeds to block 320 where the passive recording of the ongoing audio (or audio/visual) stream is terminated, and thereafter the routine 300 terminates.


Regarding routines 200 and 300 described above, as well as routines 1000 and 1100 described below as well as other processes describe herein, while these routines/processes are expressed in regard to discrete steps, these steps should be viewed as being logical in nature and may or may not correspond to any actual and/or discrete steps of a particular implementation. Also, the order in which these steps are presented in the various routines and processes, unless otherwise indicated, should not be construed as the only order in which the steps may be carried out. In some instances, some of these steps may be omitted. Those skilled in the art will recognize that the logical presentation of steps is sufficiently instructive to carry out aspects of the claimed subject matter irrespective of any particular language in which the logical instructions/steps are embodied.


Of course, while these routines include various novel features of the disclosed subject matter, other steps (not listed) may also be carried out in the execution of the subject matter set forth in these routines. Those skilled in the art will appreciate that the logical steps of these routines may be combined together or be comprised of multiple steps. Steps of the above-described routines may be carried out in parallel or in series. Often, but not exclusively, the functionality of the various routines is embodied in software (e.g., applications, system services, libraries, and the like) that is executed on one or more processors of computing devices, such as the computing device described in regard FIG. 4 below. Additionally, in various embodiments all or some of the various routines may also be embodied in executable hardware modules including, but not limited to, system on chips, codecs, specially designed processors and or logic circuits, and the like on a computer system.


These routines/processes are typically embodied within executable code modules comprising routines, functions, looping structures, selectors such as if-then and if-then-else statements, assignments, arithmetic computations, and the like. However, the exact implementation in executable statement of each of the routines is based on various implementation configurations and decisions, including programming languages, compilers, target processors, operating environments, and the linking or binding operation. Those skilled in the art will readily appreciate that the logical steps identified in these routines may be implemented in any number of ways and, thus, the logical descriptions set forth above are sufficiently enabling to achieve similar results.


While many novel aspects of the disclosed subject matter are expressed in routines embodied within applications (also referred to as computer programs), apps (small, generally single or narrow purposed, applications), and/or methods, these aspects may also be embodied as computer-executable instructions stored by computer-readable media, also referred to as computer-readable storage media, which are articles of manufacture. As those skilled in the art will recognize, computer-readable media can host, store and/or reproduce computer-executable instructions and data for later retrieval and/or execution. When the computer-executable instructions that are hosted or stored on the computer-readable storage devices are executed, the execution thereof causes, configures and/or adapts the executing computing device to carry out various steps, methods and/or functionality, including those steps, methods, and routines described above in regard to the various illustrated routines. Examples of computer-readable media include, but are not limited to: optical storage media such as Blu-ray discs, digital video discs (DVDs), compact discs (CDs), optical disc cartridges, and the like; magnetic storage media including hard disk drives, floppy disks, magnetic tape, and the like; memory storage devices such as random access memory (RAM), read-only memory (ROM), memory cards, thumb drives, and the like; cloud storage (i.e., an online storage service); and the like. While computer-readable media may deliver the computer-executable instructions (and data) to a computing device for execution via various transmission means and mediums including carrier waves and/or propagated signals, for purposes of this disclosure computer readable media expressly excludes carrier waves and/or propagated signals.


Advantageously, many of the benefits of the disclosed subject matter can be conducted on computing devices with limited computing capacity and/or storage capabilities. Further still, many of the benefits of the disclosed subject matter can be conducted on computing devices of limited computing capacity, storage capabilities as well as network connectivity. Indeed, suitable computing devices suitable for implementing the disclosed subject matter include, by way of illustration and not limitation: mobile phones; tablet computers; “phablet” computing devices (the hybrid mobile phone/tablet devices); personal digital assistants; laptop computers; desktop computers; and the like.


Regarding the various computing devices upon which aspects of the disclosed subject matter may be implemented, FIG. 4 is a block diagram illustrating exemplary components of a suitably configured computing device 400 for implementing aspects of the disclosed subject matter. The exemplary computing device 400 includes one or more processors (or processing units), such as processor 402, and a memory 404. The processor 402 and memory 404, as well as other components, are interconnected by way of a system bus 410. The memory 404 typically (but not always) comprises both volatile memory 406 and non-volatile memory 408. Volatile memory 406 retains or stores information so long as the memory is supplied with power. In contrast, non-volatile memory 408 is capable of storing (or persisting) information even when a power supply is not available. Generally speaking, RAM and CPU cache memory are examples of volatile memory 406 whereas ROM, solid-state memory devices, memory storage devices, and/or memory cards are examples of non-volatile memory 408. Also illustrated as part of the memory 404 is a passive recording buffer 414. While shown as being separate from both volatile memory 406 and non-volatile memory 408, this distinction is for illustration purposes in identifying that the memory 404 includes (either as volatile memory or non-volatile memory) the passive recording buffer 414.


Further still, the illustrated computing device 400 includes a network communication component 412 for interconnecting this computing device with other devices over a computer network, optionally including an online transcription service as discussed above. The network communication component 412, sometimes referred to as a network interface card or NIC, communicates over a network using one or more communication protocols via a physical/tangible (e.g., wired, optical, etc.) connection, a wireless connection, or both. As will be readily appreciated by those skilled in the art, a network communication component, such as network communication component 412, is typically comprised of hardware and/or firmware components (and may also include or comprise executable software components) that transmit and receive digital and/or analog signals over a transmission medium (i.e., the network.)


The processor 402 executes instructions retrieved from the memory 404 (and/or from computer-readable media) in carrying out various functions, particularly in regard to responding to passively recording an ongoing audio or audio/visual stream and generating notes from the passive recordings, as discussed and described above. The processor 401 may be comprised of any of a number of available processors such as single-processor, multi-processor, single-core units, and multi-core units.


The exemplary computing device 400 further includes an audio recording component 420. Alternatively, not shown, the exemplary computing device 400 may be configured to include an audio/visual recording component, or both an audio recording component and a visual recording component, as discussed above. The audio recording component 420 is typically comprised of an audio sensing device, such as a microphone, as well as executable hardware and software, such as a hardware and/or software codec, for converting the sensed audio content into recently recorded content in the passive recording buffer 414. The passive recording component 426 utilizes the audio recording component 420 to capture audio content to the passive recording bugger, as described above in regard to routines 200 and 300. A note generator component 428 operates at the direction of the computing device user (typically through one or more user interface controls in the user interface component 422) to passively capture content of an ongoing audio (or audio/visual) stream, and to further generate one or more notes from the recently recorded content in the passive recording buffer 414, as described above. As indicated above, the note generator component 428 may take advantage of an optional transcription component 424 of the computing device 400 to transcribe the captured recorded content from the passive recording buffer 414 into a textual representation for saving in a note file 434 (of a plurality of note files) that is stored in a data store 430. Alternatively, the note generator component 428 may transmit the captured recorded content of the passive recording buffer 414 to an online transcription service over a network via the network communication component 412, or upload the captured audio content 432, temporarily stored in the data store 430, to a more capable computing device when connectivity is available.


An annotation component 436 is configured to annotate a generated note with one or more annotations including the identity of one or more persons/users associated with a meeting. Annotating a generated note is described in greater detail below in regard to FIGS. 6-9 and 10.


Regarding the data store 430, while the data store may comprise a hard drive and/or a solid state drive separately accessible from the memory 404 typically used on the computing device 400, as illustrated, in fact this distinction may be simply a logical distinction. In various embodiments, the data store is a part of the non-volatile memory 408 of the computing device 400. Additionally, while the data store 430 is indicated as being a part of the computing device 400, in an alternative embodiment the data store may be implemented as a cloud-based storage service accessible to the computing device over a network (via the network communication component 412).


Regarding the various components of the exemplary computing device 400, those skilled in the art will appreciate that these components may be implemented as executable software modules stored in the memory of the computing device, as hardware modules and/or components (including SoCs—system on a chip), or a combination of the two. Indeed, components such as the passive recording component 426, the note generator component 428, and the transcription component 424 may be implemented according to various executable embodiments including executable software modules that carry out one or more logical elements of the processes described in this document, or as as hardware components that include executable logic to carry out the one or more logical elements of the processes described in this document. Examples of these executable hardware components include, by way of illustration and not limitation, ROM (read-only memory) devices, programmable logic array (PLA) devices, PROM (programmable read-only memory) devices, EPROM (erasable PROM) devices, and the like, each of which may be encoded with instructions and/or logic which, in execution, carry out the functions described herein.


Moreover, in certain embodiments each of the various components may be implemented as an independent, cooperative process or device, operating in conjunction with or on one or more computer systems and or computing devices. It should be further appreciated, of course, that the various components described above should be viewed as logical components for carrying out the various described functions. As those skilled in the art will readily appreciate, logical components and/or subsystems may or may not correspond directly, in a one-to-one manner, to actual, discrete components. In an actual embodiment, the various components of each computing device may be combined together or distributed across multiple actual components and/or implemented as cooperative processes on a computer network.



FIG. 5 is a pictorial diagram illustrating an exemplary environment 500 suitable for implementing aspects of the disclosed subject matter. As shown in FIG. 5, a computing device 400 (in this example the computing device being a mobile phone of user/person 501) may be configured to passively record an ongoing conversation among various persons as described above, including persons 501, 503, 505 and 507. Upon an indication by the user/person 501, the computing device 400 captures the contents of the passive recording buffer 414, obtains a transcription of the recently recorded content captured from the passive recording buffer, and stores the textual transcription as a note in a note file in a data store. The computing device 400 is connected to a network 502 over which the computing device may obtain a transcription of captured audio content (or audio/visual content) from a transcription service 510, and/or store the transcribed note in an online and/or cloud-based data store (not shown).


In addition to capturing or generating a note of an ongoing content stream, quite often a person may wish to identify the current speaker of the captured note, i.e., associate or annotate the name or identity of the speaker with the note. For example, in a business meeting, it is often important to capture who proposed a particular idea or raised a particularly salient set of issues. Similarly, it may be important to identify who has suggested taking various actions or activities. Alternatively, in a family meeting, it might be very useful in associating a particular discussion with the person speaking. Indeed, in these circumstances and others, it would be advantageous to be able to annotate a captured note with the identity of a person (either as one speaking or to which a particular conversation pertains) for future reference.


Still further, while generating a note from an ongoing conversation may capture critical information, locating that a particular note and/or understanding the context of a particular note may be greatly enhanced when the person is able to associate the identity of the speaker (or the subject of the conversation) with the note, i.e., annotate the captured note with the identity of a person. With reference to the examples above, that person may greatly enhance the efficiency by which he/she can recall the particular context of a note and/or identify one or more notes that pertain to that person by associating the note with the identity of a person (or the identities of multiple persons.


According to aspects of the disclosed subject matter, during an ongoing passive recording of a content stream, a person may provide an indication as to the identity of a person to be associated with a generated note (i.e., a note to be captured from the passive recording buffer as described above). This indication may be made as part of or in addition to providing an indication to capture and generated a particular note of an ongoing conversation or audio stream, as set forth in regard to FIGS. 6-9 and 10. Indeed, FIGS. 6-9 illustrate an exemplary interaction with a computing device executing an application for capturing notes from an on-going audio conversation, and further illustrating the annotation of a captured note with the identity of one or more persons. FIG. 10 illustrates an exemplary routine 1000 for generating notes of the most recently passively recorded content of the ongoing content stream, for continued capture until indicated by a user, and for annotating the captured note with the identity of one or more persons.


With regard to FIGS. 6-9, FIG. 6 illustrates a typical home screen as presented by an app (or application) executing on a computing device 600. The home screen illustrates/includes several meeting entries, such as meeting entries 602 and 604, for which the user of the computing device has caused the app or application to capture notes from an on-going conversation. According to aspects of the disclosed subject matter, meetings are used and viewed as an organizational tool of captured notes, i.e., for providing a type of folder in which captured notes can be grouped together. As can be seen, meeting 602 entitled “PM Meeting,” which occurred on “Jul. 27, 2015” at “10:31 AM,” includes two captured notes. Similarly, meeting 604 entitled “Group Mtg,” occurred on “Jul. 28, 2015” at “1:30 PM” and includes three captured notes.


In addition to listing the “meetings” (which, according to various embodiments, is more generically used as a folder for collecting generated notes of an on-going audio stream), a user may also create a new meeting (or folder corresponding to the meeting) by interacting with the “add meeting” control 606. Thus, if the user is attending an actual meeting and wishes to capture (or may wish to capture) notes from the conversation of the meeting, the user simply interacts with the “add meeting” control 606, which begins the action of passively recording an ongoing content stream and thereby enabling the user to capture notes.


Turning to FIG. 7, FIG. 7 illustrates the exemplary computing device 600 of FIG. 6 after the user has interacted with the “add meeting” control 606. As indicated above, according to one embodiment of the disclosed subject matter, as part of creating a new meeting, the note capturing app/application executing on the computing device 600 begins the process of passively recording the on-going content stream, as indicated by the status indicator 702. In addition to the status indicator 702, a meeting title 704 is also displayed, in this case the title being a default title (“Meeting 4”) for the new meeting. Of course, in various embodiments, the default title of the meeting may be user configurable to something meaningful to the user. Alternatively, the title of a meeting may be obtained from the user's calendar (i.e., a meeting coinciding with the current time). Also shown by the app on the computing device 600 is a duration control 706 by which the user can control the amount/duration (as a function of seconds) of content that is captured in the passive recording buffer, as described above. In the present example, the amount of content, in seconds, that is captured in the passive recording buffer is set at 30 seconds, i.e., at least the prior 30 seconds worth of content is captured in the passive recording buffer and available for generating a note.


Also shown in FIG. 7 is a capture button 708. According to aspects of the disclosed subject matter, by interacting with the capture button 708, the user can cause the underlying app to capture/generate a note from the content captured in the passive recording buffer and store the note, in conjunction with the meeting, in a note file as described above. Indeed, as also described, through continued interaction with the capture button 708, such as continuing to press on the capture button 708, the amount of content that is captured in the currently captured/generated note is extended until the interaction ceases, thereby extending the amount or duration of content that is captured in a note. Also presented on the computing device 600 is a home control 710 that causes the passive recording operation to cease and returns to the home page (as illustrated in FIG. 6), and a user toggle control 712 that switches from “typical” note capturing to annotated note capturing according to user, as described below.


In addition to simply capturing a note from the on-going content stream, a user may wish to associate the identity of the current speaker with a captured note as an annotation to the note. According to aspects of the disclosed subject matter, a user may annotate an already-captured note with the identity of one or more users after the note has been captured/generated. Alternatively, a user may cause that a note be generated in conjunction with the selection of the identity of one or more users, which identity (identities) are to be associated with the generated note. Indeed, by interacting with the user toggle control 712, the user can switch to/from simply capturing notes to a screen for capturing notes to be annotated with the identity of a user (or users).



FIG. 8 illustrates the exemplary computing device 600 after the user has switched to the presentation of user-actionable controls, each control associated with the identity of a different person associated with the meeting, and each control configure to cause the generation of a note from the content of the passive recording buffer and to associate the identity of the corresponding user-actionable control with the note as an annotation. As shown in FIG. 8, the computing device 600 now presents a list of user-actionable controls 802-812, each control being associated with a particular person/users of the meeting. Indeed, according to aspects of the disclosed subject matter, by interacting with any one of the user-actionable controls 802-812, a note is generated and stored in the note file and annotated with user identity associated with the user-actionable control, and also associated with the meeting/event corresponding to the ongoing content stream. In other words, the indication to associate the user identity with a generated note is also an indication to generate a note based on the recorded content of the passive recording of the ongoing content stream. Also shown in FIG. 8 are the home control 710 that causes the passive recording to cease and returns to home page (as illustrated in FIG. 6), and a revert toggle control 814 that switches or toggles between “typical” note capturing (as shown in FIG. 7) and the “annotated” note capturing as shown in FIG. 8.


It should be appreciated and according to various embodiments, while a user may generate a note through interaction with a user-actionable control associated with the identity of a person, such as user-actionable control 802, a user may further configure a generated note to be associated with one or more additional persons. In this manner, a generated note may be associated with the identities of multiple persons. Of course, as will be readily appreciated, quite often the conversation for which a particular note generated is applicable to more than one party. While the computing device 600 of FIG. 8 illustrates a manner in which the identity of a single person may be associated with a note, other user interfaces may be presented in which the user is able to associate the identity of one or more users with a generated note, i.e., the user may add, delete and/or modify the users that are associated with any given note.


Assuming that the user has captured both a non-annotated note and annotated note (i.e., a note to which the identity of a user is associated) for the meeting shown in the exemplary FIGS. 6-8, FIG. 9 illustrates the exemplary computing device 600 showing the notes captured and associated with “Meeting 4.” Indeed, as shown in FIG. 9, a title control 902 displays the current name of the meeting (“Meeting 4”), a status control 904 illustrates various status information regarding the notes of the meeting, including that there are two (2) notes captured from the meeting, and further includes note 906 and 908. According to the illustrated embodiment, each note 906 and 908 is presented as a user-actionable control for presenting the corresponding note to the user. As can be seen, the first note 906 is not associated with a person (as indicted by the lack of any person's image on the control), whereas note 908 is associated/annotated with a person as indicted by the presence of the party's image 912. In addition to the note controls 906 and 908, the computing device 600 also includes a user-actionable record icon 910 that returns to recording notes (either of the screens displayed in FIG. 7 or 8) to continue capturing notes for this meeting.


While FIGS. 6-9 illustrate a particular set of user interfaces for interacting with an app executing on a computing device for capturing notes with a user identity annotation, it should be appreciated that this is simply one example of such user interaction and should note be viewed as limiting upon the disclosed subject matter. Those skilled in the art will appreciate that there may be any number of user interfaces that may be suitably used by an app to capture a note of an on-going audio stream from a passive recording buffer and associated or annotate the note with the identity of one or more users.


Turning to FIG. 10, FIG. 10 illustrates an exemplary routine 1000 for generating notes of the most recently portion of the ongoing content stream as located in the passive recording buffer as described above, for continued capture until indicated by a user, and for annotating the captured note with the identity of one or more users/persons. Beginning at block 1002, a passive recording process of the ongoing audio stream is commenced. At block 1004, with the passive recording ongoing, the routine 1000 awaits a user instruction.


After receiving user instruction, at decision block 1006 a determination is made as to whether the user instruction is in regard to generating a note (from the recorded content in the passive recording buffer 102) or in regard to ending the routine 1000. If the user instruction is in regard to generating a note, at block 1008 the recently recorded content in the passive recording buffer is captured. In addition to capturing the recorded content from the passive recording buffer, at decision block 1010 a determination is made in regard to whether the user has indicated that the routine 1000 should continue capturing the ongoing audio stream for transcription as an expanded note. If the determination is made that the user has not indicated that the routine 1000 should continue capturing the ongoing audio stream, the routine proceeds to block 1016 as described below. However, if the user has indicated that the routine 1000 should continue capturing the ongoing audio stream as part of an expanded note, the routine proceeds to block 1012. At block 1012, without interrupting the passive recording process, the ongoing recording of the ongoing content stream to the passive recording buffer is continually captured as part of expanded captured recorded content, where the expanded captured recorded content is, thus, greater than the amount of recorded content that can be stored in the passive recording buffer.


At block 1014, this continued capture of the content stream continues until an indication from the user is received to release or terminate the continued capture. At block 1016, after capturing the recently recorded content in the passive recording buffer and any additional content as indicated by the user, a note is generated from the captured recorded content. According to various embodiments, the note may be generated according to a transcription of the recorded/captured content. Alternatively, a note may be generated as a single audio file from the recorded/captured content. Further still, a note may be stored in a note file in multiple formats, such as an audio file and a transcription.


At block 1018, the generated note is then stored in a note file, i.e., a data file comprising at least one or more text notes. As indicated above, according to various embodiments, the note may be stored in a note file in association with (or as part of) a meeting. At block 1020, a determination is made as to whether the identity of a person is to be associated with the generated note, i.e., whether to annotate the note with the identity of one or more persons. If the note is not to be annotated with the identity of one or more persons, the routine 1000 returns to block 1004 to await additional user instructions. Alternatively, if the generated note is to be annotated with the identity of one or more persons, at block 1022 the note is annotated with the associated person's identity and the routine 1000 returns to block 1004.


As mentioned above, at decision block 1006, the user instruction/action may be in regard to terminating the routine 1000. In this condition, the routine 1000 proceeds to block 1024 where the passive recording of the ongoing audio (or audio/visual) stream is terminated, and thereafter the routine 1000 terminates.


In regard to identifying persons to be displayed as being a part of a meeting, according to aspects of the disclosed subject matter the list of persons to be presented on the “annotated” note capturing screen, as shown in FIG. 8, may be determined from the calendar of that user operating the app/application on the computing device. Indeed, FIG. 11 is a flow diagram illustrating an exemplary routine 1100 for populating a people list corresponding to a particular meeting. Beginning at block 1102, notice to initiate a meeting is received. As discussed above, this notice may be received as a product of the user interacting with the “add meeting” control 606 as shown in FIG. 6. At block 1104, the user/operator is added to the people list for the current meeting. At block 1106, a determination is made as to whether to utilize a corresponding appointment/meeting from the user's calendar. For example, if an appointment corresponding to the current time is available, a query may be made as to whether to include the participants of the appointment/meeting as people for the current meeting. Of course, at least one reason that the user is queries and that the user of routine 1100 may decline the inclusion of the participants of a corresponding appointment and/or meeting, is because the user intends, instead, to create an alternative, user-defined meeting. If, at decision block 1106, there is no corresponding meeting to the current time, or the user declines to use the current calendar meeting, the routine 1100 simply terminates. Alternatively, at block 1108, the user's calendar is accessed. At block 1110, the users associated with the concurrent appointment are identified and, at block 1112, the identified users are added to the people list. Thereafter, the routine 1100 terminates.


While not shown, in addition to identifying persons associated with a concurrent appointment, or as an alternative to identifying persons associated with a concurrent appointment, the user operating the app on the computing device may also/alternatively manually identify one or more persons to be presented as options in the “annotated” note capturing screen, as shown in FIG. 8. In other words, the list of persons to be presented may be user configurable, either in part or in total.


While various novel aspects of the disclosed subject matter have been described, it should be appreciated that these aspects are exemplary and should not be construed as limiting. Variations and alterations to the various aspects may be made without departing from the scope of the disclosed subject matter.

Claims
  • 1. A computer-implemented method conducted on a computing device, comprising at least a processor and a memory, for generating notes from an ongoing content stream, the method comprising: initiating passive recording of an ongoing content stream, the passive recording storing recorded content of the ongoing content stream in a passive recording buffer;receiving a user indication to generate a note based on the recorded content of the passive recording of the ongoing content stream;generating a note of the ongoing content stream from the recorded content of the passive recording and storing the note in a note file;receiving an indication to associate the identity of a person with the generated note from the user; andannotating the generated note with the identity of the person in the note file.
  • 2. The computer-implemented method of claim 1, wherein the indication to associate the identity of the person with the generated note is received from the user after the note has been generated and stored in the note file.
  • 3. The computer-implemented method of claim 1, wherein the indication to associate the identity of the person with the generated note is automatically generated as a function of generating the note based on the recorded content of the passive recording of the ongoing content stream.
  • 4. The computer-implemented method of claim 3, wherein the indication to associate the identity of the person with the generated note is included as part of the user indication to generate the note based on the recorded content of the passive recording of the ongoing content stream.
  • 5. The computer-implemented method of claim 1, wherein the identity of the person associated with the generated note is one of a plurality of identities corresponding to a plurality of persons.
  • 6. The computer-implemented method of claim 5 further comprising: accessing a calendar of the user of the computing device;identifying a concurrent appointment in the accessed calendar; andidentifying the plurality of persons from a plurality of users associated with the concurrent appointment.
  • 7. The computer-implemented method of claim 5, wherein the plurality of persons is user configurable.
  • 8. The computer-implemented method of claim 1 further comprising: receiving an indication to associate a plurality of identities corresponding to a plurality of persons with the generated note from the user; andannotating the generated note with the plurality of identities in the note file.
  • 9. The computer-implemented method of claim 1, wherein the generated note is stored in the note file in association with a meeting corresponding to the ongoing content stream.
  • 10. The computer-implemented method of claim 8, wherein the generated note is stored in the note file as one of a plurality of notes associated with the meeting corresponding to the ongoing content stream.
  • 11. A computer-readable medium bearing computer-executable instructions which, when executed on a computing system comprising at least a processor, carry out a method for generating notes annotated with an identity of a person from an ongoing content stream, the method comprising: initiating passive recording of an ongoing content stream, the passive recording storing recorded content of the ongoing content stream in a passive recording buffer;receiving a user indication to generate a note based on the recorded content of the passive recording of the ongoing content stream;receiving a user indication to associate an identity of a persons of a plurality of persons with the generated note;generating a note of the ongoing content stream from the recorded content of the passive recording;annotating the generated note with the identity of the person in the note file; andstoring the note in a note file and the annotation in association with an event corresponding to the ongoing content stream.
  • 12. The computer-readable medium of claim 10, wherein the user indication to associate the identity of the person with the generated note is received from the user after the note has been generated and stored in the note file.
  • 13. The computer-readable medium of claim 10, wherein the user indication to associate the identity of the person the generated note is automatically applied as a function of generating the note based on the recorded content of the passive recording of the ongoing content stream.
  • 14. The computer-readable medium of claim 12, wherein the user indication to associate the identity of the person with the generated note is a part of the user indication to generate the note based on the recorded content of the passive recording of the ongoing content stream.
  • 15. The computer-readable medium of claim 10, wherein the plurality of persons is user configurable.
  • 16. The computer-readable medium of claim 10, the method further comprising: receiving a user indication to associate an additional identity of a person of the plurality of persons with the generated note; andannotating the generated note with the additional identity in the note file.
  • 17. The computer-readable medium of claim 10, wherein the generated note is stored in the note file as one of a plurality of notes associated with the meeting corresponding to the ongoing content stream.
  • 18. A computing device for generating annotated notes from an ongoing content stream, the computing device comprising a processor and a memory, wherein the processor executes instructions stored in the memory as part of or in conjunction with additional components to generate notes from an ongoing content stream, the additional components comprising: a passive recording buffer, the passive recording buffer configured to temporarily store a predetermined amount of recorded content of an ongoing content stream;an audio recording component, the audio recording component being configured to generate recorded content of the ongoing content stream;a passive recording component, the passive recording component being configured to obtain recorded content of the ongoing content stream from the audio recording component and store the recorded content to the passive recording buffer;an annotation component, the annotation component being configured to annotate a generated note with an identity of a person; anda note generator component, the note generator component being configured to initiate a passive recording process via the passive recording component, receive an indication from the user, via a user interface component, to capture recorded content of the ongoing content stream, generate a note from the captured recorded content, annotate the generated note with an identity of a person via the annotation component, and store the generated note in a note file in a data store.
  • 19. The computing device of claim 17, wherein the indication from the user to capture recorded content of the ongoing content stream includes an indication to annotate the generated note with the identity of a person.
  • 20. The computing device of claim 17, wherein the generated note is stored in the note file in the data store in association with a meeting corresponding to the ongoing content stream.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of U.S. patent application Ser. No. 14/678611, titled “Generating Notes From Passive Recording,” filed Apr. 3, 2015, which is incorporated herein by reference. This application further claims priority to U.S. Provisional Patent Application No. 62/186,313, titled “Generating Notes From Passive Recording With Annotations,” filed Jun. 29, 2015, which is incorporated herein by reference. This application is related to co-pending U.S. patent application Ser. No. ______, titled “Annotating Notes From Passive Recording With Categories,” filed ______, (attorney docket #357368.02).

Provisional Applications (1)
Number Date Country
62186313 Jun 2015 US
Continuation in Parts (1)
Number Date Country
Parent 14678611 Apr 2015 US
Child 14838849 US