Embodiments of the disclosure relate to computing devices programmed to detect compliance with a medication regimen.
Exemplary embodiments include a computing device configured to dynamically display a specific, structured interactive animated conversational graphical user interface paired with a prescribed functionality directly related to the structure of the interactive graphical user interface. Also included are a first computer vision model and a second computer vision model. The first computer vision model is configured to track a hand of a human, and the second computer vision model is configured to track a face of the human. The computing device is programmed with heuristic logic. The heuristic logic infers that if (i) the hand is visible, (ii) the face is visible, (iii) the back of the hand is visible, and (iv) the face is occluded, then a medication has been taken by the human.
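The disclosure does not prescribe an implementation for this heuristic logic; the following is a minimal sketch, assuming the two computer vision models emit per-frame boolean signals, with the signal names below being hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FrameSignals:
    """Hypothetical per-frame outputs of the two computer vision models."""
    hand_visible: bool          # first model: any hand detected
    face_visible: bool          # second model: face was detected earlier in the session
    back_of_hand_visible: bool  # first model: dorsal side of the hand faces the camera
    face_occluded: bool         # second model: previously visible face is now blocked


def medication_taken(signals: FrameSignals) -> bool:
    """Heuristic inference (i)-(iv): a hand and a face have been observed, and
    the back of the hand is now visible while the face is occluded, suggesting
    the hand has been raised to the mouth to take a medication."""
    return (
        signals.hand_visible
        and signals.face_visible
        and signals.back_of_hand_visible
        and signals.face_occluded
    )
```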
Further exemplary embodiments include a computer vision model configured to track a throat of the human to detect a swallow by the human. A computer vision model may also be configured to detect a pill type. The computing device may be any form of computing device, including a personal computer, laptop, tablet, or mobile device. Additionally, upon initiation, a user is provided one or more options to select a desired method for data entry, including voice, typing, touch, or combinations thereof, without having to switch back and forth. The user-provided data is validated based on characteristics defined within the specific, structured interactive animated conversational graphical user interface. The user-provided data may be further validated against external data stored in a cloud-based database.
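The disclosure does not name a validation mechanism; the following is a minimal sketch, assuming each interface field declares simple characteristics (a required flag and a pattern) and assuming a hypothetical `cloud_lookup` callable for the second-stage check against the cloud-based database.

```python
import re
from typing import Callable, Optional


def validate_entry(field: dict, value: str,
                   cloud_lookup: Optional[Callable[[str, str], bool]] = None) -> bool:
    """Validate user-provided data against characteristics defined for an
    interface field, then optionally against an external cloud-based database.

    `field` is a hypothetical descriptor, e.g.:
        {"name": "medication_name", "required": True,
         "pattern": r"[A-Za-z][A-Za-z0-9 \\-]*"}
    """
    if field.get("required") and not value.strip():
        return False
    pattern = field.get("pattern")
    if pattern and not re.fullmatch(pattern, value):
        return False
    # Second-stage validation against external data, e.g. a prescription record.
    if cloud_lookup is not None:
        return cloud_lookup(field["name"], value)
    return True
```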
The specific, structured interactive animated conversational graphical user interface according to many embodiments may complete and update a database entry. The specific, structured interactive animated conversational graphical user interface may convert text data to voice data for storage and for use in human conversation. It may also convert response data to audio files using cloud-based text-to-speech solutions capable of being integrated into a web-browser-based avatar in the form of a human.
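The disclosure does not name a particular text-to-speech solution; the following is a minimal sketch using Google Cloud Text-to-Speech purely as one illustrative cloud-based option, producing an MP3 file that a browser-based avatar front end could play back while animating the avatar.

```python
from google.cloud import texttospeech


def response_to_audio(response_text: str, out_path: str = "response.mp3") -> str:
    """Convert conversational response text to an audio file via a cloud TTS API."""
    client = texttospeech.TextToSpeechClient()
    synthesis_input = texttospeech.SynthesisInput(text=response_text)
    voice = texttospeech.VoiceSelectionParams(
        language_code="en-US",
        ssml_gender=texttospeech.SsmlVoiceGender.FEMALE,
    )
    audio_config = texttospeech.AudioConfig(
        audio_encoding=texttospeech.AudioEncoding.MP3
    )
    response = client.synthesize_speech(
        input=synthesis_input, voice=voice, audio_config=audio_config
    )
    with open(out_path, "wb") as f:
        f.write(response.audio_content)  # raw MP3 bytes returned by the service
    return out_path
```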
The accompanying drawings, where like reference numerals refer to identical or functionally similar elements throughout the separate views, together with the detailed description below, are incorporated in and form part of the specification, and serve to further illustrate embodiments of concepts that include the claimed disclosure, and explain various principles and advantages of those embodiments.
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the disclosure. It will be apparent, however, to one skilled in the art, that the disclosure may be practiced without these specific details. In other instances, structures and devices may be shown in block diagram form only in order to avoid obscuring the disclosure.
The Intel® RealSense™ D400 series depth camera uses stereo vision to calculate depth. The stereo vision implementation consists of a left imager, a right imager, and an optional infrared projector. The infrared projector projects a non-visible static IR pattern to improve depth accuracy in scenes with low texture. The left and right imagers capture the scene and send imager data to the depth imaging (vision) processor, which calculates depth values for each pixel in the image by correlating points on the left image to points on the right image and measuring the shift (disparity) between a point on the left image and its match on the right image. The depth pixel values are processed to generate a depth frame, and subsequent depth frames create a depth video stream. According to exemplary embodiments, these depth frames are analyzed as described and claimed herein.
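For reference, stereo depth follows depth ≈ (focal length × baseline) / disparity, computed on the camera's vision processor. The following is a minimal sketch of reading the resulting depth video stream with Intel's pyrealsense2 Python wrapper; the resolution and frame-rate parameters are illustrative.

```python
import pyrealsense2 as rs

# Start a depth stream from a RealSense D400-series camera.
pipeline = rs.pipeline()
config = rs.config()
config.enable_stream(rs.stream.depth, 640, 480, rs.format.z16, 30)
pipeline.start(config)

try:
    frames = pipeline.wait_for_frames()      # one set of frames from the stream
    depth_frame = frames.get_depth_frame()   # per-pixel depth values
    # Distance in meters at the center pixel of a 640x480 depth frame.
    center_distance = depth_frame.get_distance(320, 240)
    print(f"Distance at image center: {center_distance:.3f} m")
finally:
    pipeline.stop()
```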
At step 205, a medication compliance module is launched. For example, upon launching, a user may be shown the exemplary specific, structured interactive animated conversational graphical user interface with an avatar in the form of a human as shown in the accompanying drawings.
At step 210, the system waits for a user to be positioned in front of one or more depth cameras. For example, 305 shows a user positioned in front of one or more depth cameras.
At step 215, a determination is made as to whether a hand and a face are visible. If so, at step 220 the depth camera begins recording frames and the user is instructed to take a medication. If no hand and face are visible, the process returns to step 210.
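The disclosure does not tie steps 215-220 to a particular detector; the following is a minimal sketch of the visibility check using the MediaPipe hand and face models purely as illustrative stand-ins for the first and second computer vision models, assuming the camera's color frames are available as BGR arrays.

```python
import cv2
import mediapipe as mp

hands = mp.solutions.hands.Hands(max_num_hands=1, min_detection_confidence=0.5)
face = mp.solutions.face_detection.FaceDetection(min_detection_confidence=0.5)


def hand_and_face_visible(bgr_frame) -> bool:
    """Step 215: return True only when both a hand and a face are detected."""
    rgb = cv2.cvtColor(bgr_frame, cv2.COLOR_BGR2RGB)  # MediaPipe expects RGB
    hand_found = bool(hands.process(rgb).multi_hand_landmarks)
    face_found = bool(face.process(rgb).detections)
    return hand_found and face_found
```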
At step 225, a determination is made as to whether the back of a hand is visible while the face is occluded. If so, at step 230 medication compliance is detected. For example, 405 shows a user positioned in front of one or more depth cameras with the indication, “Medication Taken.” If the back of the hand is not visible or the face is not occluded, medication compliance is not detected.
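One way to approximate the occlusion condition of step 225 is to treat the face as occluded when the face detector stops reporting a detection while hand landmarks fall inside the face's last known bounding box. This is a hedged sketch of that idea, not the claimed method; the box-overlap test and the normalized landmark format (as produced by a detector such as MediaPipe) are assumptions.

```python
def face_occluded_by_hand(face_detections, hand_landmarks, last_face_box) -> bool:
    """Step 225 approximation: the face detector has lost the face, and at
    least one hand landmark (normalized x, y in [0, 1]) lies inside the
    face's last known bounding box (x_min, y_min, x_max, y_max)."""
    if face_detections:  # face still visible, so not occluded
        return False
    if not hand_landmarks or last_face_box is None:
        return False
    x_min, y_min, x_max, y_max = last_face_box
    return any(
        x_min <= lm.x <= x_max and y_min <= lm.y <= y_max
        for lm in hand_landmarks
    )
```

Detecting the back of the hand could similarly combine a detector's handedness output with landmark geometry, but the disclosure leaves that detail open.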
According to various exemplary embodiments, a three-dimensional avatar in the form of a human, as depicted in the accompanying drawings, may be displayed.
While various embodiments have been described herein, it should be understood that they have been presented by way of example only, and not limitation. The descriptions are not intended to limit the scope of the technology to the particular forms set forth herein. Thus, the breadth and scope of a preferred embodiment should not be limited by any of the above-described exemplary embodiments. It should be understood that the above description is illustrative and not restrictive. To the contrary, the present descriptions are intended to cover such alternatives, modifications, and equivalents as may be included within the spirit and scope of the technology as defined by the appended claims and otherwise appreciated by one of ordinary skill in the art. The scope of the technology should, therefore, be determined not with reference to the description, but instead should be determined with reference to the appended claims along with their full scope of equivalents.