The accompanying drawings illustrate a number of exemplary embodiments and are a part of the specification. Together with the following description, these drawings demonstrate and explain various principles of the present disclosure.
Throughout the drawings, identical reference characters and descriptions indicate similar, but not necessarily identical, elements. While the exemplary embodiments described herein are susceptible to various modifications and alternative forms, specific embodiments have been shown by way of example in the drawings and will be described in detail herein. However, the exemplary embodiments described herein are not intended to be limited to the particular forms disclosed. Rather, the present disclosure covers all modifications, equivalents, and alternatives falling within the scope of the appended claims.
The present disclosure is generally directed to digital pixel sensor (DPS) chips that allow for incorporation of machine learning (ML) chips connected via micro through-silicon vias (uTSVs). Traditional on-sensor computing systems such as DPS chipsets typically include three layers: 1) a photodiode layer, 2) a combined analog-to-digital converter (ADC) and memory layer, and 3) a machine learning logic layer that has an ML processor. Within such conventional three-layer systems, image data are transferred from the second layer to the third layer via data transfer interfaces, such as Mobile Industry Processor Interface (MIPI) interfaces. These MIPI interfaces, however, have high area overhead on the second and third layers. Indeed, MIPI interfaces result in large amounts of “keep-out” space on the chips that is virtually unusable. Accordingly, chipset layout and total capacity to hold electrical components is limited in such traditional systems.
Moreover, these conventional systems typically consume high amounts of energy when transferring image data. Because the photodiode layer (layer one) is typically fabricated using older and larger technology nodes, as compared with current computing technologies that allow computer processors to be produced at a much smaller scale on much smaller nodes, these traditional systems consume more power when performing processing operations. Moreover, the large area requirements of MIPI transfer interfaces limit the number of transistors and other electronic components that can be fit onto a given third-layer ML processor.
In contrast, the embodiments described herein may implement dense micro through-silicon vias to communicate image data from an image sensing layer to a machine learning processing layer. By using uTSVs, the embodiments described herein may transfer image data from a second layer of a DPS to the third layer with high amounts of parallelism. To provide this parallelism, the systems herein may be configured to divide a pixel array on the first layer (e.g., an image sensing, photodiode layer) into multiple smaller blocks or sections. These systems may then be configured to pack the image data of each section together for transfer through uTSVs to a third layer (e.g., a machine learning processing layer). The ML processing layer may, itself, be divided into multiple cores, where each core on the ML layer is equipped with a dedicated uTSV array to receive image data from the sensing layer and process that data in a parallel and distributed manner. By providing parallelism in this manner, the systems herein may process image data much faster and with less power than traditional systems. Moreover, the high amount of bandwidth provided by uTSVs may allow new types of smart machine learning applications to be processed on the ML processing layer. These concepts will be described further below with regard to
Features from any of the embodiments described herein may be used in combination with one another in accordance with the general principles described herein. These and other embodiments, features, and advantages will be more fully understood upon reading the following detailed description in conjunction with the accompanying drawings and claims.
The first layer 101 may be connected to a second layer 104 that includes one or more image processing components 105 configured to process the light 103 detected by the digital pixel sensors 102. The image processing components 105 of the second layer 104 may include, but are not limited to, analog-to-digital converters (ADCs 106), encoders 107, memory 108 (e.g., random access memory (RAM) or static RAM (SRAM)), transmitters 109 (or transceivers), and other image processing components. In some embodiments, each digital pixel sensor 102 may have its own corresponding ADC on the second layer 104. These ADCs may be configured to convert the detected light 103 (in an analog value) to a digital value that can be stored in memory 108 and/or transmitted to an external data store (e.g., a cloud data store). The encoder 107 may be configured to encode the digital data associated with each of the digital pixels of the first layer 101. The encoding may include converting the digital pixel values to a specified data format or into a specific data structure. The image processing components 105 of the second layer 104 may be connected to the third layer 112 of the system 100 via one or more micro through-silicon vias 110.
Micro through-silicon vias (uTSVs) 110 may refer to vertical electrical connections that pass directly through silicon (or other substrate) between layers of an integrated circuit. Micro through-silicon vias 110 may be much smaller than traditional through-silicon vias and may have a much smaller footprint on a processing chip. Indeed, each use of a TSV on a processing chip results in an area on that chip that is a “keep-out zone” or “dead zone” on which other electronic components (e.g., transistors, capacitors, diodes, etc.) or traces may not be placed. Traditional TSVs typically result in large dead zones on processing chips, reducing the processing power and efficiency of those chips. Micro TSVs, however, have much smaller dead zones and, as such, many more can be implemented on a chip such as a machine learning processor.
Third layer 112 of the system 100 may include one or more such processing chips in the machine learning (ML) hardware processing components 111. These ML hardware processing components 111 may be communicatively connected to the image processing components 105 of the second layer 104 via one or more uTSVs 110. At least in some embodiments, because uTSVs 110 are used to transfer the digital data produced by the ADCs 106 and/or encoders 107, and because many more uTSVs can be used than in traditional scenarios with TSVs, the digital data may be transferred over in parallel and may be processed in parallel using multiple machine learning cores. In some cases, the digital image data from different quadrants of an image may be sent to different ML chips and/or ML cores within those chips. Each of these ML cores may thus, in parallel, process image data as it comes in and may apply machine learning logic to changes between pixels between subsequent image frames. These embodiments will be explained in greater detail below with regard to
In some cases, the second layer 204 may be divided into multiple segments, including two, three, four, or more different segments. In the embodiment shown in
The computing environment 300 of
Accordingly, the embodiments herein may be configured to divide the digital pixel array of the first layer into multiple blocks and then pack the image data of each block together for transfer. For instance, a pixel array 301 may be divided into multiple blocks and fed to an encoder 302. The digital data may then be shared across one or more uTSV channels 303 using time multiplexing or some other type of multiplexing or other transmission method that may speed up the transfer. In some cases, the data may be encoded prior to transfer and, at least in some cases, the data may be compressed prior to transfer using various compression algorithms. On the third layer, the data may be unencoded by decoder 304 (and/or uncompressed, if applicable), resulting in decoded pixel data 305. This decoded pixel data may then be used by the ML processing hardware to identify objects, to track objects, or to perform other types of processing on an image or set of images. In some embodiments, while the third layer is decoding or otherwise processing data, the first and/or second layer may process the next batch of input (e.g., the next image frame). In some examples, processing data concurrently in this fashion may be faster and/or expand the effective quantity of available data storage.
In some cases, different numbers of uTSVs may be modeled using a specific number of photodiode inputs and a specific number of ML processor outputs. Using such modeling, the optimal number of uTSVs may be identified such that the area taken up by uTSV junctures on the second and third layers is not so high that it impedes the amount of processing that can be performed by the ML chips, but is large enough to provide bandwidth sufficient to transfer the available pixel data from the digital pixel sensors. Such simulation may also take into account any multiplexing, encoding, compression, etc. to arrive at an optimal block granularity that is specific to each DPS device, according to its photodiode detectors, image processing hardware, and ML processing hardware.
In some embodiments, each ML core 402A-402D may be equipped with a dedicated uTSV array to receive data from the sensing layer (e.g., layer one). In some case, each ML core 402A-402D may include multiple uTSV for data transfer. The image data 410 may be transferred from the second layer to the third layer in a parallel and distributed manner using the dedicated uTSV array (e.g., using uTSVs 405). As shown in
At step 510 of method 500, the method may include forming a first layer that includes multiple digital pixel sensors configured to detect light. These digital pixel sensors may be CCD's, CMOS sensors, or other types of image sensing hardware. The digital pixel sensors may sense light wavelengths and intensities, and may pass that light data to image processing components on a second layer. At step 520, the manufacturing method may include forming a second layer adhered to the first layer, where the second layer includes one or more image processing components configured to process the light detected by the digital pixel sensors. The image processing components may include ADCs, encoders, transmitters, memory, or other electronic components. These components may be configured to process the incoming light data from the digital pixel sensors and prepare it (e.g., format or encode the light data) for processing by a machine learning processor.
Step 530 of method 500 may include forming a third layer adhered to the second layer that includes one or more machine learning (ML) hardware processing components. The image processing components of the second layer may be communicatively connected to the ML hardware processing components of the third layer via one or more uTSVs. In some cases, as shown in
The machine learning models described herein may be or may implement substantially any type of machine learning to train a model. At least in some cases, the machine learning module may be an inferential model. As used herein, the term “inferential model” may refer to pure statistical models, pure machine learning models, or any combination of statistical and machine learning models. Such inferential models may include neural networks such as recurrent neural networks. In some embodiments, the recurrent neural network may be a long short-term memory (LSTM) neural network. Such recurrent neural networks are not limited to LSTM neural networks, and may have any other suitable architecture. For example, in some embodiments, the neural network may be a fully recurrent neural network, a gated recurrent neural network, a recursive neural network, a Hopfield neural network, an associative memory neural network, an Elman neural network, a Jordan neural network, an echo state neural network, a second order recurrent neural network, and/or any other suitable type of recurrent neural network. In other embodiments, neural networks that are not recurrent neural networks may be used. For example, deep neural networks, convolutional neural networks, and/or feedforward neural networks, may be used. In some implementations, the inferential model may be an unsupervised machine learning model, e.g., where previous data (on which the inferential model was previously trained) is not required. In other cases, the ML models may implement feedback loops to learn and improve over time.
As shown in computing environment 600 of
In some cases, for example, as shown in embodiment 700 of
The ML processor analysis may include a wide variety of tasks, including recognizing objects within images, tracking objects across images, facial recognition, and other tasks. Each ML processing core 702-705 may perform its own individual analysis, or may work in conjunction with one, two, three, or more other ML processing cores to determine, for instance, whether an object has moved between two subsequent image frames. In
In
In some cases, the ML processing cores may be configured to track the recognized objects across the subsequent images 801A-801C without neighboring pixel data from other ML processing cores. Thus, for instance, ML processing core 704 may individually recognize and track objects 803 and 804, while ML processing core 705 may individually recognize and track objects 804 and 805 without accessing pixel data from other image sections or from other ML processing cores. In this example, because no changes occur in block 801A & 802, the processing on these image blocks may be skipped, and the corresponding processing cores may be powered down to save energy. Accordingly, each ML processing core may operate individually, and may track the movement of recognized objects until those objects are no longer present in the image. This individual processing may be fed by individual data feeds through dedicated uTSVs at each ML processing core. Each dedicated uTSV may handle the image data for one or more sections of an image. By dividing up an image and then processing each section in parallel on different ML processing cores, the embodiments herein may process images much faster than traditional systems. Moreover, using global memory (e.g., 401A-401D of
This framework may be implemented for many different functions, including facial recognition, location tracking, or other functions. Moreover, it will be understood that images may be divided into many different sections in different scenarios. For instance, if an ML processor had eight processing cores, images may be divided into eight sections and correspondingly transmitted through eight uTSVs to the eight ML cores, respectively. Still further, some ML cores may have higher processing capacity and may have two, three, or more dedicated uTSVs for data transfer, while other systems may implement a 1:1 ration between uTSVs and ML processing cores. The embodiments described herein may determine the optimal number of uTSVs for each system, to where the uTSV for each ML processing core provides a high enough bandwidth to keep the core fully utilized, but to where there aren't so many uTSVs that processor core space on the ML hardware processor is taken up by uTSV attachment points. This balance includes tradeoffs that may be assessed and determined for each specific digital pixel sensor system.
Accordingly, a system may be provided that may increase digital pixel sensor output by implementing uTSVs for different ML processing cores. These uTSVs may provide a sufficiently high bandwidth stream to keep the ML processing cores fully utilized, while taking up a minimal amount of space on the second and third layers of the digital pixel sensor system. Moreover, the ML processing cores may be implemented alone or in conjunction with each other to execute smart applications including machine learning-based image processing applications.
Example 1: A system may include a first layer that includes a plurality of digital pixel sensors configured to detect light, a second layer that includes one or more image processing components configured to process the light detected by the digital pixel sensors, and a third layer that includes one or more machine learning (ML) hardware processing components, where the image processing components of the second layer are communicatively connected to the ML hardware processing components of the third layer via one or more micro through-silicon vias (uTSVs).
Example 2: The system of Example 1, wherein the image processing components of the second layer comprise at least one of an analog-to-digital converter (ADC), an encoder, a memory, or a transmitter.
Example 3:The system of any of Examples 1 and 2, wherein the ML hardware processing components control capture of consecutive image frames.
Example 4: The system of any of Examples 1-3, wherein the ML hardware processing components are configured to execute one or more smart applications.
Example 5: The system of any of Examples 1-4, wherein at least one of the one or more smart applications comprises a smart image processing application.
Example 6: The system of any of Examples 1-5, wherein the ML hardware processing components include a plurality of processing cores.
Example 7: The system of any of Examples 1-6, wherein one or more of the processing cores are configured to perform local processing on a specified portion of an image captured by the plurality of digital pixel sensors.
Example 8: The system of any of Examples 1-7, wherein the ML processing cores share information to perform centralized processing that stitches the image captured by the plurality of digital pixel sensors together.
Example 9: The system of any of Examples 1-8, wherein one or more of the processing cores is configured to recognize one or more objects within a specified region of an image.
Example 10: The system of any of Examples 1-9, wherein one or more of the processing cores is configured to track the one or more recognized objects across a plurality of subsequent images.
Example 11: The system of any of Examples 1-10, wherein the one or more recognized objects are tracked across the plurality of subsequent images without neighboring pixel data from other ML processing cores.
Example 12: The system of any of Examples 1-11, wherein the one or more recognized objects are processed by the same ML processing core until the objects are no longer present in the image.
Example 13: The system of any of Examples 1-12, wherein the one or more recognized objects are processed by a plurality of ML processing cores upon determining that the objects have moved between images.
Example 14: An electronic device may include a first layer that includes a plurality of digital pixel sensors configured to detect light, a second layer that includes one or more image processing components configured to process the light detected by the digital pixel sensors, and a third layer that includes one or more machine learning (ML) hardware processing components, wherein the image processing components of the second layer are communicatively connected to the ML hardware processing components of the third layer via one or more uTSVs.
Example 15: The electronic device of Example 14, wherein the image processing components of the second layer are configured to combine a plurality of image bits into a combined group of image data bits.
Example 16: The electronic device of any of Examples 14 and 15, wherein the combined group of image data bits is transferred as a group to the ML hardware processing components of the third layer through the uTSVs.
Example 17: The electronic device of any of Examples 14-16, wherein the combined group of image data bits is transferred as a group to the ML hardware processing components using time-based multiplexing.
Example 18: The electronic device of any of Examples 14-17, wherein at least one of the ML hardware processing components comprises a local memory buffer.
Example 19: The electronic device of any of Examples 14-18, wherein image data stored in the local memory buffer is processed using a systolic array whose parameters are prestored in the local memory buffer.
Example 20: A method of manufacture may include forming a first layer that includes a plurality of digital pixel sensors configured to detect light, forming a second layer adhered to the first layer that includes one or more image processing components configured to process the light detected by the digital pixel sensors, and forming a third layer adhered to the second layer that includes one or more machine learning (ML) hardware processing components, wherein the image processing components of the second layer are communicatively connected to the ML hardware processing components of the third layer via one or more uTSVs.
The digital pixel sensor systems described herein may be used in a variety of different computing environments, including artificial reality environments. Artificial-reality systems may be implemented in a variety of different form factors and configurations. Some artificial-reality systems may be designed to work without near-eye displays (NEDs). Other artificial-reality systems may include an NED that also provides visibility into the real world (such as, e.g., augmented-reality system 900 in
Turning to
In some embodiments, augmented-reality system 900 may include one or more sensors, such as sensor 940. Sensor 940 may generate measurement signals in response to motion of augmented-reality system 900 and may be located on substantially any portion of frame 910. Sensor 940 may represent one or more of a variety of different sensing mechanisms, such as a position sensor, an inertial measurement unit (IMU), a depth camera assembly, a structured light emitter and/or detector, or any combination thereof. In some embodiments, augmented-reality system 900 may or may not include sensor 940 or may include more than one sensor. In embodiments in which sensor 940 includes an IMU, the IMU may generate calibration data based on measurement signals from sensor 940. Examples of sensor 940 may include, without limitation, accelerometers, gyroscopes, magnetometers, other suitable types of sensors that detect motion, sensors used for error correction of the IMU, or some combination thereof.
In some examples, augmented-reality system 900 may also include a microphone array with a plurality of acoustic transducers 920(A)-920(J), referred to collectively as acoustic transducers 920. Acoustic transducers 920 may represent transducers that detect air pressure variations induced by sound waves. Each acoustic transducer 920 may be configured to detect sound and convert the detected sound into an electronic format (e.g., an analog or digital format). The microphone array in
In some embodiments, one or more of acoustic transducers 920(A)-(J) may be used as output transducers (e.g., speakers). For example, acoustic transducers 920(A) and/or 920(B) may be earbuds or any other suitable type of headphone or speaker.
The configuration of acoustic transducers 920 of the microphone array may vary. While augmented-reality system 900 is shown in
Acoustic transducers 920(A) and 920(B) may be positioned on different parts of the user's ear, such as behind the pinna, behind the tragus, and/or within the auricle or fossa. Or, there may be additional acoustic transducers 920 on or surrounding the ear in addition to acoustic transducers 920 inside the ear canal. Having an acoustic transducer 920 positioned next to an ear canal of a user may enable the microphone array to collect information on how sounds arrive at the ear canal. By positioning at least two of acoustic transducers 920 on either side of a user's head (e.g., as binaural microphones), augmented-reality device 900 may simulate binaural hearing and capture a 3D stereo sound field around about a user's head. In some embodiments, acoustic transducers 920(A) and 920(B) may be connected to augmented-reality system 900 via a wired connection 930, and in other embodiments acoustic transducers 920(A) and 920(B) may be connected to augmented-reality system 900 via a wireless connection (e.g., a Bluetooth connection). In still other embodiments, acoustic transducers 920(A) and 920(B) may not be used at all in conjunction with augmented-reality system 900.
Acoustic transducers 920 on frame 910 may be positioned in a variety of different ways, including along the length of the temples, across the bridge, above or below display devices 915(A) and 915(B), or some combination thereof. Acoustic transducers 920 may also be oriented such that the microphone array is able to detect sounds in a wide range of directions surrounding the user wearing the augmented-reality system 900. In some embodiments, an optimization process may be performed during manufacturing of augmented-reality system 900 to determine relative positioning of each acoustic transducer 920 in the microphone array.
In some examples, augmented-reality system 900 may include or be connected to an external device (e.g., a paired device), such as neckband 905. Neckband 905 generally represents any type or form of paired device. Thus, the following discussion of neckband 905 may also apply to various other paired devices, such as charging cases, smart watches, smart phones, wrist bands, other wearable devices, hand-held controllers, tablet computers, laptop computers, other external compute devices, etc.
As shown, neckband 905 may be coupled to eyewear device 902 via one or more connectors. The connectors may be wired or wireless and may include electrical and/or non-electrical (e.g., structural) components. In some cases, eyewear device 902 and neckband 905 may operate independently without any wired or wireless connection between them. While
Pairing external devices, such as neckband 905, with augmented-reality eyewear devices may enable the eyewear devices to achieve the form factor of a pair of glasses while still providing sufficient battery and computation power for expanded capabilities. Some or all of the battery power, computational resources, and/or additional features of augmented-reality system 900 may be provided by a paired device or shared between a paired device and an eyewear device, thus reducing the weight, heat profile, and form factor of the eyewear device overall while still retaining desired functionality. For example, neckband 905 may allow components that would otherwise be included on an eyewear device to be included in neckband 905 since users may tolerate a heavier weight load on their shoulders than they would tolerate on their heads. Neckband 905 may also have a larger surface area over which to diffuse and disperse heat to the ambient environment. Thus, neckband 905 may allow for greater battery and computation capacity than might otherwise have been possible on a stand-alone eyewear device. Since weight carried in neckband 905 may be less invasive to a user than weight carried in eyewear device 902, a user may tolerate wearing a lighter eyewear device and carrying or wearing the paired device for greater lengths of time than a user would tolerate wearing a heavy standalone eyewear device, thereby enabling users to more fully incorporate artificial-reality environments into their day-to-day activities.
Neckband 905 may be communicatively coupled with eyewear device 902 and/or to other devices. These other devices may provide certain functions (e.g., tracking, localizing, depth mapping, processing, storage, etc.) to augmented-reality system 900. In the embodiment of
Acoustic transducers 920(I) and 920(J) of neckband 905 may be configured to detect sound and convert the detected sound into an electronic format (analog or digital). In the embodiment of
Controller 925 of neckband 905 may process information generated by the sensors on neckband 905 and/or augmented-reality system 900. For example, controller 925 may process information from the microphone array that describes sounds detected by the microphone array. For each detected sound, controller 925 may perform a direction-of-arrival (DOA) estimation to estimate a direction from which the detected sound arrived at the microphone array. As the microphone array detects sounds, controller 925 may populate an audio data set with the information. In embodiments in which augmented-reality system 900 includes an inertial measurement unit, controller 925 may compute all inertial and spatial calculations from the IMU located on eyewear device 902. A connector may convey information between augmented-reality system 900 and neckband 905 and between augmented-reality system 900 and controller 925. The information may be in the form of optical data, electrical data, wireless data, or any other transmittable data form. Moving the processing of information generated by augmented-reality system 900 to neckband 905 may reduce weight and heat in eyewear device 902, making it more comfortable to the user.
Power source 935 in neckband 905 may provide power to eyewear device 902 and/or to neckband 905. Power source 935 may include, without limitation, lithium ion batteries, lithium-polymer batteries, primary lithium batteries, alkaline batteries, or any other form of power storage. In some cases, power source 935 may be a wired power source. Including power source 935 on neckband 905 instead of on eyewear device 902 may help better distribute the weight and heat generated by power source 935.
As noted, some artificial-reality systems may, instead of blending an artificial reality with actual reality, substantially replace one or more of a user's sensory perceptions of the real world with a virtual experience. One example of this type of system is a head-worn display system, such as virtual-reality system 1000 in
Artificial-reality systems may include a variety of types of visual feedback mechanisms. For example, display devices in augmented-reality system 900 and/or virtual-reality system 1000 may include one or more liquid crystal displays (LCDs), light emitting diode (LED) displays, organic LED (OLED) displays, digital light project (DLP) micro-displays, liquid crystal on silicon (LCoS) micro-displays, and/or any other suitable type of display screen. These artificial-reality systems may include a single display screen for both eyes or may provide a display screen for each eye, which may allow for additional flexibility for varifocal adjustments or for correcting a user's refractive error. Some of these artificial-reality systems may also include optical subsystems having one or more lenses (e.g., conventional concave or convex lenses, Fresnel lenses, adjustable liquid lenses, etc.) through which a user may view a display screen. These optical subsystems may serve a variety of purposes, including to collimate (e.g., make an object appear at a greater distance than its physical distance), to magnify (e.g., make an object appear larger than its actual size), and/or to relay (to, e.g., the viewer's eyes) light. These optical subsystems may be used in a non-pupil-forming architecture (such as a single lens configuration that directly collimates light but results in so-called pincushion distortion) and/or a pupil-forming architecture (such as a multi-lens configuration that produces so-called barrel distortion to nullify pincushion distortion).
In addition to or instead of using display screens, some of the artificial-reality systems described herein may include one or more projection systems. For example, display devices in augmented-reality system 900 and/or virtual-reality system 1000 may include micro-LED projectors that project light (using, e.g., a waveguide) into display devices, such as clear combiner lenses that allow ambient light to pass through. The display devices may refract the projected light toward a user's pupil and may enable a user to simultaneously view both artificial-reality content and the real world. The display devices may accomplish this using any of a variety of different optical components, including waveguide components (e.g., holographic, planar, diffractive, polarized, and/or reflective waveguide elements), light-manipulation surfaces and elements (such as diffractive, reflective, and refractive elements and gratings), coupling elements, etc. Artificial-reality systems may also be configured with any other suitable type or form of image projection system, such as retinal projectors used in virtual retina displays.
The artificial-reality systems described herein may also include various types of computer vision components and subsystems. For example, augmented-reality system 900 and/or virtual-reality system 1000 may include one or more optical sensors, such as two-dimensional (2D) or 3D cameras, structured light transmitters and detectors, time-of-flight depth sensors, single-beam or sweeping laser rangefinders, 3D LiDAR sensors, and/or any other suitable type or form of optical sensor. An artificial-reality system may process data from one or more of these sensors to identify a location of a user, to map the real world, to provide a user with context about real-world surroundings, and/or to perform a variety of other functions.
The artificial-reality systems described herein may also include one or more input and/or output audio transducers. Output audio transducers may include voice coil speakers, ribbon speakers, electrostatic speakers, piezoelectric speakers, bone conduction transducers, cartilage conduction transducers, tragus-vibration transducers, and/or any other suitable type or form of audio transducer. Similarly, input audio transducers may include condenser microphones, dynamic microphones, ribbon microphones, and/or any other type or form of input transducer. In some embodiments, a single transducer may be used for both audio input and audio output.
In some embodiments, the artificial-reality systems described herein may also include tactile (i.e., haptic) feedback systems, which may be incorporated into headwear, gloves, body suits, handheld controllers, environmental devices (e.g., chairs, floormats, etc.), and/or any other type of device or system. Haptic feedback systems may provide various types of cutaneous feedback, including vibration, force, traction, texture, and/or temperature. Haptic feedback systems may also provide various types of kinesthetic feedback, such as motion and compliance. Haptic feedback may be implemented using motors, piezoelectric actuators, fluidic systems, and/or a variety of other types of feedback mechanisms. Haptic feedback systems may be implemented independent of other artificial-reality devices, within other artificial-reality devices, and/or in conjunction with other artificial-reality devices.
By providing haptic sensations, audible content, and/or visual content, artificial-reality systems may create an entire virtual experience or enhance a user's real-world experience in a variety of contexts and environments. For instance, artificial-reality systems may assist or extend a user's perception, memory, or cognition within a particular environment. Some systems may enhance a user's interactions with other people in the real world or may enable more immersive interactions with other people in a virtual world. Artificial-reality systems may also be used for educational purposes (e.g., for teaching or training in schools, hospitals, government organizations, military organizations, business enterprises, etc.), entertainment purposes (e.g., for playing video games, listening to music, watching video content, etc.), and/or for accessibility purposes (e.g., as hearing aids, visual aids, etc.). The embodiments disclosed herein may enable or enhance a user's artificial-reality experience in one or more of these contexts and environments and/or in other contexts and environments.
As noted, augmented-reality systems 900 and 1000 may be used with a variety of other types of devices to provide a more compelling artificial-reality experience. These devices may be haptic interfaces with transducers that provide haptic feedback and/or that collect haptic information about a user's interaction with an environment. The artificial-reality systems disclosed herein may include various types of haptic interfaces that detect or convey various types of haptic information, including tactile feedback (e.g., feedback that a user detects via nerves in the skin, which may also be referred to as cutaneous feedback) and/or kinesthetic feedback (e.g., feedback that a user detects via receptors located in muscles, joints, and/or tendons).
Haptic feedback may be provided by interfaces positioned within a user's environment (e.g., chairs, tables, floors, etc.) and/or interfaces on articles that may be worn or carried by a user (e.g., gloves, wristbands, etc.). As an example,
One or more vibrotactile devices 1140 may be positioned at least partially within one or more corresponding pockets formed in textile material 1130 of vibrotactile system 1100. Vibrotactile devices 1140 may be positioned in locations to provide a vibrating sensation (e.g., haptic feedback) to a user of vibrotactile system 1100. For example, vibrotactile devices 1140 may be positioned against the user's finger(s), thumb, or wrist, as shown in
A power source 1150 (e.g., a battery) for applying a voltage to the vibrotactile devices 1140 for activation thereof may be electrically coupled to vibrotactile devices 1140, such as via conductive wiring 1152. In some examples, each of vibrotactile devices 1140 may be independently electrically coupled to power source 1150 for individual activation. In some embodiments, a processor 1160 may be operatively coupled to power source 1150 and configured (e.g., programmed) to control activation of vibrotactile devices 1140.
Vibrotactile system 1100 may be implemented in a variety of ways. In some examples, vibrotactile system 1100 may be a standalone system with integral subsystems and components for operation independent of other devices and systems. As another example, vibrotactile system 1100 may be configured for interaction with another device or system 1170. For example, vibrotactile system 1100 may, in some examples, include a communications interface 1180 for receiving and/or sending signals to the other device or system 1170. The other device or system 1170 may be a mobile device, a gaming console, an artificial-reality (e.g., virtual-reality, augmented-reality, mixed-reality) device, a personal computer, a tablet computer, a network device (e.g., a modem, a router, etc.), a handheld controller, etc. Communications interface 1180 may enable communications between vibrotactile system 1100 and the other device or system 1170 via a wireless (e.g., Wi-Fi, Bluetooth, cellular, radio, etc.) link or a wired link. If present, communications interface 1180 may be in communication with processor 1160, such as to provide a signal to processor 1160 to activate or deactivate one or more of the vibrotactile devices 1140.
Vibrotactile system 1100 may optionally include other subsystems and components, such as touch-sensitive pads 1190, pressure sensors, motion sensors, position sensors, lighting elements, and/or user interface elements (e.g., an on/off button, a vibration control element, etc.). During use, vibrotactile devices 1140 may be configured to be activated for a variety of different reasons, such as in response to the user's interaction with user interface elements, a signal from the motion or position sensors, a signal from the touch-sensitive pads 1190, a signal from the pressure sensors, a signal from the other device or system 1170, etc.
Although power source 1150, processor 1160, and communications interface 1180 are illustrated in
Haptic wearables, such as those shown in and described in connection with
Head-mounted display 1202 generally represents any type or form of virtual-reality system, such as virtual-reality system 1000 in
While haptic interfaces may be used with virtual-reality systems, as shown in
One or more of band elements 1332 may include any type or form of actuator suitable for providing haptic feedback. For example, one or more of band elements 1332 may be configured to provide one or more of various types of cutaneous feedback, including vibration, force, traction, texture, and/or temperature. To provide such feedback, band elements 1332 may include one or more of various types of actuators. In one example, each of band elements 1332 may include a vibrotactor (e.g., a vibrotactile actuator) configured to vibrate in unison or independently to provide one or more of various types of haptic sensations to a user. Alternatively, only a single band element or a subset of band elements may include vibrotactors.
Haptic devices 1110, 1120, 1204, and 1330 may include any suitable number and/or type of haptic transducer, sensor, and/or feedback mechanism. For example, haptic devices 1110, 1120, 1204, and 1330 may include one or more mechanical transducers, piezoelectric transducers, and/or fluidic transducers. Haptic devices 1110, 1120, 1204, and 1330 may also include various combinations of different types and forms of transducers that work together or independently to enhance a user's artificial-reality experience. In one example, each of band elements 1332 of haptic device 1330 may include a vibrotactor (e.g., a vibrotactile actuator) configured to vibrate in unison or independently to provide one or more of various types of haptic sensations to a user.
As detailed above, the computing devices and systems described and/or illustrated herein broadly represent any type or form of computing device or system capable of executing computer-readable instructions, such as those contained within the modules described herein. In their most basic configuration, these computing device(s) may each include at least one memory device and at least one physical processor.
In some examples, the term “memory device” generally refers to any type or form of volatile or non-volatile storage device or medium capable of storing data and/or computer-readable instructions. In one example, a memory device may store, load, and/or maintain one or more of the modules described herein. Examples of memory devices include, without limitation, Random Access Memory (RAM), Read Only Memory (ROM), flash memory, Hard Disk Drives (HDDs), Solid-State Drives (SSDs), optical disk drives, caches, variations or combinations of one or more of the same, or any other suitable storage memory.
In some examples, the term “physical processor” generally refers to any type or form of hardware-implemented processing unit capable of interpreting and/or executing computer-readable instructions. In one example, a physical processor may access and/or modify one or more modules stored in the above-described memory device. Examples of physical processors include, without limitation, microprocessors, microcontrollers, Central Processing Units (CPUs), Field-Programmable Gate Arrays (FPGAs) that implement softcore processors, Application-Specific Integrated Circuits (ASICs), portions of one or more of the same, variations or combinations of one or more of the same, or any other suitable physical processor.
Although illustrated as separate elements, the modules described and/or illustrated herein may represent portions of a single module or application. In addition, in certain embodiments one or more of these modules may represent one or more software applications or programs that, when executed by a computing device, may cause the computing device to perform one or more tasks. For example, one or more of the modules described and/or illustrated herein may represent modules stored and configured to run on one or more of the computing devices or systems described and/or illustrated herein. One or more of these modules may also represent all or portions of one or more special-purpose computers configured to perform one or more tasks.
In addition, one or more of the modules described herein may transform data, physical devices, and/or representations of physical devices from one form to another. For example, one or more of the modules recited herein may receive data to be transformed, transform the data, output a result of the transformation to form a first layer, use the result of the transformation to form second and third layers, and store the result of the transformation to manufacture a multi-layered electronic device. Additionally or alternatively, one or more of the modules recited herein may transform a processor, volatile memory, non-volatile memory, and/or any other portion of a physical computing device from one form to another by executing on the computing device, storing data on the computing device, and/or otherwise interacting with the computing device.
In some embodiments, the term “computer-readable medium” generally refers to any form of device, carrier, or medium capable of storing or carrying computer-readable instructions. Examples of computer-readable media include, without limitation, transmission-type media, such as carrier waves, and non-transitory-type media, such as magnetic-storage media (e.g., hard disk drives, tape drives, and floppy disks), optical-storage media (e.g., Compact Disks (CDs), Digital Video Disks (DVDs), and BLU-RAY disks), electronic-storage media (e.g., solid-state drives and flash media), and other distribution systems.
The process parameters and sequence of the steps described and/or illustrated herein are given by way of example only and can be varied as desired. For example, while the steps illustrated and/or described herein may be shown or discussed in a particular order, these steps do not necessarily need to be performed in the order illustrated or discussed. The various exemplary methods described and/or illustrated herein may also omit one or more of the steps described or illustrated herein or include additional steps in addition to those disclosed.
The preceding description has been provided to enable others skilled in the art to best utilize various aspects of the exemplary embodiments disclosed herein. This exemplary description is not intended to be exhaustive or to be limited to any precise form disclosed. Many modifications and variations are possible without departing from the spirit and scope of the present disclosure. The embodiments disclosed herein should be considered in all respects illustrative and not restrictive. Reference should be made to the appended claims and their equivalents in determining the scope of the present disclosure.
Unless otherwise noted, the terms “connected to” and “coupled to” (and their derivatives), as used in the specification and claims, are to be construed as permitting both direct and indirect (i.e., via other elements or components) connection. In addition, the terms “a” or “an,” as used in the specification and claims, are to be construed as meaning “at least one of.” Finally, for ease of use, the terms “including” and “having” (and their derivatives), as used in the specification and claims, are interchangeable with and have the same meaning as the word “comprising.”
This application claims the benefit of U.S. Provisional Application No. 63/227,214, filed 29 Jul. 2021, the disclosures of each of which are incorporated, in their entirety, by this reference.
Number | Date | Country | |
---|---|---|---|
63227214 | Jul 2021 | US |