The disclosure relates to extended reality (XR) technology. More particularly, the disclosure relates to harmonizing perceptual quality of multiple applications in an XR environment.
In general, extended reality (XR) is a combination of human and computer-generated graphics interaction. The XR creates a state of immersive digital experience in which physical and digital objects coexist in real time. Further, the XR includes augmented reality (AR), virtual reality (VR) and mixed reality (MR). The augmented reality (AR) is an integration of digital information with user's environment in real time. Some of the real time applications of AR used in mobile devices includes but not limited to e-commerce applications (e.g., Amikasa, three-dimensional (3D) floor planner, or the like), gaming applications (e.g., Pokemon Go), and travel and tourism applications (google translate app). The virtual reality (VR) is a complete computer-generated environment with scenes and objects which appears to be real, and making the user feel immersed in the virtual environment. The VR is used in one or more applications, such as gaming, healthcare, education and the like. In addition, the mixed reality is combination of augmented reality and virtual reality in which the physical reality and digital contents are combined such that it enables the interaction between real-world and virtual objects. The one or more application of AR, VR and MR can be used in mobile devices and VR devices respectively. The VR device is a head-mounted device that provides virtual reality for the wearer. In existing systems, the XR devices is capable of running multiple applications simultaneously. Further, the multiple applications can be viewed simultaneously and the user can interact concurrently with multiple applications running on the XR device. Further, each of the multiple applications can be streamed with different qualities. However, using multiple applications in the XR device with different qualities leads to poor user experience and reduced immersiveness by the user using the XR device. Thus, there is a need for an improved method and system to achieve similar quality of streams for multiple applications running on the XR device and hence improve the user experience Hence, it is desired to address the above-mentioned disadvantages or other shortcomings or at least provide a useful alternative.
The above information is presented as background information only to assist with an understanding of the disclosure. No determination has been made, and no assertion is made, as to whether any of the above might be applicable as prior art with regard to the disclosure.
Aspects of the disclosure are to address at least the above-mentioned problems and/or disadvantages and to provide at least the advantages described below. Accordingly, an aspect of the disclosure is to provide a method of harmonizing perceptual quality of multiple applications in an extended reality (XR) environment.
Another aspect of the disclosure is to determine and compare the perceptual quality scores to perform parameter renegotiation for harmonizing the quality of streams of multiple applications running on XR device.
Another aspect of the disclosure is to determine and compare the perceptual quality scores to post data processing for harmonizing the quality of streams of multiple applications running on XR device.
Another aspect of the disclosure is to achieve similar quality scores for multiple streams of multiple applications to provide an even and smooth user observation while using XR devices.
Another aspect of the disclosure is to enhance the user immersiveness while using multiple applications in the XR device.
Another aspect of the disclosure is to display the multiple streams simultaneously for which the quality of streams is even for multiple steams associated with multiple application.
Additional aspects will be set forth in part in the description which follows and, in part, will be apparent from the description, or may be learned by practice of the presented embodiments.
In accordance with an aspect of the disclosure, a method of harmonizing perceptual quality of multiple applications in an extended reality (XR) environment is provided. The method includes receiving, by an XR device, at least one media stream from each application among plurality of applications available in the XR device, determining a perceptual quality score for media stream received from each application of the plurality of applications, determining at least one candidate application with different media quality from plurality of applications based on perceptual score of at least one media stream received from each application, determining, a target perceptual quality score for the at least one media stream of the at least one candidate application based on the perceptual quality score of media stream received from each of the application of the plurality of applications and plurality of network parameters, and harmonizing at least one media parameter of the at least one media stream received from at least one candidate application based on the target perceptual quality score.
In an embodiment of the disclosure, the at least one media parameter includes a bit-rate, a codec, a resolution, a brightness, a frame-complexity, a sample rate, a channel count, a pitch, and a frame-rate.
In an embodiment of the disclosure, the determining of the target perceptual quality score for the at least one media stream of the at least one candidate application based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications and the plurality of network parameters includes determining, a bandwidth ratio based on the plurality of network parameters. Further, the method also includes assigning a weight factor to the at least one candidate application based on the bandwidth ratio. In addition, the method includes determining the target perceptual quality score for the at least one media stream of the at least one candidate application based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications, the plurality of network parameters, the assigned weight factor to the at least one candidate application, and the bandwidth ratio.
In an embodiment of the disclosure, the plurality of the network parameters includes at least one of a network bandwidth, network congestion, a network quality of service (QoS) identifier, the XR device capability and capability of XR device, capability of at least one server from which the at least one stream is received, and latency requirements.
In an embodiment of the disclosure, the determining of the perceptual quality score for the at least one media stream received from each application of the plurality of applications includes determining, the at least one media parameter of at least one media stream received from each application of the plurality of applications, and the plurality of XR parameters. Further, determining the perceptual quality score for the at least one media stream received from each application of the plurality of applications based on the at least one media parameter and the plurality of XR parameters.
In an embodiment of the disclosure, the plurality of XR parameters includes a XR rendering space, and user preferences in the XR rendering space, and wherein the user preferences in the XR rendering space includes a position of the user in the XR rendering space, user's observation axis in the position of the user in the XR rendering space, user's observation range in the position of the user in the XR rendering space, and user's field of view or focus in the position of the user in the XR rendering space.
In an embodiment of the disclosure, the harmonizing of the at least one media parameter of the at least one media stream received from the at least one candidate application based on the target perceptual quality score includes determining whether a capability of the XR device meets a capability threshold, the XR device has service negotiation capability, and the XR device has strict latency requirement. Further, performing one of harmonizing the at least one media parameter of the at least one media stream received from the at least one candidate application using an enhance content harmonizer, when the capability of the XR device meets the capability threshold, the XR device has service negotiation capability, and the strict latency requirement of the XR device is not strict, harmonizing the at least one media parameter of the at least one media stream received from the at least one candidate application using an On-Device content harmonizer, when the capability of the XR device meets the capability threshold, the XR device does not has service negotiation capability, and the strict latency requirement of the XR device is not strict. In addition, harmonizing the at least one media parameter of the at least one media stream received from the at least one candidate application using a service level content harmonizer, when the capability of the XR device does not meets the capability threshold, the XR device does not has service negotiation capability, and the strict latency requirement of the XR device is strict.
In an embodiment of the disclosure, the harmonizing of the at least one media parameter of the at least one media stream received from the at least one candidate application using the enhance content harmonizer includes transmitting a stream renegotiation request to at least one server associated with the at least one candidate application for harmonization of the at least one media stream, wherein the stream renegotiation request the target perceptual quality score required for the at least one stream of the at least one candidate application. Further, receiving a stream renegotiation response from the at least one server associated with the at least one candidate application accepting the harmonization of the at least one media stream. Thereafter, upscaling or downscaling, the at least one media parameter of the at least one media stream received from the at least one candidate application based on the target perceptual quality score when the stream renegotiation response is received from the at least one server, wherein the at least one media parameter of the at least one media stream received from the at least one candidate application is harmonized to change the perceptual quality score of the at least one candidate application to the target perceptual quality score.
In an embodiment of the disclosure, the harmonizing of the at least one media parameter of the at least one media stream received from the at least one candidate application using the on-device content harmonizer includes upscaling or downscaling, by the XR device, the at least one media parameter of the at least one media stream received from the at least one candidate application based on the target perceptual quality score, wherein the at least one media parameter of the at least one media stream received from the at least one candidate application is harmonized to change the perceptual quality score of the at least one candidate application to the target perceptual quality score.
In an embodiment of the disclosure, the harmonizing, by the XR device, of the at least one media parameter of the at least one media stream received from the at least one candidate application using the service level content harmonizer includes transmitting a stream renegotiation request to at least one server associated with the at least one candidate application for harmonization of the at least one media stream, wherein the stream renegotiation request includes at least one media parameter which needs to be modified for the at least one stream of the at least one candidate application. Further, receiving a stream renegotiation response from the at least one server associated with the at least one candidate application accepting the modification of the at least one media parameter of the at least one media stream of the at least one candidate application. Thereafter, modifying, by the at least one server, the at least one media parameter of the at least one media stream received from the at least one candidate application based on the target perceptual quality score, wherein the at least one media parameter of the at least one media stream received from the at least one candidate application is harmonized to change the perceptual quality score of the at least one candidate application to the target perceptual quality score. Furthermore, sending the at least one upscaled or downscaled media stream associated with the at least one candidate application to the XR device. Finally, rendering the at least one upscaled or downscaled media stream associated with the at least one candidate application.
In accordance with another aspect of the disclosure, an XR device of harmonizing perceptual quality of multiple applications in an XR environment is provided. The XR device includes one or more processors, a content harmonizer, and memory storing one or more computer programs including computer-executable instructions that, when executed by the one or more processors, cause the XR device to initially receive at least one media stream received from each application of a plurality of applications available in the XR device, determine a perceptual quality score for at least one media stream received from each application of the plurality of applications, determine at least one candidate application with different media quality from the plurality of applications based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications, determine a target perceptual quality score for the at least one media stream of the at least one candidate application based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications and a plurality of network parameters, and harmonize at least one media parameter of the at least one media stream received from the at least one candidate application based on the target perceptual quality score.
In accordance with another aspect of the disclosure, one or more non-transitory computer-readable storage media storing computer-executable instructions that, when executed by one or more processors of an XR device, cause the XR device to perform operations are provided. The operations include receiving, by the XR device, at least one media stream received from each application of a plurality of applications available in the XR device, determining, by the XR device, a perceptual quality score for at least one media stream received from each application of the plurality of applications, determining, by the XR device, at least one candidate application with different media quality from the plurality of applications based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications, determining, by the XR device, a target perceptual quality score for the at least one media stream of the at least one candidate application based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications, capability of XR device and a plurality of network parameters, and harmonizing, by the XR device, at least one media parameter of the at least one media stream received from the at least one candidate application based on the target perceptual quality score.
Other aspects advantages, and salient features of the disclosure will become apparent to those skilled in the art from the following detailed description, which, taken in conjunction with the annexed drawings discloses various embodiments of the disclosure.
The above and other aspects, features, and advantages of certain embodiments of the disclosure will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
Throughout the drawings, like reference numerals will be understood to refer to like parts, components, and structures.
The following description with reference to the accompanying drawings is provided to assist in a comprehensive understanding of various embodiments of the disclosure as defined by the claims and their equivalents. It includes various specific details to assist in that understanding but these are to be regarded as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the various embodiments described herein can be made without departing from the scope and spirit of the disclosure. In addition, descriptions of well-known functions and constructions may be omitted for clarity and conciseness.
The terms and words used in the following description and claims are not limited to the bibliographical meanings, but, are merely used by the inventor to enable a clear and consistent understanding of the disclosure. Accordingly, it should be apparent to those skilled in the art that the following description of various embodiments of the disclosure is provided for illustration purpose only and not for the purpose of limiting the disclosure as defined by the appended claims and their equivalent.
It is to be understood that the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a component surface” includes reference to one or more of such surfaces.
As is traditional in the field, embodiments may be described and illustrated in terms of blocks which carry out a described function or functions. These blocks, which may be referred to herein as managers, units, modules, hardware components or the like, are physically implemented by analog and/or digital circuits, such as logic gates, integrated circuits, microprocessors, microcontrollers, memory circuits, passive electronic components, active electronic components, optical components, hardwired circuits and the like, and may optionally be driven by firmware and software. The circuits may, for example, be embodied in one or more semiconductor chips, or on substrate supports, such as printed circuit boards and the like. The circuits constituting a block may be implemented by dedicated hardware, or by a processor (e.g., one or more programmed microprocessors and associated circuitry), or by a combination of dedicated hardware to perform some functions of the block and a processor to perform other functions of the block. Each block of the embodiments may be physically separated into two or more interacting and discrete blocks without departing from the scope of the disclosure. Likewise, the blocks of the embodiments may be physically combined into more complex blocks without departing from the scope of the disclosure.
It should be appreciated that the blocks in each flowchart and combinations of the flowcharts may be performed by one or more computer programs which include computer-executable instructions. The entirety of the one or more computer programs may be stored in a single memory or the one or more computer programs may be divided with different portions stored in different multiple memories.
Any of the functions or operations described herein can be processed by one processor or a combination of processors. The one processor or the combination of processors is circuitry performing processing and includes circuitry like an application processor (AP, e.g., a central processing unit (CPU)), a communication processor (CP, e.g., a modem), a graphical processing unit (GPU), a neural processing unit (NPU) (e.g., an artificial intelligence (AI) chip), a wireless-fidelity (Wi-Fi) chip, a Bluetooth™ chip, a global positioning system (GPS) chip, a near field communication (NFC) chip, connectivity chips, a sensor controller, a touch controller, a finger-print sensor controller, a display drive integrated circuit (IC), an audio CODEC chip, a universal serial bus (USB) controller, a camera controller, an image processing IC, a microprocessor unit (MPU), a system on chip (SoC), an IC, or the like.
In existing system, multiple applications may be used simultaneously in a XR device. For example, referring to
Referring to
In the proposed method, the uneven perceptual quality of media streams received from multiple applications are harmonized. The harmonization is performed by determining a perceptual quality score for at least one media stream received from multiple applications used in the XR device. Further, a target perceptual score is determined based on the determined perceptual quality score of the media streams received from multiple applications and network parameters. Finally, media parameters of at least one media stream is harmonized based on the target perceptual quality score. Thus, the harmonization of the media streams of multiple applications used in the XR device provides an even perceptual quality of media streams of multiple applications. Hence providing a seamless and smooth experience to the users while using multiple applications in the XR device.
Referring to
Further, the processor 203 of the XR device 201 communicates with the memory 207, the I/O interface 205 and a content harmonizer 209. The processor 203 is configured to execute instructions stored in the memory 207 and to perform various processes. The processor 203 may include one or a plurality of processors, can be a general-purpose processor, such as a central processing unit (CPU), an application processor (AP), or the like, a graphics-only processing unit, such as a graphics processing unit (GPU), a visual processing unit (VPU), and/or an Artificial intelligence (AI) dedicated processor, such as a neural processing unit (NPU).
The memory 207 of the XR device 201 includes storage locations to be addressable through the processor 203. The memory 207 is not limited to volatile memory and/or non-volatile memory. Further, the memory 207 may include one or more computer-readable storage media. The memory 207 may include non-volatile storage elements. For example, non-volatile storage elements may include magnetic hard discs, optical discs, floppy discs, flash memories, or forms of electrically programmable read only memories (EPROMs) or electrically erasable and programmable ROMs (EEPROMs). The memory 207 may store the media streams, such as audios stream, video streams, haptic feedbacks and the like.
The I/O interface 205 transmits the information between the memory 207 and external peripheral devices. The peripheral devices are the input-output devices associated with the XR device 201. The I/O interface receives several information from at least one of sensors, cameras, actuators, displays, speakers and microphones associated with the XR device 201.
The content harmonizer 209 of the XR device 201 communicates with the processor 203, I/O interface 205 and memory 207 to harmonize the perceptual quality of the media streams of multiple applications running simultaneously in the XR device 201. Initially, the content harmonizer 209 receives at least one media stream from each applications application among plurality of application running simultaneously in the XR device 201. As per 3GPP spec 22.856, there is a provision in the XR device 201 for supporting multiple applications to run simultaneously and allow the user to view and interact with multiple applications concurrently. For example, the XR device 201 may run multiple applications, such as painting application, gaming applications, healthcare applications and the like. Each of the applications can be associated with multiple media streams where the media stream represents a stream of media content. For example, some of the media stream includes, but not limited to at least one of audio stream, video stream, haptics, and 3D content.
Upon receiving, the content harmonizer 209 determines a perceptual quality score for at least one media stream received from each application of the plurality of applications. The perceptual quality score is a score which represents the media quality perceived by the user while using multiple applications in the XR device 201. In some embodiments of the disclosure, the perceptual quality score is a score which denotes the extent of influence of media parameters or metrics on the perceptual quality of the media content. The content harmonizer 209 determines the perceptual quality score for each of the media stream associated with multiple applications running in the XR device 201. The content harmonizer 209 determines the perceptual quality score using at least one of techniques, such as universal video quality (UVQ), media opinion score network (MOSNet), and the like. More particularly, the UVQ can be used to determine the perceptual quality of the video and MOSNet may determine the perceptual quality score for audio. For example, consider, a painting application and a movie application is running in the XR device 201 simultaneously. The painting application comprises media streams, audio streams and the haptic streams. Similarly, the movie application comprises the audio stream and the video stream. Further, the content harmonizer 209 determines the perceptual quality scores of audio stream, video stream and haptics streams of the painting applications and also determines the perceptual quality score for each of the audio stream and video stream of the movie application.
Furthermore, the content harmonizer 209 determines at least one candidate application with different media quality from plurality of applications based on the determined perceptual quality score of media streams received from each of the application. The content harmonizer 209 determines the candidate application among the plurality of applications running in the XR device 201 based on the determined perceptual quality score. The candidate application is the application which is having different perceptual quality score of the media stream with respect to the perceptual quality score of the media stream associated with other applications. For example, consider the multiple applications, such as video call application, movie applications, and painting application running in the XR device. Further, the perceptual quality score of the audio stream received from movie application and painting application is having similar value. However, the perceptual quality score of the video stream received from the movie application is different from that of the video stream received from the painting application. The movie application may have a less perceptual score for the video stream when compared to that of the perceptual quality score of video stream associated with the painting application. Thereafter, the content harmonizer 209 decides the candidate applications being the movie application, since the perceptual quality scores for the video streams is lesser and requires harmonization with painting application.
Thereafter, the content harmonizer 209 determines a target perceptual quality score for the at least one media stream of the at least one candidate application based on the determined perceptual quality score of the media streams and the plurality of network parameters. The target perceptual quality score is a score which represents the media quality that is required by the candidate application to harmonize with the other application. The target perceptual quality score is determined based on the perceptual quality score of the media streams of the candidate application and the network parameters associated with the candidate application. The network parameters may include at least one of network bandwidth, network congestion, a network quality of service (QoS) identifier, the XR device capability, latency requirement and the capability of at least one server from which the candidate application is receiving the media streams.
Finally, the content harmonizer 209 harmonizes the at least one media parameter of media stream received from the candidate application based on the target perceptual quality score. The media parameters include, but not limited to, a bit rate, a codec, a brightness, a frame complexity, a sample rate, a channel count, a pitch and a frame rate. Thus, the content harmonizer 209 harmonizes the at least one of media parameter of media stream received from candidate application with the media parameter of the media stream received from the other applications running simultaneously in the XR device 201. Hence, the content harmonizer 209 provides an even perceptual quality of media streams of multiple applications which results in a seamless and smooth experience to the users while using multiple applications in the XR device 201.
Referring to
Initially, the score generator module 211 of the content harmonizer 209 is configured to generate the perceptual quality score for each of the media stream received from multiple applications running in the XR device 201. The perceptual quality score is a score which denotes the extent of influence of media parameters or metrics on the perceptual quality of the media content. In addition, the perceptual quality score is a subjective score generated by media content and parameter-based assessment which can be used for comparison across applications. The score generator module 211 assesses and generates a perceptual quality score for each media stream received from plurality of applications running simultaneously in the XR device 201. The score generator module 211 initially extracts media parameters of each of the media stream received from plurality of applications running simultaneously in the XR device 201. The media parameters can be extracted using the media feature extractor. Further, the score generator module 211 generates a calibration table based on the extracted media parameters. The calibration table represents the information indicating the impact of individual media parameters on the quality of the media streams. The calibration table for the media stream can be determined using an AI model to determine the impact of media parameters for varying the quality of content. For example, the calibration table is <P1-5, P2-6, P3-9, P4-2>, where P1, P2, P3 and P4 indicates the media parameters and the numerical value represents the amount of impact of the metric on the media stream. It is observed that for the above calibration table the parameter 3 (P3) has the highest impact on the quality of the media stream and P4 has the lowest impact on the quality of the media stream. Hence, it can be deduced that the change in the P3 will lead to the change in the quality of the media content. Further, the score generator module 211 generates a perceptual quality score for the media streams received from the plurality of applications. The score generator module 211 determines a subjective perceptual quality score vector based on the determined calibration table and the media streams received from the plurality of applications. The perceptual quality scores for the media streams can be generated using at least one of an universal video quality (UVQ) method, mean opinion score (MOSNet). For example, the score generator module 211 generates the perceptual quality score for each of the media stream, such as video stream, audio stream, haptics streams, and 3D objects streams based on the calibration table for each of the media streams and the media streams received from plurality of applications. In addition, the perceptual quality score may have the values ranging between 0-9, where 0 indicates the lowest impact or lowest quality and 9 indicates the highest quality or highest impact. In addition, the score generator module 211 determines the candidate application from the plurality of applications for which the perceptual quality score of media streams is different from each other.
The score evaluator module 213 of the content harmonizer 209 determines a target perceptual quality score based on the perceptual quality score of media streams received from candidate applications, the XR device 201 capability and the network information. The network information includes, but not limited to network bandwidth, network congestion, a network quality of service QoS identifier, capability of at least one server from which the at least one stream is received, and latency requirements. The target perceptual quality score is a weighted average of all the application scores, where the target perceptual quality score is determined using the below Equation 1:
The weight factor is determined based on a bandwidth ratio, where the bandwidth ratio is a ratio of the bandwidth available to the bandwidth used. Further, if the bandwidth ratio is higher than a threshold, then the weight factor is directly proportional to the perceptual quality score. Similarly, if the bandwidth ratio is lower than a threshold, then the weight factor is inversely proportional to the quality score.
Upon determination of the target perceptual quality score, the score harmonizer module 215 harmonizes the media parameters of media streams received from candidate applications based on the target perceptual quality score. For example, consider the Video stream perceptual quality score of application 1 is V-4 and the video stream perceptual quality score of application 2 is V-8 and the target perceptual score is determined to be 7. The content modifier module 215-2 of the score harmonizer module 215 up samples the video stream perceptual quality score of application 1 to V-7 and makes no changes to the video stream perceptual quality score of application 2. Thus, ensuring even perceptual quality between application 1 and application 2. In some embodiments of the disclosure, the content negotiator module 215-1 of the score harmonizer module 215 renegotiates with server of the application 1 to render the video stream as per the determined target perceptual quality score.
Referring to
Referring to
Initially at operation S-01, the content of the application-1 running in the XR device 201 is received from an application server-1 401. The media content received from application server-1 401 comprises audio content and video content, where the audio content is received at XR device 201 from application server-1 401 is having codec value as AMR-WB 16000 and the video content received at the XR device 201 from the application server-1 401 is having a resolution of H264, QVGA.
At operation S-02 the content of application-2 running in the XR device 201 is received from an application server-2 405. The media content received from application server-2 also comprises a video content and an audio content. The video content from the application server-2 405 received at the XR device 201 is having a resolution value of H265, 720p. In addition, the audio content from the application server-2 405 received at the XR device 201 is having a codec value of AMR/8000.
At operation S-03, a content harmonization of the media contents received from application server-1 401 and application server-2 405 is invoked, when the perceptual quality score of the media content received from application server-1 401 is different from that of the perceptual quality score of the media content received from the application server-2 405. Further, a target perceptual quality score is determined based on the perceptual quality scores of the media content received from application server-1 401 and application server-2 405 and network parameters of the communication network between the XR device 201, application server-1 401 and application server-2 405.
At operation S-04, the XR device 201 transmits a stream configuration request to application server-1 401 for rendering the video content with a resolution H265/VGA (target perceptual quality score).
In addition, at operation S-05, the XR device 201 transmits a stream configuration request to application server-2 405 for rendering the audio content with a codec value AMR-WB/16000 (target perceptual quality score).
Further at operation S-07, the application server-1 401 accepts the stream reconfiguration request and renders the requested video content with a resolution H265/VGA to the XR device 201.
Similarly, at operation S-08, the application server-2 405 accepts the stream reconfiguration request and renders the requested audio content with a codec AMR-WB/16000 to the XR device 201.
At operation S-09, the XR device 201 reconfigures the quality and QoS parameters based on the rendered video content and audio content received from application server-1 401 and application server-2 405 respectively.
At operation S-10, the XR device 201 is able to view both the application 1 and application 2 with similar perceptual quality.
At operation S-11, the user device 201 invokes the content harmonization when a new application starts running in the XR device 201.
Referring to
In an embodiment of the disclosure, at operation S-501, the content harmonizer 209 detects if there are multiple applications running in the XR device 201. The XR device 201, continues to render the application content from an application server without harmonization when only one application is running in the XR device 201.
Upon detecting that multiple applications are running in the XR device 201, the content harmonizer 209 at operation S-502 further determines if the media contents of the multiple applications are comparable in terms of service criticality, latency requirements and the like. When the media contents of the multiple applications are determined to be comparable, then at operation S-503, the content harmonizer 209 determines if some media stream is accessible for processing. And, when the media stream is determined to be not accessible then media content continues to render the application content from an application server without harmonization.
However, when the media stream is determined to be available and accessible for processing, then at operation S-505 the content harmonizer 209, determines whether the XR device 201 has good on-device processing capability.
When the XR device 201 is determined to have a good on-device processing, at operation S-507, the content harmonizer 209 further determines whether a service renegotiation is possible. When the service re-negotiation is determined to be possible, at operation S-509 the content harmonizer 209 further determines whether the latency requirement is strict or not.
Furthermore, when the latency requirement is determined to be not strict, then at operation S-511 the content harmonizer 209 performs a combination of on-device content harmonization and service level content harmonization.
In some embodiments of the disclosure, when the service re-negotiation is determined to be not possible at operation S-507 and if the latency requirement is determined to be strict at operation S-509, then the content harmonizer 209 does not perform the content harmonization and simply renders the media content received from the application servers.
Furthermore, when the service re-negotiation is determined to be not possible at operation S-507 and when the latency requirement is determined to be not strict at operation S-509, then at operation S-514 the content harmonizer 209 performs an on-device content harmonization.
In addition, when the XR device 201 is determined to not have a good on-device processing capability at operation S-505, and service re-negotiation is possible at operation S-507, then at operation S-513, the content harmonizer 209 performs a service level content harmonization.
Finally, at operation S-515, the harmonized renderer 313 will render the requested media content for the applications running in the XR device 201.
Hence, the content harmonizer 209 applies different content harmonization on the media content based on different policy, device capability and the server capability. The different policies may include but not limited to a latency policy for the application and criticality policy for the application.
Referring to
Further, the XR device 201 receives inputs from one or more devices associated with the XR-device 201. The one or more devices associated with the includes, but not limited to sensors 605A, cameras 605B, actuators, 605C, displays 605D, speakers 605E and microphones 605F. Further, the inputs received from one or more devices associated with the XR device 201 can be pre-processed in a XR runtime 607. In some embodiments of the disclosure, the XR runtime 607 receives user inputs from multiple applications through the XR source management 609. The XR source management 609 serializes multiple user inputs 601—before transmitting to the XR runtime 607 for further processing.
The XR-runtime 607 provide data parallelly to all registered applications, e.g., if multiple application registers to get position tracking information, then XR Runtime may give that information parallelly to all registered applications.
For example, sensor data captured by sensor 605A, image data captured by cameras 605B are processed using some of the runtime functions 607A, such as tracking techniques, simultaneous localization and mapping (SLAM) techniques and the like. In some embodiments of the disclosure, the XR runtime functions 607A may receive the user input 601 from the application-1 600A for further processing through an interface IF-1a associated with API-1. Similarly, the data displayed on the displays 605D can be processed or modified using some of the composition techniques 607B and audio content received from speakers 605E and microphones 605F can be processed though audio subsystems 607C using audio processing techniques.
Further the XR runtime 607 communicates with XR source management 609 through an interface IF-1b. The XR source management 609 serializes requests from multiple applications 600A, 600B, 600N before forwarding it to XR Runtime 607. It also parallelizes multiple application data which will be sent in the uplink. In addition, the XR runtime 607 communicates with the XR-content harmonizer 611 through an interface IF-1d associated with API-1 and presentation engine 613 through an interface 1F-1c associated with API-1.
The content harmonizer 209 performs a service level content harmonization for the media streams received from multiple applications 600A, 600B, . . . 600N received through interface IF-11 associated with the application programming interface (API-11), data received from the XR runtime 607. The content harmonizer 209 comprises three sub-modules namely a score generator module 211, the score evaluator module 213 and a content negotiator module 215A. The score generator module 211 extracts at least one media parameter of each of the media stream received from multiple applications 600A, 600B, . . . 600N.
Referring to
Referring to
Referring to
For example, consider the perceptual quality score determined for the application 1 is V-4//RE-9/BR-6/CD-4/, and the perceptual quality score of the application 2 is V-8//RE-9/BR-4/CD-6/. Further, consider the bandwidth ratio is having a value 1.7. The bandwidth ratio is ratio between the available bandwidth and the bandwidth used. For the bandwidth ratio being 1.7, consider the corresponding weight factor is 3 for application 1 and 7 for application 2.
In some embodiments of the disclosure, consider the perceptual quality score determined for the application 1 is V-4//RE-9/BR-6/CD-4/, and the perceptual quality score of the application 2 is V-8//RE-9/BR-4/CD-6/. Further, consider the bandwidth ratio is having a value 1.1. The bandwidth ratio is ratio between the available bandwidth and the bandwidth used. For the bandwidth ratio being 1.7, consider the corresponding weight factor is 7 for application 1 and 3 for application 2.
Upon the generation of the target perceptual quality score, the content negotiator module 215A renegotiates with a server for modifying the media parameters of the media streams based on the target perceptual quality score. For example, consider for the target perceptual quality score having a value of 7, the content negotiator module 215A renegotiates with the server of the application 1 for renegotiating the resolution and bitrate value such that the V-score is modified from 4 to 7. In addition, the content negotiator module 215A renegotiates with the server of the application 2 for renegotiating the bitrate to a lower value to free up some bandwidth such that the V-score is reduced from 8 to 7.
Similarly, consider the target perceptual score having a value 5, the content negotiator module 215A renegotiates with server of the application 1 for renegotiating the bitrate value to modify the V-score from 4 to 5. In addition, the content negotiator module 215A renegotiates with the server of the application 2 for renegotiating the resolution and bitrate to a lower value to free up some bandwidth to reduce the V-score from 8 to 5.
Upon the content harmonization, the content harmonizer 209 communicates with a media session handler 615 through an interface IF-6 associated with the application programming interface (API-6). The media session handler 615 offers tools for content harmonizer 209 to negotiate media parameters for providing similar quality in multiple applications running to the users. The media session handler 615 further communicates with fifth-generation (5G) to establish, control and support the delivery of a media session through an interface IF-5. Furthermore, the content harmonizer 209 communicates with a scene manager 617 through an interface IF-12 associated with API-12. The scene manager 617 renders multiple media streams which in the user's observation range. In addition, the scene manager 617 performs the occlusion control. The scene manager 617 helps multiple applications in arranging the logical and spatial representation of a multisensory scene based on support from the XR Runtime 607 and XR Content manager 611. The IF-12 is realized through an API (API-12). In addition, the content harmonizer 209 communicates with a media access function 619 through an interface IF-7 associated with API-7. The IF-7 helps XR Content manager/content harmonizer 209 to get media content of multiple applications from media access function 619. Finally, the media access function 619 enables access to media streams to be communicated through the 5G system over an interface IF-4.
Referring to
Further, the XR device 201 receives inputs from one or more devices associated with the XR-device 201. The one or more devices associated with the includes, but not limited to sensors 705A, cameras 705B, actuators, 705C, displays 705D, speakers 705E and microphones 705F. Further, the inputs received from one or more devices associated with the XR device 201 can be pre-processed in a XR runtime 707. In addition, the XR runtime 707 provides data parallelly to all multiple applications 700A, 700B, . . . 700N. For example, if multiple applications are registered to get position tracking information, then the XR runtime 707 provides parallelly the position tracking information to all registered applications 700A, 700B . . . 700N. In some embodiments of the disclosure, the XR runtime 707 communicates with the XR source management 709 through an interface IF-1b. The XR source management 709 serializes multiple user inputs 701 before transmitting to the XR runtime 707 for further processing.
Further, in the XR runtime 707, data received from one or more devices associated with the XR device 201 can be pre-processed. For example, sensor data captured by sensor 705A, image data captured by cameras 705B are processed using some of the runtime functions 707A, such as tracking techniques, SLAM techniques, and the like. In some embodiments of the disclosure, the XR runtime functions 707A may receive the user input 701A from the application-1 700A for further processing. Similarly, the data displayed on the displays 705D can be processed or modified using some of the composition techniques 707B and audio content received from speakers 705E and microphones 705F can be processed though audio subsystems 707C using audio processing techniques.
In some embodiments of the disclosure, the XR runtime 707 communicates with XR source management 709 through an interface IF-1b. The XR-source management 709 serializes the request from multiple applications 700A, 700B, . . . 700N before forwarding to the XR runtime 707. Further, the XR source management 709 parallelizes multiple data received from multiple applications 700A, 700B, . . . 700N that needs to be transmitted in the uplink.
Upon the content harmonization, the content harmonizer 209 communicates with a media session handler 715 through an interface IF-6 associated with the application programming interface (API-6). Further, the XR runtime 707 communicates with a content harmonizer 209 through an interface IF-1d. The content harmonizer 209 accesses the media content from XR runtime 707 for multiple applications 700A, 700B, . . . 700N. In addition, the XR runtime 707 communicates with the XR-content harmonizer 209 through an interface IF-1d associated with API-1 and presentation engine 713 through an interface 1F-1c associated with API-1.
The content harmonizer 209 performs an on-device content harmonization for the media streams received from multiple applications 700A, 700B, . . . 700N received through interface IF-11 associated with the application programming interface (API-11), and data received from the XR runtime 707. The content harmonizer 209 comprises three sub-modules namely a score generator module 211, the score evaluator module 213 and a content modifier module 215B. The score generator module 211 extracts at least one media parameter of each of the media stream received from multiple applications 700A, 700B, . . . 700N.
Referring to
For example, the media streams received from multiple applications 700A, 700B, . . . 700N may include video stream, audio stream, haptic stream, and 3D object stream. Further, a video feature extractor 721A extracts video parameters of the video streams received from multiple applications 700A, 700B, . . . 700N. The video parameters extracted by the video feature extractor 721-1 may include, but not limited to resolution, frame-rate, bit-rate, codec, luminance, and frame-complexity. The frame-complexity can be quantified using measures like Shannon Entropy. For example, a video frame with plain background and very few objects will be less complex, whereas a frame with lots of texture and objects will be more complex.
Similarly, an audio feature extractor 721B extracts audio parameters of the audio streams received from multiple applications 700A, 700B, . . . 700N. The audio parameters may include, but not limited to sample rate, pitch, codec, bitrate and channel count. Furthermore, haptics feature extractor 721C extracts haptic parameters of the haptics streams received from multiple applications 700A, 700B, . . . 700N. The haptics parameters may include, but not limited to sample rate, codec and bitrate. In addition, a 3D object feature extractor extracts the 3D object parameters of the 3D object streams received from multiple applications 700A, 700B, . . . 700N. The 3D object parameters may include, but not limited to codec and bitrate.
Upon, the feature extraction, feature calibrator 723 of the score generator module 211 generates a calibration table based on the media parameters extracted from the feature extractor 721. The V-feature calibrator 723A generates the calibration table for each of the video parameters of the video stream received from the video feature extractor 721A. The A-feature calibrator 723B generates the calibration table for each of the audio parameters of the audio streams received from the audio feature extractor 721B. Further, the H-feature calibrator 723C generates the calibration table for each of the haptics parameters received from the haptics feature extractor 721C. An O-feature calibrator 723D generates a calibration table for each of the 3D object parameters received from the 3D object feature extractor 721D. Upon the generation of calibration table, the score assessor 725 of the score generator module 211 determines a perceptual quality score for each of the media stream received from multiple applications 700A, 700B, . . . 700N based on the calibration table and the user parameters in XR space and rendering space. A V-score assessor 725A determines the perceptual quality score 727A for each of the video stream received from multiple applications based on the calibration table generated by the V-feature calibrator 723A and the user parameters in XR space and rendering space. Similarly, A-score assessor 725B determines the perceptual quality score 727B for each of the audio stream received from multiple applications based on the calibration table generated by the A-feature calibrator 723B and the user parameters in XR space and rendering space. Moreover, H-score assessor 725C determines the perceptual quality score 727C for each of the haptics stream received from multiple applications based on the calibration table generated by the H-feature calibrator 723C and the user parameters in XR space and rendering space. In addition, O-score assessor 725D generates a perceptual quality score 727D for each of the 3D object stream received from multiple applications based on the calibration table generated by the O-feature calibrator 723D and the user parameters in XR space and rendering space.
For example, referring to
For example, referring to
For example, referring to
For example, referring to
Upon the generation of the perceptual quality score for each of the media stream received from multiple applications 700A, 700B, . . . 700N, the content harmonizer 209 determines candidate applications with different media quality from the plurality of application 700A, 700B, . . . 700N based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications 700A, 700B, . . . 700N. Further, the content harmonizer 209 determines a target perceptual quality score for at least one media stream based on the determined perceptual quality score of media streams for candidate applications. Finally, the content modifier module 215A of the content harmonizer 209 modifies the media parameter values based on the determined target perceptual score for at least one media stream of the candidate applications and the application information. For example, the application information may include but not limited to a Codecs, and resolutions.
Referring to
For example, consider the perceptual quality score determined for the application 1 is V-4//RE-9/BR-6/CD-4/, and the perceptual quality score of the application 2 is V-8//RE-9/BR-4/CD-6/. Further, consider the bandwidth ratio is having a value 1.7. The bandwidth ratio is ratio between the available bandwidth and the bandwidth used. For the bandwidth ratio being 1.7, consider the corresponding weight factor is 3 for application 1 and 7 for application 2.
Upon the generation of the target perceptual quality score, the content modifier module 215B up-samples the video resolution of the application 1 by factor of 2 in order to increase the perceptual quality score of the video stream from 4 to 7. However, content modifier module 215B does make any further modification in the media parameters of the media streams received from application 2. In addition, the content modifier module 215B indicates the harmonized renderer 313 regarding the up-sampling of the video parameters of the application 1.
In some embodiments of the disclosure, consider the perceptual quality score determined for the application 1 is V-4//RE-9/BR-6/CD-4/, and the perceptual quality score of the application 2 is V-8//RE-9/BR-4/CD-6/. Further, consider the bandwidth ratio is having a value 1.1. The bandwidth ratio is ratio between the available bandwidth and the bandwidth used. For the bandwidth ratio being 1.7, consider the corresponding weight factor is 7 for application 1 and 3 for application 2.
Upon the generation of the target perceptual quality score, the content modifier module 215B down-samples the video resolution of the application 2 by factor of 2 in order to decrease the perceptual quality score of the video stream from 8 to 5. However, content modifier module 215B does make any further modification in the media parameters of the media streams received from application 1. In addition, the content modifier module 215B indicates the harmonized renderer 313 regarding the down-sampling of the video parameters of the application 2.
Upon the content harmonization, the content harmonizer 209 communicates with a scene manager 717 through an interface IF-12 associated with API-12. The scene manager 717 renders multiple media streams which in the user's observation range. In addition, the scene manager 717 performs the occlusion control. The scene manager 717 helps multiple applications in arranging the logical and spatial representation of a multisensory scene based on support from the XR runtime 707 and XR content manager/content harmonizer 209. In addition, the content harmonizer 209 communicates with a media access function 719 through an interface IF-7 associated with API-7. The IF-7 helps XR content manager/content harmonizer 209 to get media content of multiple applications from media access function 719. Finally, the media access function 719 enables access to media streams to be communicated through the 5G system over an interface IF-4.
Referring to
Further, the XR device 201 receives inputs from one or more devices associated with the XR-device 201. The one or more devices associated with the includes, but not limited to sensors 805A, cameras, 805B, actuators, 805C, displays 805D, speakers 805E and microphones 805F. Further, the inputs received from one or more devices associated with the XR device 201 can be pre-processed in a XR runtime 807. In addition, the XR runtime 807 provides data parallelly to all multiple applications 800A, 800B, . . . 800N. For example, if multiple applications are registered to get position tracking information, then the XR runtime 807 provides parallelly the position tracking information to all registered applications 800A, 800B . . . 800-N−1. In some embodiments of the disclosure, the XR runtime 807 communicates with the XR source management 809 through an interface IF-1b. The XR source management 809 serializes multiple user inputs 801 before transmitting to the XR runtime 807 for further processing. In addition, the XR runtime 807 communicates with the XR-content harmonizer 209 through an interface IF-1d associated with API-1 and presentation engine 813 through an interface 1F-1c associated with API-1.
Further, in the XR runtime 807, data received from one or more devices associated with the XR device 201 can be pre-processed. For example, sensor data captured by sensor 805A, image data captured by cameras 805B are processed using some of the runtime functions 807A, such as tracking techniques, SLAM techniques and the like. In some embodiments of the disclosure, the XR runtime functions 807A may receive the user input 801A from the application-1 800A for further processing. Similarly, the data displayed on the displays 805D can be processed or modified using some of the composition techniques 807B and audio content received from speakers 805E and microphones 805-F can be processed though audio subsystems 807C using audio processing techniques.
In some embodiments of the disclosure, the XR runtime 807 communicates with XR source management 809 through an interface IF-1b. The XR-source management 809 serializes the request from multiple applications 800-1, 800-2, . . . 800-n before forwarding to the XR runtime 807. Further, the XR source management 809 parallelizes multiple data received from multiple applications 800A, 800B, . . . 800N that needs to be transmitted in the uplink.
Further, the XR runtime 807 communicates with a XR content harmonizer 811 through an interface IF-1d. The XR content harmonizer 811 accesses the media content from XR runtime 807 for multiple applications 800A, 800B, . . . 800N.
The content harmonizer 811 performs a combination of both on-device and service level content harmonization for the media streams received from multiple applications 800A, 800B, . . . 800N received through interface IF-11 associated with the application programming interface (API-11), and data received from the XR runtime 807. The content harmonizer 811 comprises four sub-modules namely a score generator module 211, the score evaluator module 213, a content negotiator module 215A and a content modifier module 215B. The score generator module 211 extracts at least one media parameter of each of the media stream received from multiple applications 800A, 800B, . . . 800N.
Referring to
For example, the media streams received from multiple applications 800A, 800B, . . . 800N may include video stream, audio stream, haptic stream, and 3D object stream. Further, a video feature extractor 821A extracts video parameters of the video streams received from multiple applications 800A, 800B, . . . 800N. The video parameters extracted by the video feature extractor 821A may include, but not limited to resolution, frame-rate, bit-rate, codec, luminance, and frame-complexity. The frame-complexity can be quantified using measures like Shannon Entropy. For example, a video frame with plain background and very few objects will be less complex, whereas a frame with lots of texture and objects will be more complex. Similarly, an audio feature extractor 821B extracts audio parameters of the audio streams received from multiple applications 800A, 800B, . . . 800N. The audio parameters may include, but not limited to sample rate, pitch, codec, bitrate and channel count. Furthermore, haptics feature extractor 821C extracts haptic parameters of the haptics streams received from multiple applications 800A, 800B, . . . 800N. The haptics parameters may include, but not limited to sample rate, codec and bitrate. In addition, a 3D object feature extractor 821D extracts the 3D object parameters of the 3D object streams received from multiple applications 800A, 800B, . . . 800N. The 3D object parameters may include, but not limited to codec and bitrate.
Upon, the feature extraction, feature calibrator 823 of the score generator module 211 generates a calibration table based on the media parameters extracted from the feature extractor 821. The V-feature calibrator 723A generates the calibration table for each of the video parameters of the video stream received from the video feature extractor 721A. The A-feature calibrator 823A generates the calibration table for each of the audio parameters of the audio streams received from the audio feature extractor 821B. Further, the H-feature calibrator 823C generates the calibration table for each of the haptics parameters received from the haptics feature extractor 821C. An O-feature calibrator 823D generates a calibration table for each of the 3D object parameters received from the 3D object feature extractor 821D. Upon the generation of calibration table, the score assessor 825 of the score generator module 211 determines a perceptual quality score for each of the media stream received from multiple applications 800A, 800B, . . . 800N based on the calibration table and the user parameters in XR space and rendering space. A V-score generator 825A determines the perceptual quality score 827A for each of the video stream received from multiple applications based on the calibration table generated by the V-feature calibrator 823A and the user parameters in XR space and rendering space. Similarly, A-score generator 825B determines the perceptual quality score 827B for each of the audio stream received from multiple applications based on the calibration table generated by the A-feature calibrator 823B and the user parameters in XR space and rendering space. Moreover, H-score generator 825C determines the perceptual quality score 827C for each of the haptics stream received from multiple applications based on the calibration table generated by the H-feature calibrator 823C and the user parameters in XR space and rendering space. In addition, O-score generator 825D generates a perceptual quality score 827D for each of the 3D object stream received from multiple applications based on the calibration table generated by the O-feature calibrator 823D and the user parameters in XR space and rendering space.
For example, referring to
For example, referring to
For example, referring to
For example, referring to
Upon the generation of the perceptual quality score for each of the media stream received from plurality of applications 800A, 800B, . . . 800N, the content harmonizer 811 determines candidate applications with different media quality from the plurality of application 800A, 800B, . . . 800N based on the perceptual quality score for the at least one media stream received from each application of the plurality of applications 800A, 800B, . . . 800N. Further, the content harmonizer 811 determines a target perceptual quality score for at least one media stream based on the determined perceptual quality score of media streams for candidate applications and network parameters. The network parameters may include, but not limited to network bandwidth, network congestion, a network quality of service (QoS) identifier, the XR device capability and capability of XR device, capability of at least one server from which the at least one stream is received, and latency requirements. Finally, the content negotiator module 215-A and content modifier module 215-B of the content harmonizer 811 performs at least one of renegotiates with a server for modifying the media parameters of the media streams based on the target perceptual quality score and also upscales or downscales the media parameter values based on the determined target perceptual score for at least one media stream of the candidate applications, network parameters and the application information. The renegotiation or upscaling/downscaling of the media parameters is performed by determining the difference between the perceptual quality score, and the target perceptual quality score and the calibration table. If the difference is high, then the multiple media parameters having a high value in the calibration table is modified. Similarly, if the difference is low, then the multiple parameters having a low value in the calibration table is modified.
Referring to
For example, consider the perceptual quality score determined for the application 1 is V-4//RE-9/BR-6/CD-4/, and the perceptual quality score of the application 2 is V-8//RE-9/BR-4/CD-6/. Further, consider the bandwidth ratio is having a value 1.7. The bandwidth ratio is ratio between the available bandwidth and the bandwidth used. For the bandwidth ratio being 1.7, consider the corresponding weight factor is 3 for application 1 and 7 for application 2.
Upon the generation of the target perceptual quality score, the content modifier module 215B up-samples the video resolution of the application 1 by factor of 2 and the content negotiator module 215A re-negotiates for an increase in bitrate with the server associated with application 2. The modification and re-negotiation is performed in order to increase the perceptual quality score of the video stream from 4 to 7.
In some embodiments of the disclosure, consider the perceptual quality score determined for the application 1 is V-4//RE-9/BR-6/CD-4/, and the perceptual quality score of the application 2 is V-8//RE-9/BR-4/CD-6/. Further, consider the bandwidth ratio is having a value 1.1. The bandwidth ratio is ratio between the available bandwidth and the bandwidth used. For the bandwidth ratio being 1.7, consider the corresponding weight factor is 7 for application 1 and 3 for application 2.
Upon the generation of the target perceptual quality score, the content modifier module 215B down-samples the video resolution of the application 2 by factor of 2 and the content negotiator module 215A re-negotiates with server associated with the application 2 to lower the bitrate value of the application 2 in order to decrease the perceptual quality score of the video stream from 8 to 5.
Upon the content harmonization, the content harmonizer 209 communicates with a scene manager 817 through an interface IF-12 associated with API-12. The scene manager 817 helps multiple applications in arranging the logical and spatial representation of a multisensory scene based on support from the XR Runtime 807 and content harmonizer 209. In addition, the content harmonizer 209 communicates with a media access function 819 through an interface IF-7 associated with API-7. The IF-7 helps XR Content manager/content harmonizer 209 to get media content of multiple applications from media access function 819. In addition, the content harmonizer 209 communicates with a media session handler 815 through an interface IF-6 associated with API-6. The media session handler 815 offers tools for content harmonizer 209 to negotiate media parameters with the server associated with the plurality of applications. In some embodiments of the disclosure, the media session handler 815 activates 5G media functionality, such as network assistance, edge resource discovery, edge computation offload, and the like. In addition, the media access function 819 enables access to media streams to be communicated through the 5G system over an interface IF-4.
Referring to
Referring to
Referring to
Referring to
At operation 1003, the content harmonizer 209 of the XR device 201, determines perceptual quality score for at least one media stream received each application of plurality of applications. The perceptual quality score for at least one media stream is determined based on the calibration table and the network parameters. The calibration table indicates the impact of each media parameter of at least one media stream received from plurality of applications. The network parameters include, but not limited to a network bandwidth, network congestion, a network quality of service (QoS) identifier, capability of XR device, capability of at least one server from which the at least one stream is received, and latency requirements.
At operation 1005, the content harmonizer 209 of the XR device 201, determines at least one candidate application with different quality from plurality of applications based on perceptual quality score for at least one media stream received from each application of plurality of applications. The candidate applications are the applications for which the perceptual quality scores are different and require the harmonization of the perceptual quality of media streams received from plurality of applications.
At operation 1007, the content harmonizer 209 of the XR device 201, determines a target perceptual quality score for at least one media stream of the at least one candidate application based on the determined perceptual quality score for the candidate applications and plurality of network parameters.
At operation 1009, the content harmonizer 209 of the XR device 201, harmonizes at least one media parameter of the at least one media stream received from the at least one candidate application based on the target perceptual quality score. The content harmonizer 209 harmonizes the media parameter of the at least one media stream by at least one of modifying the media parameter value based on target perceptual quality score and re-negotiating with the server associated with the candidate application to stream the candidate application with a target perceptual quality score.
It will be appreciated that various embodiments of the disclosure according to the claims and description in the specification can be realized in the form of hardware, software or a combination of hardware and software.
Any such software may be stored in a non-transitory computer readable storage medium. The non-transitory computer readable storage medium stores one or more computer programs (software modules), the one or more computer programs comprising instructions, which when executed by one or more processors in an electronic device, cause the electronic device to perform a method of the disclosure.
Any such software may be stored in the form of volatile or non-volatile storage such as, for example, a storage device like read only memory (ROM), whether erasable or rewritable or not, or in the form of memory such as, for example, random access memory (RAM), memory chips, device or integrated circuits or on an optically or magnetically readable medium such as, for example, a compact disk (CD), digital versatile disc (DVD), magnetic disk or magnetic tape or the like. It will be appreciated that the storage devices and storage media are various embodiments of non-transitory machine-readable storage that are suitable for storing a computer program or computer programs comprising instructions that, when executed, implement various embodiments of the disclosure. Accordingly, various embodiments provide a program comprising code for implementing apparatus or a method as claimed in any one of the claims of this specification and a non-transitory machine-readable storage storing such a program.
While the disclosure has been shown and described with reference to various embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the disclosure as defined by the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
202341033717 | May 2023 | IN | national |
2023 41033717 | Oct 2023 | IN | national |
This application is a continuation application, claiming priority under § 365(c), of an International application No. PCT/KR2024/004751, filed on Apr. 9, 2024, which is based on and claim the benefit of an Indian provisional patent application number 202341033717, filed on May 12, 2023, in the Indian Intellectual Property Office, and of an Indian Non-Provisional patent application number 202341033717, filed on Oct. 19, 2023, in the Indian Intellectual Property Office, the disclosure of each of which is incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/KR2024/004751 | Apr 2024 | WO |
Child | 18638057 | US |