This application claims the priority benefit of Taiwan application serial no. 110127878, filed on Jul. 29, 2021. The entirety of the above-mentioned patent application is hereby incorporated by reference herein and made a part of this specification.
The disclosure relates to a video system, and more particularly to an augmented reality (AR) system and an operation method thereof.
Various audio-visual streaming services have gained increasing popularity. Common audio-visual streaming services include video conferencing. In a video conference, a user A may show something to a user B far away through a communication network. For example, a mobile phone held by the user A is displaying an interesting digital content (a picture or a three-dimensional digital object), and the user A may want to show this digital content to the user B far away through the video conference. Therefore, the user A uses a video conferencing device to take a picture of this mobile phone. However, due to various environmental factors (such as resolution, color shift, or the like), the user B may not be able to see the content displayed by the mobile phone of the user A clearly.
The disclosure provides an augmented reality (AR) system and an operation method thereof for an AR application.
In an embodiment of the disclosure, the AR system includes a target device, an AR server, and an AR device. The target device is configured to display the marker. The AR server is configured to provide a digital content corresponding to the marker. The AR device is configured to capture the target device and the marker to generate a picture. The AR device obtains the digital content from the AR server through a communication network. The AR device tracks the target device in the picture according to the marker for an AR application. In the AR application, the AR device overlays the digital content on the target device in the picture.
In an embodiment of the disclosure, the operation method includes the following steps. A target device displays a marker. An AR server provides a digital content corresponding to the marker. An AR device receives the digital content from the AR server through a communication network. The AR device captures the target device and the marker to generate a picture. The AR device tracks the target device in the picture according to the marker for an AR application. In the AR application, the AR device overlays the digital content on the target device in the picture.
Based on the above, the AR device in the embodiments of the disclosure may capture the marker of the target device to generate the picture for the AR application. The AR server may provide the digital content corresponding to the marker to the AR device. During the AR application, the AR device may overlay the digital content provided by the AR server on the target device in the picture. Since the digital content is not fixedly stored in the AR device, the AR device may present AR effect in a more flexible manner.
In order to make the aforementioned features and advantages of the disclosure comprehensible, embodiments accompanied with drawings are described in detail below.
Throughout the text of the specification (including the claims), the term “couple (or connect)” refers to any direct or indirect connection means. For example, where a first device is described to be coupled (or connected) to a second device in the text, it should be interpreted that the first device may be directly connected to the second device, or that the first device may be indirectly connected to the second device through another device or some connection means. The terms “first,” “second,” and the like mentioned in the specification or the claims are used only to name the elements or to distinguish different embodiments or scopes, and are not intended to limit the upper or lower limit of the number of the elements, nor are they intended to limit the order of the elements. Moreover, wherever applicable, elements/components/steps referenced by the same numerals in the figures and embodiments refer to the same or similar parts. Elements/components/steps referenced by the same numerals or the same language in different embodiments may be mutually referred to for relevant descriptions.
In step S240, the AR device 120 may capture the target device 110 and the marker MRK to generate a picture (or a picture stream). The AR device 120 may, for example (but not limited to), obtain digital content download information for the AR server 130 according to the marker MRK displayed by the target device 110. According to the actual design, in some embodiments, the marker MRK may include a QR code or other programmable figure, and the digital content download information may be embedded into the marker MRK. According to the actual design, the digital content download information may include an address of the AR server 130, an identification code of the target device 110, a digital content identification code, and/or other related information of digital content download. The AR device 120 may obtain the digital content DC from the AR server 130 through the communication network according to the digital content download information.
In step S250, the AR device 120 may track the target device 110 in the picture for an AR application. According to the actual design, the AR application may include a game application, an education application, a video conferencing application, and/or other applications. During the AR application, the AR device 120 may overlay the digital content DC provided by the AR server 430 on the target device 110 in the picture (step S260).
In the video conference shown by
Therefore, in the video conference (AR application), the target device 110 may provide the digital content DC being displayed to the AR device 120, and the AR device 120 may capture the target device 110 and the user A to generate a picture (here referred to as a conference picture). The AR device 120 may overlay the digital content DC on the target device 110 in the conference picture to generate an AR conference picture. The AR device 120 may transmit the AR conference picture to the remote device 300 through the communication network for video conferencing. The remote device 300 may display the AR conference picture to the user B. Since the digital content being displayed by the target device 110 that the user B sees is not captured by the AR device 120, the digital content does not have issues such as resolution or color shift.
For example, based on the actual design, the digital content provided by the target device 110 to the AR device 120 may include a three-dimensional digital object, and the target device 110 has at least one attitude sensor (not shown in
In the embodiment shown in
It is assumed herein that the display information D_inf may include the device identification code corresponding to the target device 410. The target device 410 may display the marker MRK for transmitting the device identification code of the target device 410 to the AR device 420. The AR device 420 may transmit a content request carrying the device identification code to the AR server 430 through the communication network, and the target device 410 may provide the display information D_inf carrying the device identification code to the AR server 430 through the communication network. The AR server 430 may compare the device identification code of the display information D_inf with the device identification code of the content request of the AR device 420 to generate a comparison result. The AR server 430 may determine whether to provide the digital content DC to the AR device 420 according to the comparison result.
It is assumed herein that the display information D_inf may include the display content currently displayed corresponding to the target device 410. The AR server 430 may perform a value-added service for converting the display content (the display information D_inf) currently displayed by the target device 410 into the digital content DC. The value-added service may be different according to the actual design/application. For example, in some embodiments, the value-added service provided by the AR server 430 may include a super-resolution (SR) imaging service, a three-dimensional image conversion service, an image enhancement service, a translation service, and/or other services. The “super-resolution imaging” is a technique improving video resolution. The super-resolution imaging service provided by the AR server 430 may enhance the display content (the display information D_inf) currently displayed by the target device 410 as the digital content DC. The three-dimensional image conversion service provided by the AR server 430 may convert a two-dimensional display content (the display information D_inf) currently displayed by the target device 410 into a three-dimensional content as the digital content DC. The image enhancement service provided by the AR server 430 includes performing a de-blurring operation on the display content (the display information D_inf) currently displayed by the target device 410 for converting the display content (the display information D_inf) into the digital content DC. The translation service provided by the AR server 430 may convert a text content (the display information D_inf) currently displayed by the target device 410 from a first language into a second language and use a conversion result as the digital content DC.
According to different design requirements, the application processor 411 and/or the image processor 421 may be implemented as a hardware, a firmware, a software (i.e., a program), or a combination of many among the above three. In terms of hardware, the application processor 411 and/or the image processor 421 may be implemented at a logic circuit on an integrated circuit. Related functions of the application processor 411 and/or the image processor 421 may be implemented as a hardware by using hardware description languages such as Verilog, HDL, or VHDL, or other suitable programming languages. For example, the related functions of the application processor 411 and/or the image processor 421 may be implemented at various logic blocks, modules and circuits in one or more controllers, microcontrollers, microprocessors, application-specific integrated circuits (ASIC), digital signal processors (DSP), field programmable gate arrays (FPGA), and/or other processing units.
In terms of software and/or firmware, the related functions of the application processor 411 and/or the image processor 421 may be implemented as programming codes. For example, general programming languages (such as C, C++, or assembly languages) or other suitable programming languages are used to implement the application processor 411 and/or image processor 421. The programming codes may be recorded/stored in a non-transitory computer readable medium. In some embodiments, the non-transitory computer readable medium includes, for example, a read only memory (ROM), a tape, a disk, a card, a semiconductor memory, a programmable logic circuit and/or a memory device. The memory device includes a hard disk drive (HDD), a solid-state drive (SSD), or other memory device. A computer, a central processing unit (CPU), a controller, a microcontroller, or a microprocessor may read and execute the programming codes from the non-transitory computer readable medium, thereby implementing the related functions of the application processor 411 and/or the image processor 421. Moreover, the programming codes may also be provided to the computer (or the CPU) through any transmission medium (a communication network, a broadcast wave, or the like). The communication network is, for example, the Internet, a wired communication network, a wireless communication network, or other communication medium.
In summary, the AR device of the embodiments above may capture the marker of the target device 110 to generate the picture for the AR application. The AR server may provide the digital content corresponding to the marker MRK to the AR device. During the AR application, the AR device may overlay the digital content provided by the AR server on the target device in the picture. Since the digital content is not fixedly stored in the AR device, the AR device may present AR effect in a more flexible manner.
Although the disclosure has been described with reference to the above embodiments, they are not intended to limit the disclosure. It will be apparent to one of ordinary skill in the art that modifications to the described embodiments may be made without departing from the spirit and the scope of the disclosure. Accordingly, the scope of the disclosure will be defined by the attached claims and their equivalents and not by the above detailed descriptions.
Number | Date | Country | Kind |
---|---|---|---|
110127878 | Jul 2021 | TW | national |