METHOD BY WHICH RECEPTION DEVICE PERFORMS END-TO-END TRAINING IN WIRELESS COMMUNICATION SYSTEM, RECEPTION DEVICE, PROCESSING DEVICE, STORAGE MEDIUM, METHOD BY WHICH TRANSMISSION DEVICE PERFORMS END-TO-END TRAINING, AND TRANSMISSION DEVICE

TECHNICAL FIELD

The present disclosure relates to a wireless communication system.

BACKGROUND

A variety of technologies, such as machine-to-machine (M2M) communication, machine type communication (MTC), and a variety of devices demanding high data throughput, such as smartphones and tablet personal computers (PCs), have emerged and spread. Accordingly, the volume of data throughput demanded to be processed in a cellular network has rapidly increased. In order to satisfy such rapidly increasing data throughput, carrier aggregation technology or cognitive radio technology for efficiently employing more frequency bands and multiple input multiple output (MIMO) technology or multi-base station (BS) cooperation technology for raising data capacity transmitted on limited frequency resources have been developed.

As more and more communication devices have required greater communication capacity, there has been a need for enhanced mobile broadband (eMBB) communication relative to legacy radio access technology (RAT). In addition, massive machine type communication (mMTC) for providing various services at anytime and anywhere by connecting a plurality of devices and objects to each other is one main issue to be considered in next-generation (e.g., 5G) communication.

Communication system design considering services/user equipment (UEs) sensitive to reliability and latency is also under discussion. The introduction of next-generation RAT is being discussed in consideration of eMBB communication, mMTC, ultra-reliable and low-latency communication (URLLC), and the like.

While 5G communication is still under development, there is an increasing demand for higher data rates to accommodate new services such as virtual reality and autonomous driving.

SUMMARY

As new radio communication technology has been introduced, the number of UEs to which a BS should provide services in a prescribed resource region is increasing and the volume of data and control information that the BS transmits/receives to/from the UEs to which the BS provides services is also increasing. Since the amount of resources available to the BS for communication with the UE(s) is limited, a new method for the BS to efficiently receive/transmit uplink/downlink data and/or uplink/downlink control information from/to the UE(s) using the limited radio resources is needed. In other words, due to increase in the density of nodes and/or the density of UEs, a method for efficiently using high-density nodes or high-density UEs for communication is needed.

A method to efficiently support various services with different requirements in a wireless communication system is also needed.

Overcoming delay or latency is an important challenge to applications, performance of which is sensitive to delay/latency.

To provide various types of communication services, applying artificial intelligence to wireless communication systems is being considered. Thus, methods for training artificial intelligence efficiently are needed.

The objects to be achieved with the present disclosure are not limited to what has been particularly described hereinabove and other objects not described herein will be more clearly understood by persons skilled in the art from the following detailed description.

In an aspect of the present disclosure, provided herein is a method of performing end-to-end learning by a receiving device in a wireless communication system. The method includes: receiving transmission neural network information including a configuration of a transmission neural network from a transmitting device; receiving a plurality of training symbols for the transmission neural network from the transmitting device; determining a gradient for the transmission neural network based on the transmission neural network information and the plurality of training symbols; and feeding the gradient back to the transmitting device.

In another aspect of the present disclosure, provided herein is a receiving device configured to perform end-to-end learning in a wireless communication system. The receiving device includes: at least one transceiver; at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions that, when executed, cause the at least one processor to perform operations. The operations include: receiving transmission neural network information including a configuration of a transmission neural network from a transmitting device; receiving a plurality of training symbols for the transmission neural network from the transmitting device; determining a gradient for the transmission neural network based on the transmission neural network information and the plurality of training symbols; and feeding the gradient back to the transmitting device.

In another aspect of the present disclosure, provided herein is a processing device for a communication device. The processing device includes: at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions that, when executed, cause the at least one processor to perform operations. The operations include: receiving transmission neural network information including a configuration of a transmission neural network from a transmitting device; receiving a plurality of training symbols for the transmission neural network from the transmitting device; determining a gradient for the transmission neural network based on the transmission neural network information and the plurality of training symbols; and feeding the gradient back to the transmitting device.

In another aspect of the present disclosure, provided herein is a computer-readable storage medium configured to store at least one program code including instructions that, when executed, cause at least one processor to perform operations. The operations include: receiving transmission neural network information including a configuration of a transmission neural network from a transmitting device; receiving a plurality of training symbols for the transmission neural network from the transmitting device; determining a gradient for the transmission neural network based on the transmission neural network information and the plurality of training symbols; and feeding the gradient back to the transmitting device.

In each aspect of the present disclosure, determining the gradient for the transmission neural network based on the transmission neural network information and the plurality of training symbols may include: determining a plurality of gradient values of the transmission neural network based on the plurality of training symbols, respectively; and determining the gradient for the transmission neural network by averaging the plurality of gradient values.
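As a non-limiting illustration, the averaging of the plurality of gradient values may be sketched as follows. The function name average_gradient, the representation of each gradient value as a flat list of floating-point numbers, and the sample values are assumptions made only for this sketch and are not taken from the present disclosure.

```python
from typing import List, Sequence


def average_gradient(per_symbol_gradients: Sequence[Sequence[float]]) -> List[float]:
    """Return the element-wise mean of per-training-symbol gradient values.

    Each inner sequence is assumed (for illustration only) to hold the
    gradient value determined for one received training symbol.
    """
    if not per_symbol_gradients:
        raise ValueError("at least one gradient value is required")
    count = len(per_symbol_gradients)
    dim = len(per_symbol_gradients[0])
    if any(len(g) != dim for g in per_symbol_gradients):
        raise ValueError("all gradient values must have the same length")
    # Sum each component over all training symbols, then divide by the count.
    return [sum(g[i] for g in per_symbol_gradients) / count for i in range(dim)]


# Hypothetical gradient values for three training symbols.
gradient_values = [[0.2, -0.4], [0.4, 0.0], [0.0, -0.2]]
feedback = average_gradient(gradient_values)
```

In this sketch, a single averaged vector, rather than one gradient value per training symbol, is what would be fed back to the transmitting device.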

In each aspect of the present disclosure, the operations may include receiving information on a training part of the transmission neural network from the transmitting device.

In each aspect of the present disclosure, the transmission neural network information may include information on an initial state of the transmission neural network.

In each aspect of the present disclosure, the transmission neural network information may include information on generation of training symbols in the transmission neural network.

In another aspect of the present disclosure, provided herein is a method of performing end-to-end learning by a transmitting device in a wireless communication system. The method includes: transmitting transmission neural network information including a configuration of a transmission neural network to a receiving device; transmitting a plurality of training symbols for the transmission neural network to the receiving device; receiving a gradient that is an average of a plurality of gradient values respectively related to the plurality of training symbols from the receiving device; and updating a weight of the transmission neural network based on the gradient.

In a further aspect of the present disclosure, provided herein is a transmitting device configured to perform end-to-end learning in a wireless communication system. The transmitting device includes: at least one transceiver; at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions that, when executed, cause the at least one processor to perform operations. The operations include: transmitting transmission neural network information including a configuration of a transmission neural network to a receiving device; transmitting a plurality of training symbols for the transmission neural network to the receiving device; receiving a gradient that is an average of a plurality of gradient values respectively related to the plurality of training symbols from the receiving device; and updating a weight of the transmission neural network based on the gradient.
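As a non-limiting illustration, updating a weight of the transmission neural network based on the fed-back gradient may be sketched as follows. The present disclosure states only that the weight is updated based on the gradient; the plain gradient-descent rule, the learning_rate parameter, and the function name update_weights are assumptions made for this sketch.

```python
from typing import List, Sequence


def update_weights(
    weights: Sequence[float],
    averaged_gradient: Sequence[float],
    learning_rate: float = 0.1,  # assumed step size, not given in the disclosure
) -> List[float]:
    """Apply one gradient-descent step using the gradient fed back by the receiver."""
    if len(weights) != len(averaged_gradient):
        raise ValueError("weights and gradient must have the same length")
    # Move each weight against its gradient component.
    return [w - learning_rate * g for w, g in zip(weights, averaged_gradient)]


# Hypothetical weights and fed-back averaged gradient.
old_weights = [1.0, -1.0]
fed_back_gradient = [0.5, -0.5]
new_weights = update_weights(old_weights, fed_back_gradient, learning_rate=0.1)
```

Other update rules (for example, rules using momentum) could be substituted without changing the signaling described above, since the transmitting device needs only the averaged gradient.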

In each aspect of the present disclosure, the operations may include: determining the training part of the transmission neural network; and transmitting information on the training part to the receiving device.

In each aspect of the present disclosure, determining the training part of the transmission neural network may include determining a front end of the transmission neural network as the training part.
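As a non-limiting illustration, restricting the weight update to the training part may be sketched as follows. It is assumed, purely for this sketch, that the transmission neural network is an ordered list of layers, that the front end corresponds to the first entries of that list, and that a gradient is fed back only for those layers; the function name update_training_part and the learning_rate parameter are likewise hypothetical.

```python
from typing import List, Sequence


def update_training_part(
    layers: Sequence[Sequence[float]],
    training_part_gradients: Sequence[Sequence[float]],
    learning_rate: float = 0.1,  # assumed step size
) -> List[List[float]]:
    """Update only the layers in the training part; copy the rest unchanged.

    The training part is assumed to be the first len(training_part_gradients)
    layers of the ordered layer list.
    """
    num_trainable = len(training_part_gradients)
    if num_trainable > len(layers):
        raise ValueError("more gradients than layers")
    updated: List[List[float]] = []
    for index, layer in enumerate(layers):
        if index < num_trainable:
            grad = training_part_gradients[index]
            if len(grad) != len(layer):
                raise ValueError("gradient shape does not match layer")
            updated.append([w - learning_rate * g for w, g in zip(layer, grad)])
        else:
            # Layers outside the training part keep their current weights.
            updated.append(list(layer))
    return updated


# Hypothetical three-layer network in which only the first layer is trained.
network = [[1.0, 2.0], [3.0], [4.0, 5.0]]
result = update_training_part(network, [[1.0, -1.0]], learning_rate=0.5)
```

Under these assumptions, the gradient to be fed back covers only the training part, so the amount of feedback scales with the size of the training part rather than with the size of the whole transmission neural network.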

In each aspect of the present disclosure, the transmission neural network information may include information on an initial state of the transmission neural network.

In each aspect of the present disclosure, the transmission neural network information may include information on generation of training symbols in the transmission neural network.

The foregoing solutions are merely a part of the examples of the present disclosure and various examples into which the technical features of the present disclosure are incorporated may be derived and understood by persons skilled in the art from the following detailed description.

According to implementations of the present disclosure, a wireless communication signal may be efficiently transmitted/received. Accordingly, the overall throughput of a wireless communication system may be improved.

According to implementations of the present disclosure, a wireless communication system may efficiently support various services with different requirements.

According to implementations of the present disclosure, delay/latency occurring during wireless communication between communication devices may be reduced.

According to some implementations of the present disclosure, training of artificial intelligence may be efficiently performed.

The effects according to the present disclosure are not limited to what has been particularly described hereinabove and other effects not described herein will be more clearly understood by persons skilled in the art related to the present disclosure from the following detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are included to provide a further understanding of the present disclosure, illustrate examples of implementations of the present disclosure and together with the detailed description serve to explain implementations of the present disclosure:

FIG. 1 illustrates an example of a communication system 1 to which implementations of the present disclosure are applied;

FIG. 2 is a block diagram illustrating examples of communication devices capable of performing a method according to the present disclosure;

FIG. 3 illustrates another example of a wireless device capable of performing implementation(s) of the present disclosure;

FIG. 4 illustrates a perceptron structure used in an artificial neural network;

FIG. 5 illustrates a multilayer perceptron structure;

FIG. 6 illustrates the structure of a convolutional neural network (CNN);

FIG. 7 illustrates a filtering operation in a CNN;

FIG. 8 illustrates a concept of end-to-end learning to which backpropagation is applied;

FIG. 9 illustrates an example of computing a gradient in a neural network;

FIG. 10 illustrates an example of transfer learning (TL) to which deep learning is applied;

FIG. 11 is a conceptual diagram illustrating TL to which fine-tuning of pre-trained models is applied;

FIG. 12 is a diagram illustrating a concept of end-to-end learning according to some implementations of the present disclosure;

FIG. 13 illustrates a conventional end-to-end learning process performed based on backpropagation; and

FIG. 14 illustrates an example of an end-to-end learning process according to some implementations of the present disclosure.

DETAILED DESCRIPTION

Hereinafter, implementations according to the present disclosure will be described in detail with reference to the accompanying drawings. The detailed description, which will be given below with reference to the accompanying drawings, is intended to explain exemplary implementations of the present disclosure, rather than to show the only implementations that may be implemented according to the present disclosure. The following detailed description includes specific details in order to provide a thorough understanding of the present disclosure. However, it will be apparent to those skilled in the art that the present disclosure may be practiced without such specific details.

In some instances, known structures and devices may be omitted or may be shown in block diagram form, focusing on important features of the structures and devices, so as not to obscure the concept of the present disclosure. The same reference numbers will be used throughout the present disclosure to refer to the same or like parts.

A technique, a device, and a system described below may be applied to a variety of wireless multiple access systems. The multiple access systems may include, for example, a code division multiple access (CDMA) system, a frequency division multiple access (FDMA) system, a time division multiple access (TDMA) system, an orthogonal frequency division multiple access (OFDMA) system, a single-carrier frequency division multiple access (SC-FDMA) system, a multi-carrier frequency division multiple access (MC-FDMA) system, etc. CDMA may be implemented by radio technology such as universal terrestrial radio access (UTRA) or CDMA2000. TDMA may be implemented by radio technology such as global system for mobile communications (GSM), general packet radio service (GPRS), enhanced data rates for GSM evolution (EDGE) (i.e., GERAN), etc. OFDMA may be implemented by radio technology such as institute of electrical and electronics engineers (IEEE) 802.11 (Wi-Fi), IEEE 802.16 (WiMAX), IEEE 802.20, evolved-UTRA (E-UTRA), etc. UTRA is part of universal mobile telecommunications system (UMTS) and 3rd generation partnership project (3GPP) long-term evolution (LTE) is part of E-UMTS using E-UTRA. 3GPP LTE adopts OFDMA on downlink (DL) and adopts SC-FDMA on uplink (UL). LTE-advanced (LTE-A) is an evolved version of 3GPP LTE.

For convenience of description, description will be given under the assumption that the present disclosure is applied to LTE and/or new RAT (NR). However, the technical features of the present disclosure are not limited thereto. For example, although the following detailed description is given based on mobile communication systems corresponding to 3GPP LTE/NR systems, the present disclosure is applicable to other arbitrary mobile communication systems except for matters that are specific to the 3GPP LTE/NR system.

For terms and techniques that are not described in detail among terms and techniques used in the present disclosure, reference may be made to 3GPP based standard specifications, for example, 3GPP TS 36.211, 3GPP TS 36.212, 3GPP TS 36.213, 3GPP TS 36.321, 3GPP TS 36.300, 3GPP TS 36.331, 3GPP TS 37.213, 3GPP TS 38.211, 3GPP TS 38.212, 3GPP TS 38.213, 3GPP TS 38.214, 3GPP TS 38.300, 3GPP TS 38.331, etc.

In examples of the present disclosure described later, if a device “assumes” something, this may mean that a channel transmission entity transmits a channel in compliance with the corresponding “assumption.” This also may mean that a channel reception entity receives or decodes the channel in the form of conforming to the “assumption” on the premise that the channel has been transmitted in compliance with the “assumption.”

In the present disclosure, a user equipment (UE) may be fixed or mobile. Each of various devices that transmit and/or receive user data and/or control information by communicating with a base station (BS) may be the UE. The term UE may be referred to as terminal equipment, mobile station (MS), mobile terminal (MT), user terminal (UT), subscriber station (SS), wireless device, personal digital assistant (PDA), wireless modem, handheld device, etc. In the present disclosure, the term user is used to refer to a UE. In the present disclosure, a BS refers to a fixed station that communicates with a UE and/or another BS and exchanges data and control information with a UE and another BS. The term BS may be referred to as advanced base station (ABS), Node-B (NB), evolved Node-B (eNB), base transceiver system (BTS), access point (AP), processing server (PS), etc. Particularly, a BS of a universal terrestrial radio access network (UTRAN) is referred to as an NB, a BS of an evolved-UTRAN (E-UTRAN) is referred to as an eNB, and a BS of a new radio access technology network is referred to as a gNB. Hereinbelow, for convenience of description, the NB, eNB, or gNB will be referred to as a BS regardless of the type or version of communication technology.

In the present disclosure, a transmission and reception point (TRP) refers to a fixed point capable of transmitting/receiving a radio signal to/from a UE by communication with the UE. Various types of BSs may be used as TRPs regardless of the names thereof. For example, a BS, NB, eNB, pico-cell eNB (PeNB), home eNB (HeNB), relay, repeater, etc. may be a TRP. Furthermore, a TRP may not be a BS. For example, a remote radio head (RRH) or a remote radio unit (RRU) may be a TRP. Generally, the RRH and RRU have power levels lower than that of the BS. Since the RRH or RRU (hereinafter, RRH/RRU) is generally connected to the BS through a dedicated line such as an optical cable, cooperative communication by the RRH/RRU and the BS may be performed more smoothly than cooperative communication by BSs connected through a wireless link. At least one antenna is installed per TRP. An antenna may refer to a physical antenna port or refer to a virtual antenna or an antenna group. The TRP may also be called a point.

In the present disclosure, a cell refers to a specific geographical area in which one or more TRPs provide communication services. Accordingly, in the present disclosure, communication with a specific cell may mean communication with a BS or a TRP providing communication services to the specific cell. A DL/UL signal of the specific cell refers to a DL/UL signal from/to the BS or the TRP providing communication services to the specific cell. A cell providing UL/DL communication services to a UE is especially called a serving cell. Furthermore, channel status/quality of the specific cell refers to channel status/quality of a channel or a communication link generated between the BS or the TRP providing communication services to the specific cell and the UE. In 3GPP-based communication systems, the UE may measure a DL channel state from a specific TRP using cell-specific reference signal(s) (CRS(s)) transmitted on a CRS resource and/or channel state information reference signal(s) (CSI-RS(s)) transmitted on a CSI-RS resource, allocated to the specific TRP by antenna port(s) of the specific TRP.

A 3GPP-based communication system uses the concept of a cell in order to manage radio resources, and a cell related with the radio resources is distinguished from a cell of a geographic area.

The “cell” of the geographic area may be understood as coverage within which a TRP may provide services using a carrier, and the “cell” of the radio resources is associated with bandwidth (BW), which is a frequency range configured by the carrier. Since DL coverage, which is a range within which the TRP is capable of transmitting a valid signal, and UL coverage, which is a range within which the TRP is capable of receiving the valid signal from the UE, depend upon a carrier carrying the signal, coverage of the TRP may also be associated with coverage of the “cell” of radio resources used by the TRP. Accordingly, the term “cell” may be used to indicate service coverage by the TRP sometimes, radio resources at other times, or a range that a signal using the radio resources may reach with valid strength at other times.

In 3GPP communication standards, the concept of the cell is used in order to manage radio resources. The “cell” associated with the radio resources is defined by a combination of DL resources and UL resources, that is, a combination of a DL component carrier (CC) and a UL CC. The cell may be configured by the DL resources only or by the combination of the DL resources and the UL resources. If carrier aggregation is supported, linkage between a carrier frequency of the DL resources (or DL CC) and a carrier frequency of the UL resources (or UL CC) may be indicated by system information. In this case, the carrier frequency may be equal to or different from a center frequency of each cell or CC.

In a wireless communication system, the UE receives information on DL from the BS and the UE transmits information on UL to the BS. The information that the BS and UE transmit and/or receive includes data and a variety of control information and there are various physical channels according to types/usage of the information that the UE and the BS transmit and/or receive.

The 3GPP-based communication standards define DL physical channels corresponding to resource elements carrying information originating from a higher layer and DL physical signals corresponding to resource elements which are used by the physical layer but do not carry the information originating from the higher layer. For example, a physical downlink shared channel (PDSCH), a physical broadcast channel (PBCH), a physical multicast channel (PMCH), a physical control format indicator channel (PCFICH), a physical downlink control channel (PDCCH), etc. are defined as the DL physical channels, and a reference signal (RS) and a synchronization signal (SS) are defined as the DL physical signals. The RS, which is also referred to as a pilot, represents a signal with a predefined special waveform known to both the BS and the UE. For example, a demodulation reference signal (DMRS), a channel state information RS (CSI-RS), etc. are defined as DL RSs. The 3GPP-based communication standards define UL physical channels corresponding to resource elements carrying information originating from the higher layer and UL physical signals corresponding to resource elements which are used by the physical layer but do not carry the information originating from the higher layer. For example, a physical uplink shared channel (PUSCH), a physical uplink control channel (PUCCH), and a physical random access channel (PRACH) are defined as the UL physical channels, and a DMRS for a UL control/data signal, a sounding reference signal (SRS) used for UL channel measurement, etc. are defined.

In the present disclosure, the PDCCH refers to a set of time-frequency resources (e.g., a set of resource elements (REs)) that carry downlink control information (DCI), and the PDSCH refers to a set of time-frequency resources (e.g., a set of REs) that carry DL data. The PUCCH, PUSCH, and PRACH refer to a set of time-frequency resources (i.e., a set of REs) that carry uplink control information (UCI), UL data, and random access signals, respectively. In the following description, the meaning of “The UE transmits/receives the PUCCH/PUSCH/PRACH” is that the UE transmits/receives the UCI/UL data/random access signals on or through the PUCCH/PUSCH/PRACH, respectively. In addition, the meaning of “the BS transmits/receives the PBCH/PDCCH/PDSCH” is that the BS transmits the broadcast information/DCI/DL data on or through a PBCH/PDCCH/PDSCH, respectively.

In the present disclosure, a radio resource (e.g., a time-frequency resource) scheduled or configured for the UE by the BS for transmission or reception of PUCCH/PUSCH/PDSCH is also referred to as a PUCCH/PUSCH/PDSCH resource.

Since a communication device receives an SS/PBCH resource block (SSB), DMRS, CSI-RS, PBCH, PDCCH, PDSCH, PUSCH, and/or PUCCH in the form of radio signals on a cell, the communication device may not select and receive radio signals including only a specific physical channel or a specific physical signal through a radio frequency (RF) receiver, or may not select and receive radio signals without a specific physical channel or a specific physical signal through the RF receiver. In actual operations, the communication device receives radio signals on the cell via the RF receiver, converts the radio signals, which are RF band signals, into baseband signals, and then decodes physical signals and/or physical channels in the baseband signals using one or more processors. Thus, in some implementations of the present disclosure, not receiving physical signals and/or physical channels may mean that a communication device does not attempt to restore the physical signals and/or physical channels from radio signals, for example, does not attempt to decode the physical signals and/or physical channels, rather than that the communication device does not actually receive the radio signals including the corresponding physical signals and/or physical channels.

FIG. 1 illustrates an example of a communication system 1 to which implementations of the present disclosure are applied. Referring to FIG. 1, the communication system 1 applied to the present disclosure includes wireless devices, BSs, and a network. Here, the wireless devices represent devices performing communication using radio access technology (RAT) (e.g., 5G New RAT (NR), LTE (e.g., E-UTRA), or 6G) and may be referred to as communication/radio/5G devices. The wireless devices may include, without being limited to, a robot 100a, vehicles 100b-1 and 100b-2, an extended reality (XR) device 100c, a hand-held device 100d, a home appliance 100e, an Internet of Things (IoT) device 100f, and an artificial intelligence (AI) device/server 400. For example, the vehicles may include a vehicle having a wireless communication function, an autonomous driving vehicle, and a vehicle capable of performing vehicle-to-vehicle communication. Here, the vehicles may include an unmanned aerial vehicle (UAV) (e.g., a drone). The XR device may include an augmented reality (AR)/virtual reality (VR)/mixed reality (MR) device and may be implemented in the form of a head-mounted device (HMD), a head-up display (HUD) mounted in a vehicle, a television, a smartphone, a computer, a wearable device, a home appliance device, a digital signage, a vehicle, a robot, etc. The hand-held device may include a smartphone, a smartpad, a wearable device (e.g., a smartwatch or smartglasses), and a computer (e.g., a notebook). The home appliance may include a TV, a refrigerator, and a washing machine. The IoT device may include a sensor and a smartmeter. For example, the BSs and the network may also be implemented as wireless devices and a specific wireless device may operate as a BS/network node with respect to another wireless device.

The wireless devices 100a to 100f may be connected to a network 300 via BSs 200. AI technology may be applied to the wireless devices 100a to 100f and the wireless devices 100a to 100f may be connected to the AI server 400 via the network 300. The network 300 may be configured using a 3G network, a 4G (e.g., LTE) network, or a 5G (e.g., NR) network or 6G network to be introduced in the future. Although the wireless devices 100a to 100f may communicate with each other through the BSs 200/network 300, the wireless devices 100a to 100f may perform direct communication (e.g., sidelink communication) with each other without passing through the BSs/network. For example, the vehicles 100b-1 and 100b-2 may perform direct communication (e.g., vehicle-to-vehicle (V2V)/Vehicle-to-everything (V2X) communication). The IoT device (e.g., a sensor) may perform direct communication with other IoT devices (e.g., sensors) or other wireless devices 100a to 100f.

Wireless communication/connections 150a and 150b may be established between the wireless devices 100a to 100f and the BSs 200 and between the wireless devices 100a to 100f. Here, the wireless communication/connections such as UL/DL communication 150a and sidelink communication 150b (or, device-to-device (D2D) communication) may be established by various RATs. The wireless devices and the BSs/wireless devices may transmit/receive radio signals to/from each other through the wireless communication/connections 150a and 150b. To this end, at least a part of various configuration information configuring processes, various signal processing processes (e.g., channel encoding/decoding, modulation/demodulation, and resource mapping/demapping), and resource allocating processes, for transmitting/receiving radio signals, may be performed based on the various proposals of the present disclosure.

FIG. 2 is a block diagram illustrating examples of communication devices capable of performing a method according to the present disclosure. Referring to FIG. 2, a first wireless device 100 and a second wireless device 200 may transmit and/or receive radio signals through a variety of RATs. Here, {the first wireless device 100 and the second wireless device 200} may correspond to {the wireless device 100x and the BS 200} and/or {the wireless device 100x and the wireless device 100x} of FIG. 1.

The first wireless device 100 may include one or more processors 102 and one or more memories 104 and additionally further include one or more transceivers 106 and/or one or more antennas 108. The processor(s) 102 may control the memory(s) 104 and/or the transceiver(s) 106 and may be configured to implement the below-described/proposed functions, procedures, and/or methods. For example, the processor(s) 102 may process information within the memory(s) 104 to generate first information/signals and then transmit radio signals including the first information/signals through the transceiver(s) 106. The processor(s) 102 may receive radio signals including second information/signals through the transceiver(s) 106 and then store information obtained by processing the second information/signals in the memory(s) 104. The memory(s) 104 may be connected to the processor(s) 102 and may store a variety of information related to operations of the processor(s) 102. For example, the memory(s) 104 may perform a part or all of processes controlled by the processor(s) 102 or store software code including instructions for performing the below-described/proposed procedures and/or methods. Here, the processor(s) 102 and the memory(s) 104 may be a part of a communication modem/circuit/chip designed to implement wireless communication technology. The transceiver(s) 106 may be connected to the processor(s) 102 and transmit and/or receive radio signals through one or more antennas 108. Each of the transceiver(s) 106 may include a transmitter and/or a receiver. The transceiver(s) 106 is used interchangeably with radio frequency (RF) unit(s). In the present disclosure, the wireless device may represent the communication modem/circuit/chip.

The second wireless device 200 may include one or more processors 202 and one or more memories 204 and may further include one or more transceivers 206 and/or one or more antennas 208. The processor(s) 202 may control the memory(s) 204 and/or the transceiver(s) 206 and may be configured to implement the below-described/proposed functions, procedures, and/or methods. For example, the processor(s) 202 may process information within the memory(s) 204 to generate third information/signals and then transmit radio signals including the third information/signals through the transceiver(s) 206. The processor(s) 202 may receive radio signals including fourth information/signals through the transceiver(s) 206 and then store information obtained by processing the fourth information/signals in the memory(s) 204. The memory(s) 204 may be connected to the processor(s) 202 and may store a variety of information related to operations of the processor(s) 202. For example, the memory(s) 204 may perform a part or all of processes controlled by the processor(s) 202 or store software code including instructions for performing the below-described/proposed procedures and/or methods. Here, the processor(s) 202 and the memory(s) 204 may be a part of a communication modem/circuit/chip designed to implement wireless communication technology. The transceiver(s) 206 may be connected to the processor(s) 202 and transmit and/or receive radio signals through one or more antennas 208. Each of the transceiver(s) 206 may include a transmitter and/or a receiver. The term transceiver(s) 206 may be used interchangeably with RF unit(s). In the present disclosure, the wireless device may represent the communication modem/circuit/chip.

The wireless communication technology implemented in the wireless devices 100 and 200 of the present disclosure may include narrowband Internet of Things (NB-IoT) for low-power communication as well as LTE, NR, and 6G communications. For example, NB-IoT technology may be an example of Low Power Wide Area Network (LPWAN) technology, and may be implemented by, but is not limited to, standards such as LTE Cat NB1 and/or LTE Cat NB2. Additionally or alternatively, the wireless communication technology implemented in the wireless devices 100 and 200 of the present disclosure may perform communication based on the LTE-M technology. For example, the LTE-M technology may be an example of the LPWAN technology, and may be called by various names such as enhanced machine type communication (eMTC). For example, the LTE-M technology may be implemented by, but is not limited to, at least one of various standards such as 1) LTE Cat 0, 2) LTE Cat M1, 3) LTE Cat M2, 4) LTE non-BL (non-Bandwidth Limited), 5) LTE-MTC, 6) LTE Machine Type Communication, and/or 7) LTE M. Additionally or alternatively, the wireless communication technology implemented in the wireless devices 100 and 200 of the present disclosure may include, but is not limited to, at least one of ZigBee, Bluetooth, and LPWAN considering low-power communication. For example, the ZigBee technology may create personal area networks (PANs) related to small/low-power digital communications based on various standards such as IEEE 802.15.4, and may be called by various names.

Hereinafter, hardware elements of the wireless devices 100 and 200 will be described more specifically. One or more protocol layers may be implemented by, without being limited to, one or more processors 102 and 202. For example, the one or more processors 102 and 202 may implement one or more layers (e.g., functional layers such as a physical (PHY) layer, medium access control (MAC) layer, a radio link control (RLC) layer, a packet data convergence protocol (PDCP) layer, radio resource control (RRC) layer, and a service data adaptation protocol (SDAP) layer). The one or more processors 102 and 202 may generate one or more protocol data units (PDUs) and/or one or more service data units (SDUs) according to the functions, procedures, proposals, and/or methods disclosed in the present disclosure. The one or more processors 102 and 202 may generate messages, control information, data, or information according to the functions, procedures, proposals, and/or methods disclosed in the present disclosure. The one or more processors 102 and 202 may generate signals (e.g., baseband signals) including PDUs, SDUs, messages, control information, data, or information according to the functions, procedures, proposals, and/or methods disclosed in the present disclosure and provide the generated signals to the one or more transceivers 106 and 206. The one or more processors 102 and 202 may receive the signals (e.g., baseband signals) from the one or more transceivers 106 and 206 and acquire the PDUs, SDUs, messages, control information, data, or information according to the functions, procedures, proposals, and/or methods disclosed in the present disclosure.

The one or more processors 102 and 202 may be referred to as controllers, microcontrollers, microprocessors, or microcomputers. The one or more processors 102 and 202 may be implemented by hardware, firmware, software, or a combination thereof. As an example, one or more application specific integrated circuits (ASICs), one or more digital signal processors (DSPs), one or more digital signal processing devices (DSPDs), one or more programmable logic devices (PLDs), or one or more field programmable gate arrays (FPGAs) may be included in the one or more processors 102 and 202. The functions, procedures, proposals, and/or methods disclosed in the present disclosure may be implemented using firmware or software, and the firmware or software may be configured to include the modules, procedures, or functions. Firmware or software configured to perform the functions, procedures, proposals, and/or methods disclosed in the present disclosure may be included in the one or more processors 102 and 202 or stored in the one or more memories 104 and 204 so as to be driven by the one or more processors 102 and 202. The functions, procedures, proposals, and/or methods disclosed in the present disclosure may be implemented using firmware or software in the form of code, commands, and/or a set of commands.

The one or more memories 104 and 204 may be connected to the one or more processors 102 and 202 and store various types of data, signals, messages, information, programs, code, commands, and/or instructions. The one or more memories 104 and 204 may be configured by read-only memories (ROMs), random access memories (RAMs), electrically erasable programmable read-only memories (EEPROMs), flash memories, hard drives, registers, cache memories, computer-readable storage media, and/or combinations thereof. The one or more memories 104 and 204 may be located at the interior and/or exterior of the one or more processors 102 and 202. The one or more memories 104 and 204 may be connected to the one or more processors 102 and 202 through various technologies such as wired or wireless connection.

The one or more transceivers 106 and 206 may transmit user data, control information, and/or radio signals/channels, mentioned in the methods and/or operational flowcharts of the present disclosure, to one or more other devices. The one or more transceivers 106 and 206 may receive user data, control information, and/or radio signals/channels, mentioned in the functions, procedures, proposals, methods, and/or operational flowcharts disclosed in the present disclosure, from one or more other devices. For example, the one or more transceivers 106 and 206 may be connected to the one or more processors 102 and 202 and transmit and receive radio signals. For example, the one or more processors 102 and 202 may perform control so that the one or more transceivers 106 and 206 may transmit user data, control information, or radio signals to one or more other devices. The one or more processors 102 and 202 may perform control so that the one or more transceivers 106 and 206 may receive user data, control information, or radio signals from one or more other devices. The one or more transceivers 106 and 206 may be connected to the one or more antennas 108 and 208. The one or more transceivers 106 and 206 may be configured to transmit and receive user data, control information, and/or radio signals/channels, mentioned in the functions, procedures, proposals, methods, and/or operational flowcharts disclosed in the present disclosure, through the one or more antennas 108 and 208. In the present disclosure, the one or more antennas may be a plurality of physical antennas or a plurality of logical antennas (e.g., antenna ports). The one or more transceivers 106 and 206 may convert received radio signals/channels etc. from RF band signals into baseband signals in order to process received user data, control information, radio signals/channels, etc. using the one or more processors 102 and 202. 
The one or more transceivers 106 and 206 may convert the user data, control information, radio signals/channels, etc. processed using the one or more processors 102 and 202 from baseband signals into RF band signals. To this end, the one or more transceivers 106 and 206 may include (analog) oscillators and/or filters.

FIG. 3 illustrates another example of a wireless device capable of performing implementation(s) of the present disclosure. Referring to FIG. 3, wireless devices 100 and 200 may correspond to the wireless devices 100 and 200 of FIG. 2 and may be configured by various elements, components, units/portions, and/or modules. For example, each of the wireless devices 100 and 200 may include a communication unit 110, a control unit 120, a memory unit 130, and additional components 140. The communication unit 110 may include a communication circuit 112 and transceiver(s) 114. For example, the communication circuit 112 may include the one or more processors 102 and 202 and/or the one or more memories 104 and 204 of FIG. 2. For example, the transceiver(s) 114 may include the one or more transceivers 106 and 206 and/or the one or more antennas 108 and 208 of FIG. 2. The control unit 120 is electrically connected to the communication unit 110, the memory unit 130, and the additional components 140 and controls overall operation of the wireless devices. For example, the control unit 120 may control an electric/mechanical operation of the wireless device based on programs/code/commands/information stored in the memory unit 130. The control unit 120 may transmit the information stored in the memory unit 130 to the exterior (e.g., other communication devices) via the communication unit 110 through a wireless/wired interface or store, in the memory unit 130, information received through the wireless/wired interface from the exterior (e.g., other communication devices) via the communication unit 110.

The additional components 140 may be variously configured according to types of wireless devices. For example, the additional components 140 may include at least one of a power unit/battery, an input/output (I/O) unit, a driving unit, and a computing unit. The wireless device may be implemented in the form of, without being limited to, the robot (100a of FIG. 1), the vehicles (100b-1 and 100b-2 of FIG. 1), the XR device (100c of FIG. 1), the hand-held device (100d of FIG. 1), the home appliance (100e of FIG. 1), the IoT device (100f of FIG. 1), a digital broadcast UE, a hologram device, a public safety device, an MTC device, a medical device, a fintech device (or a finance device), a security device, a climate/environment device, the AI server/device (400 of FIG. 1), the BS (200 of FIG. 1), a network node, etc. The wireless device may be used in a mobile or fixed place according to a use-case/service.

In FIG. 3, the entirety of the various elements, components, units/portions, and/or modules in the wireless devices 100 and 200 may be connected to each other through a wired interface or at least a part thereof may be wirelessly connected through the communication unit 110. For example, in each of the wireless devices 100 and 200, the control unit 120 and the communication unit 110 may be connected by wire and the control unit 120 and first units (e.g., 130 and 140) may be wirelessly connected through the communication unit 110. Each element, component, unit/portion, and/or module within the wireless devices 100 and 200 may further include one or more elements. For example, the control unit 120 may be configured by a set of one or more processors. As an example, the control unit 120 may be configured by a set of a communication control processor, an application processor, an electronic control unit (ECU), a graphics processing unit, and a memory control processor. As another example, the memory unit 130 may be configured by a random access memory (RAM), a dynamic RAM (DRAM), a read-only memory (ROM), a flash memory, a transitory memory, a non-transitory memory, and/or a combination thereof.

In the present disclosure, the at least one memory (e.g., 104 or 204) may store instructions or programs, and the instructions or programs may cause, when executed, at least one processor operably connected to the at least one memory to perform operations according to some embodiments or implementations of the present disclosure.

In the present disclosure, a computer readable (non-transitory) storage medium may store at least one instruction or program, and the at least one instruction or program may cause, when executed by at least one processor, the at least one processor to perform operations according to some embodiments or implementations of the present disclosure.

In the present disclosure, a processing device or apparatus may include at least one processor, and at least one computer memory operably connected to the at least one processor. The at least one computer memory may store instructions or programs, and the instructions or programs may cause, when executed, the at least one processor operably connected to the at least one memory to perform operations according to some embodiments or implementations of the present disclosure.

In the present disclosure, a computer program may include program code stored on at least one computer-readable (non-transitory) storage medium and, when executed, configured to perform operations according to some implementations of the present disclosure or cause at least one processor to perform the operations according to some implementations of the present disclosure. The computer program may be provided in the form of a computer program product. The computer program product may include at least one computer-readable (non-transitory) storage medium.

A communication device of the present disclosure includes at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions for causing, when executed, the at least one processor to perform operations according to example(s) of the present disclosure described later.

Wireless communication systems are extensively deployed to provide various types of communication services such as voice and data. The demand for higher data rates is increasing to accommodate incoming new services and/or scenarios where the virtual and real worlds blend. To address these ever-growing demands, new communication technologies beyond 5G are required. New communication technologies beyond 5G systems (hereinafter referred to as 6G) aim to achieve (i) extremely high data speeds per device, (ii) a very large number of connected devices, (iii) global connectivity, (iv) ultra-low latency, (v) reduced energy consumption of battery-free IoT devices, (vi) ultra-reliable connections, and (vii) connected intelligence with machine learning capabilities. In the 6G system, the following technologies are being considered: artificial intelligence (AI), terahertz (THz) communication, optical wireless communication (OWC), free space optics (FSO) backhaul network, massive multiple-input multiple-output (MIMO) technology, blockchain, three-dimensional (3D) networking, quantum communication, unmanned aerial vehicle (UAV), cell-free communication, integration of wireless information and energy transmission, integration of sensing and communication, integration of access backhaul networks, hologram beamforming, big data analysis, large intelligent surface (LIS), and so on.

In particular, there has been a rapid increase in attempts to integrate AI into communication systems. Methods being attempted in relation to AI may be broadly divided into two categories: AI for communications (AI4C), which uses AI to enhance communication performance, and communications for AI (C4AI), which develops communication technologies to support AI. In the AI4C field, designs have been attempted to replace the roles of channel encoders/decoders, modulators/demodulators, or channel equalizers with end-to-end autoencoders or neural networks. In the C4AI field, federated learning, which is one type of distributed learning, updates a common prediction model by sharing only the weights and gradients of models with the server, without sharing raw device data, thereby protecting privacy. In addition, there is a method for distributing the loads of devices, network edges, and cloud servers based on split inference.

Introducing AI into communications may simplify and enhance real-time data transmission. AI may use numerous analytics to determine a method of performing complex target tasks. In other words, AI may increase efficiency and reduce processing delays.

Time-consuming tasks such as handover, network selection, and resource scheduling may be instantly performed using AI. AI may also play a significant role in machine-to-machine, machine-to-human, and human-to-machine communications. AI-based communication systems may be supported by meta-materials, intelligent architectures, intelligent networks, intelligent devices, intelligent cognitive radio, self-sustaining wireless networks, and machine learning.

Recent attempts to integrate AI into wireless communication systems have primarily focused on the application layer, network layer, and particularly on wireless resource management and allocation. However, research into integrating AI into wireless communication systems is increasingly evolving towards the MAC layer and the physical layer. In particular, there are emerging attempts to combine deep learning with wireless transmission at the physical layer. AI-based physical layer transmission refers to applying AI-driven signal processing and communication mechanisms, rather than traditional communication frameworks, to the fundamental signal processing and communication mechanisms. For example, the AI-based physical layer transmission may include deep learning-based channel coding and decoding, deep learning-based signal estimation and detection, deep learning-based MIMO mechanisms, AI-based resource scheduling and allocation, and the like.

Machine learning may be used for channel estimation and channel tracking. Machine learning may also be used for power allocation, interference cancellation, etc. in the downlink (DL) physical layer. Machine learning may also be used in MIMO systems for antenna selection, power control, and symbol detection.

However, applying deep neural networks for transmission at the physical layer may have the following issues.

Deep learning-based AI algorithms require a large amount of training data to optimize training parameters. However, due to limitations in acquiring data from specific channel environments, a significant amount of training data is often used offline. Static training on data from specific channel environments may conflict with the dynamic characteristics and diversity of wireless channels.

Furthermore, current deep learning primarily targets real signals. However, signals at the physical layer of wireless communication are complex signals. More research is needed on neural networks for detecting complex-domain signals to match the characteristics of wireless communication signals.

Hereinafter, machine learning will be described in detail.

Machine learning refers to a series of operations for training machines to perform tasks that are difficult for humans to perform. Machine learning requires data and learning models. In machine learning, data learning methods may be broadly categorized into three types: supervised learning, unsupervised learning, and reinforcement learning.

Neural network learning aims to minimize errors in outputs. Neural network learning refers to a process of repeatedly inputting training data to a neural network, calculating the error of the output and target of the neural network for the training data, backpropagating the error of the neural network from the output layer of the neural network to the input layer to reduce the error, and updating the weight of each node of the neural network.

Supervised learning may use training data labeled with a correct answer, whereas unsupervised learning may use training data that is not labeled with a correct answer. For example, in the case of supervised learning for data classification, training data may be labeled with each category. The labeled training data may be input to the neural network, and the output (category) of the neural network may be compared with the label of the training data, thereby calculating the error. The calculated error may be backpropagated through the neural network in reverse (that is, from the output layer to the input layer), and the connection weight(s) of each node of each layer of the neural network may be updated based on the backpropagation. Changes in the updated connection weight(s) of each node may be determined based on the learning rate. The calculation of the neural network for input data and the backpropagation of the error may constitute a learning epoch. The learning rate may be applied differently depending on the number of repetitions of the learning epoch of the neural network. For example, in the early phase of learning of the neural network, a high learning rate may be used to increase efficiency such that the neural network rapidly ensures a certain level of performance, but in the late phase of learning, a low learning rate may be used to increase accuracy.
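As a non-limiting illustration of the learning epochs described above, the following minimal sketch trains a single sigmoid neuron on labeled data, using a high learning rate in the early phase and a low learning rate in the late phase. The toy task (logical AND), the cross-entropy error, the function names, the epoch counts, and the two learning-rate values are assumptions chosen only for illustration and are not taken from the present disclosure.

```python
import math

# Labeled training data: (input vector, correct answer). Assumed toy task: logical AND.
DATA = [((0.0, 0.0), 0.0), ((0.0, 1.0), 0.0), ((1.0, 0.0), 0.0), ((1.0, 1.0), 1.0)]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(w, b, x):
    """Output of a single neuron for input x."""
    return sigmoid(w[0] * x[0] + w[1] * x[1] + b)


def mean_loss(w, b):
    """Mean cross-entropy error between outputs and labels."""
    total = 0.0
    for x, y in DATA:
        p = predict(w, b, x)
        total -= y * math.log(p) + (1.0 - y) * math.log(1.0 - p)
    return total / len(DATA)


def train(epochs=2000, switch=1000, lr_early=1.0, lr_late=0.1):
    """Run learning epochs: forward calculation, error, gradient, weight update."""
    w, b = [0.0, 0.0], 0.0
    history = []
    for epoch in range(epochs):
        # High learning rate early (fast progress), low learning rate late (accuracy).
        lr = lr_early if epoch < switch else lr_late
        grad_w, grad_b = [0.0, 0.0], 0.0
        for x, y in DATA:
            err = predict(w, b, x) - y  # error term propagated back to the weights
            grad_w[0] += err * x[0]
            grad_w[1] += err * x[1]
            grad_b += err
        n = len(DATA)
        w = [w[i] - lr * grad_w[i] / n for i in range(2)]
        b -= lr * grad_b / n
        history.append(mean_loss(w, b))
    return w, b, history
```

Calling `train()` returns the learned weights, the bias, and the per-epoch error, which may be inspected to observe how the error changes across the two learning-rate phases.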

The learning method may vary depending on the feature of data. For example, learning may be performed based on supervised learning rather than unsupervised learning or reinforcement learning to allow a receiver to accurately predict data transmitted from a transmitter in a communication system.

The learning model corresponds to the human brain. To this end, the most basic linear model may be considered. However, a machine learning paradigm that uses highly complex neural network structures such as artificial neural networks as learning models is referred to as deep learning.

Neural network cores used for learning may be broadly categorized into a deep neural network (DNN), a convolutional neural network (CNN), and a recurrent neural network (RNN).

FIG. 4 illustrates a perceptron structure used in an artificial neural network.

An artificial neural network may be implemented by connecting multiple perceptrons. Referring to FIG. 4, a process of receiving an input vector of x=(x₁, x₂, . . . , x_d), multiplying each component by a weight of w=(w₁, w₂, . . . , w_d), summing up the results, and then applying an activation function σ(·) is referred to as a perceptron. For a large artificial neural network structure, the simplified perceptron structure shown in FIG. 4 may be extended and applied to a multi-dimensional perceptron with different input vectors.
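The perceptron operation described with reference to FIG. 4 may be sketched as follows. This is only an illustrative sketch: the function names and the choice of a sigmoid as the activation function σ(·) are assumptions, and no bias term is included because none is recited in the description of FIG. 4.

```python
import math


def sigmoid(z):
    """One possible activation function (assumed for illustration)."""
    return 1.0 / (1.0 + math.exp(-z))


def perceptron(x, w, activation=sigmoid):
    """Multiply each input component by its weight, sum, then apply the activation."""
    if len(x) != len(w):
        raise ValueError("input vector and weight vector must have the same dimension d")
    weighted_sum = sum(wi * xi for wi, xi in zip(w, x))
    return activation(weighted_sum)


# Example with d = 3: the weighted sum is 0.5 - 2.0 + 1.5 = 0.0.
output = perceptron((1.0, 2.0, 3.0), (0.5, -1.0, 0.5))
```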

FIG. 5 illustrates a multilayer perceptron structure.

The perceptron structure shown in FIG. 4 may be extended to a multilayer perceptron structure having a total of three layers based on input and output values. An artificial neural network having H perceptrons of (d+1) dimensions between the first and second layers and K perceptrons of (H+1) dimensions between the second and third layers may be represented by the multilayer perceptron structure shown in FIG. 5.

A layer where input vectors are located is called an input layer, a layer where final output value(s) are located is called an output layer, and all layers between the input and output layers are referred to as hidden layers. In the example of FIG. 5, three layers are illustrated. However, since the actual number of layers in an artificial neural network is counted excluding the input layer, the artificial neural network based on the multilayer perceptron structure in FIG. 5 may be considered as having two layers. An artificial neural network is constructed by two-dimensionally connecting perceptrons of basic blocks.
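The three-layer structure of FIG. 5 may be sketched as below, under the assumption that the "(d+1) dimensions" and "(H+1) dimensions" mentioned above correspond to d (or H) inputs plus one constant input used as a bias. That interpretation, the function names, and the sigmoid activation are assumptions made for illustration only.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def layer(inputs, weights, activation=sigmoid):
    """Apply one perceptron per weight row; a constant 1.0 is appended as the bias input."""
    extended = list(inputs) + [1.0]
    return [activation(sum(w * v for w, v in zip(row, extended))) for row in weights]


def mlp_forward(x, w_hidden, w_out, activation=sigmoid):
    """Input layer -> hidden layer (H perceptrons) -> output layer (K perceptrons)."""
    hidden = layer(x, w_hidden, activation)
    return layer(hidden, w_out, activation)


def count_weights(d, H, K):
    """H perceptrons of (d+1) dimensions plus K perceptrons of (H+1) dimensions."""
    return H * (d + 1) + K * (H + 1)


# Example sizes (assumed): d = 3 inputs, H = 4 hidden perceptrons, K = 2 outputs.
d, H, K = 3, 4, 2
w_hidden = [[0.0] * (d + 1) for _ in range(H)]
w_out = [[0.0] * (H + 1) for _ in range(K)]
y = mlp_forward([1.0, 2.0, 3.0], w_hidden, w_out)
```

With all weights set to zero, every perceptron outputs σ(0), so the example merely exercises the data flow; in practice the weights would be obtained through learning.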

In a neural network, layers are composed of small individual units called neurons. In the neural network, neurons receive inputs from other neurons, perform processing, and produce outputs. A region within the previous layer where each neuron receives inputs is called a receptive field. Each neuron computes output values by applying a specific function to input values received from the receptive field within the previous layer. The specific function applied to the input values is determined by i) a vector of weights and ii) biases. Learning in the neural network is performed based on iterative adjustment of the biases and weights. The vector of weights and the biases are called filters, which represent particular features of the input.

The aforementioned input layer, hidden layer, and output layer may be commonly applied not only to the multilayer perceptron structure but also to various artificial neural network structures such as CNNs, which will be discussed later. As the number of hidden layers increases, the artificial neural network becomes deeper, and the machine learning paradigm that uses sufficiently deep artificial neural networks as learning models is called deep learning. In addition, an artificial neural network used for deep learning is called a DNN.

The aforementioned multilayer perceptron structure is referred to as a fully-connected neural network. In the fully-connected neural network, there are no connections between neurons within the same layer, and connections exist only between neurons in adjacent layers. A DNN, which has the fully-connected neural network structure, includes multiple hidden layers and combinations of activation functions, and thus the DNN may be effectively applied to capture the characteristics of correlation between inputs and outputs. Here, the correlation characteristic may mean the joint probability of inputs and outputs.

On the other hand, various artificial neural network structures distinct from the DNN may be formed depending on how multiple perceptrons are connected to each other.

FIG. 6 illustrates the structure of a CNN.

In a DNN, neurons within a layer are arranged in a one-dimensional manner. However, referring to FIG. 6, in the CNN, neurons may be assumed to be arranged in a two-dimensional manner, with w neurons horizontally and h neurons vertically. In this case, since a weight is added for each connection from a single input neuron to the hidden layer, a total of h×w weights need to be considered for each input neuron. Since there are h×w neurons in the input layer, a total of h²w² weights are required between two adjacent layers.

FIG. 7 illustrates a filtering operation in a CNN.

The CNN shown in FIG. 6 faces the issue of a rapid increase in the number of weights depending on the number of connections. Thus, small-sized filters are assumed to exist instead of considering connections between all neurons in adjacent layers. Then, weighted sum and activation function operations are performed on the regions where the filter overlaps the input, as shown in FIG. 7.

A single filter has weights corresponding to the size of the filter and may undergo learning of the weights such that the filter extracts specific features from an image as factors and produces outputs based on the factors. In FIG. 7, a 3×3 filter is applied to a top-left 3×3 region of an input layer, and an output value obtained by performing the weighted sum and activation function operations on related neurons is stored in z₂₂.

The filter scans the input layer, performs the weighted sum and activation function operations while moving horizontally and vertically at regular intervals, and places the output value at the current position of the filter. This operation method is similar to a convolution operation on images in the field of computer vision. Thus, a DNN with such a structure is called a CNN, and a hidden layer generated by the convolution operation is referred to as a convolutional layer. In addition, a neural network with multiple convolutional layers is called a deep convolutional neural network (DCNN).
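The scanning operation described above may be sketched as follows. This is an illustrative sketch only: the function names, the ReLU activation, and the stride parameter (standing in for the "regular intervals" of the filter movement) are assumptions, and output positions are indexed from zero, so that the first output element corresponds to the value described as being stored in z₂₂ in FIG. 7. As is common in CNN implementations, the filter is applied without flipping.

```python
def relu(z):
    """One possible activation function (assumed for illustration)."""
    return max(0, z)


def apply_filter(image, kernel, activation=relu, stride=1):
    """Slide the filter over the input; at each position take the weighted sum of the
    covered neurons and apply the activation function."""
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    output = []
    for i in range(0, h - kh + 1, stride):          # vertical movement
        row = []
        for j in range(0, w - kw + 1, stride):      # horizontal movement
            weighted_sum = sum(
                kernel[a][b] * image[i + a][j + b]
                for a in range(kh)
                for b in range(kw)
            )
            row.append(activation(weighted_sum))
        output.append(row)
    return output


# Example: a 4x4 input of ones scanned by a 3x3 filter of ones.
image = [[1] * 4 for _ in range(4)]
kernel = [[1] * 3 for _ in range(3)]
feature_map = apply_filter(image, kernel)
```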

In the convolutional layer, the weighted sum is calculated by considering only neuron(s) located within a region covered by the current filter, thereby reducing the number of weights. As a result, a single filter may focus on features within a local region. Therefore, the CNN may be effectively applied to process image data where a physical distance in two-dimensional space is an important criterion. In the CNN, multiple filters may be applied immediately before the convolutional layer, and multiple output results may be produced by convolution operations of each filter.
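The reduction in the number of weights may be illustrated with simple arithmetic. In the sketch below, the layer size of 28×28 neurons and the 3×3 filter size are example values assumed for illustration, and the function names are hypothetical.

```python
def fully_connected_weights(h, w):
    """Every one of the h*w input neurons connects to every one of the h*w neurons
    in the adjacent layer, giving (h*w)**2 weights."""
    return (h * w) ** 2


def single_filter_weights(k):
    """A k x k filter shares the same k*k weights at every position it visits."""
    return k * k


fc = fully_connected_weights(28, 28)    # 614656 weights
conv = single_filter_weights(3)         # 9 weights
```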

The fully connected layer connects every neuron in one layer to every neuron in another layer.

Neural network (NN) systems have an advantage of being capable of solving or optimizing difficult problems based on non-linearity. For NN-based wireless communication systems, end-to-end learning for simultaneously optimizing channel coding, modulation, and filtering at a transmitter and channel estimation and signal detection algorithms at a receiver has been proposed.

End-to-End Communication

In end-to-end communication, a transmitter NN and a receiver NN are trained jointly, and thus, high performance gains may be expected compared to conventional communication systems where individual blocks as well as a transmitter and a receiver are separately optimized. Accordingly, research is ongoing to address the following problems: CSI acquisition, pilot signal design, data transmission, beamforming, etc., by applying end-to-end communication.

However, to maximize the performance of end-to-end communication, both the transmitter and receiver need to be trained to adapt appropriately to a channel environment. For example, in an end-to-end communication system, a transmitter, channel, and receiver are implemented as NN(s), and trainable parameters of the transmitter and receiver may be jointly optimized for a specific channel model. However, compared to a system that uses an NN only at the receiver, end-to-end learning not only involves transmitting training signals from the transmitter to the receiver for weight calculation but also requires feedback from the receiver to the transmitter, which results in significant signaling overhead. To address this issue, offline learning or training only the NN of the receiver may be considered. However, offline learning has a disadvantage in that end-to-end communication is incapable of adaptively operating on a channel. In addition, if only the NN of the receiver is trained, the NN of the transmitter is not trained, and as a result, the transmitter is incapable of being fine-tuned. Thus, training only the NN of the receiver is a suboptimal method.

FIG. 8 illustrates the concept of end-to-end learning to which backpropagation is applied. In particular, FIG. 8 is a conceptual diagram illustrating training signals and feedback in end-to-end learning to which backpropagation is applied. In FIG. 8, ∇_t,p denotes the gradient of a p-th training signal in a t-th batch, H_t,p denotes a channel experienced by the p-th training signal in the t-th batch, X_t,p denotes the p-th training signal in the t-th batch, and L_t,p denotes a loss for the p-th training signal in the t-th batch.

Referring to FIG. 8, backpropagation is performed from a loss value at the receiver, which is calculated from a training signal x input from the transmitter, back to the transmitter. Because this feedback is required for every training signal, research is needed to explore an efficient method for jointly training a transmitter and receiver in end-to-end communication to ensure the optimal performance thereof. Hereinafter, some implementations of the present disclosure where end-to-end learning strategies and transfer learning are applied will be described.

FIG. 9 illustrates an example of computing a gradient in an NN.

In AI, training of an NN involves computing gradients, which are calculated through backpropagation. Backpropagation, also known as backward propagation of errors, is an algorithm for supervised learning in an artificial NN based on gradient descent. As illustrated in FIG. 9, backpropagation is computed based on the chain rule. When a specific training signal is provided as an input, gradient values for each layer are obtained from feedforward values calculated as the training signal passes through each layer of an NN.

In end-to-end learning, gradients may also be computed through backpropagation. Referring to FIG. 9, since terms differentiated with respect to w_t include derivative terms from the receiver, backpropagation values need to be transferred from the receiver to the transmitter. As a result, there is a significant increase in signaling overhead in end-to-end learning. In FIG. 9, derivative terms expressed by the chain rule correspond to values associated with parts connected by arrows along the transmission/reception path. In addition, in FIG. 9, t and k denote a t-th batch and a k-th training symbol, respectively, while w_nm denotes the element in an n-th row and an m-th column of w_t.
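
As a hedged illustration of why receiver-side derivative terms appear in the transmitter gradient, the following sketch assumes a simplified scalar model that is not taken from FIG. 9: the transmitter maps a training symbol s to x = f_Tx(s; w_t), the channel produces y = Hx + n, and the receiver computes the loss L from y. The symbols s, f_Tx, y, and n are assumptions introduced only for this example.

```latex
\frac{\partial L}{\partial w_{nm}}
  = \underbrace{\frac{\partial L}{\partial y}}_{\text{receiver}}
    \cdot
    \underbrace{\frac{\partial y}{\partial x}}_{\text{channel},\; = H}
    \cdot
    \underbrace{\frac{\partial x}{\partial w_{nm}}}_{\text{transmitter}}
```

Under this assumed model, the first two factors are known only on the receiver side, while the last factor depends on the state of the transmitter NN. Either the receiver-side factors are sent to the transmitter, or the transmitter-side state is made available at the receiver.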

While the NN system has various advantages, an NN needs to be trained to adapt appropriately to a channel environment to maximize the performance of the NN system. The training speed affects the performance of a communication system to which the NN is applied. Offline learning methods are being considered to speed up the training of the NN. However, since offline learning is incapable of adapting to channel changes, it is difficult to optimize the performance of the NN system with offline learning. Therefore, research is being conducted on online learning to speed up the training of the NN while adapting the NN to channels. In particular, transfer learning is considered for the NN to effectively perform online learning.

Hereinafter, some implementations of the present disclosure for applying transfer learning to a wireless communication system in consideration of a wireless channel environment will be described.

First, transfer learning (TL) and hierarchical feature representations will be briefly explained to facilitate understanding of some implementations of the present disclosure.

Transfer Learning (TL)

TL is a machine learning approach that focuses on storing knowledge acquired by solving one problem and applying the knowledge to different but related problems. The key idea of TL is to synthesize distilled knowledge accumulated from past diverse experiences as well as similar tasks to facilitate learning of new problems. TL techniques may reduce dependency on labeled data, improve the speed of learning, and enhance the robustness of machine learning to diverse wireless environments.

For example, TL may transfer the results learned at different times or from different tasks to an NN, enabling the NN to perform learning more quickly and with less computational load. Deep learning, which is a type of machine learning, uses a multi-layer architecture called a deep neural network (DNN), which is inspired by the structure of the human brain and consists of input layers, output layers, and multiple hidden layers therebetween as described above. The DNN is trained to perform specific tasks such as classification, clustering, or regression, and during the learning process, the DNN uses the knowledge thereof to execute the trained tasks. A trained deep learning model including architectures and parameters may be considered as knowledge obtained from training data. Deep learning may be considered as a means for TL to transfer knowledge between different domains. In the case of deep learning, information transfer between NNs for TL may involve transfer of parameters, weights, and so on.

FIG. 10 illustrates an example of TL to which deep learning is applied.

The strategies used in deep transfer learning (DTL) may be categorized into the following three strategies.

- 1) Off-the-shelf pre-trained models: Training deep learning models for complex tasks requires a significant amount of data and time. Such a lengthy training process is one of the major obstacles that hinders the progress of deep learning. Thus, pre-trained models, i.e., models trained in neighboring domains, may be applied directly to a target task, instead of training a DNN from the beginning. When the domains and tasks of a source and target are similar, the learned results of the source task learned in the source domain may be directly applicable to the target task in the target domain.
- 2) Pre-trained models as feature extractors: For traditional machine learning algorithms, raw data may not be directly used as an input, and preprocessing is required to extract features. A DNN may learn these features automatically. Based on the deep learning capability, pre-trained models may be used as feature extractors for target domain data. In particular, target data is provided to the pre-trained models to obtain new feature representations before being used. The new representations combined with knowledge from the source domain may enhance the learning process. As the feature extractors, the pre-trained models may be used when the source and target domains are similar but the source and target tasks are different. In this case, the learned results of the source task learned in the source domain may be used as the feature extractors in the target domain.
- 3) Fine-tuning pre-trained models: Fine-tuning of pre-trained models is similar to the second strategy described above in that pre-trained models are used. However, instead of freezing all parameters of the pre-trained models, all or part of the pre-trained source models may be further trained with target data to enhance the effectiveness of knowledge transfer. Fine-tuning of pre-trained models may be applied when the source and target domains are related but different and when the source and target tasks are the same. The learned results of the source task learned in the source domain are fine-tuned in the target domain.

In some implementations of the present disclosure, the fine-tuning pre-trained models among the three strategies described above are used to perform end-to-end learning.

FIG. 11 is a conceptual diagram illustrating TL to which fine-tuning pre-trained models are applied. In FIG. 11, “fc” represents a fully-connected layer, and “conv” represents a convolutional layer.

Referring to FIG. 11, fine-tuning of pre-trained models may be performed according to the following two methods: a) weight initialization, and b) selective fine-tuning.

- a) Weight initialization: The parameters of a target model are first initialized by the parameters of a pre-trained model. Then, a new model is trained with target domain data. Weight initialization is often applied when the target domain has a significant amount of labeled data.
- b) Selective fine-tuning: Considering that lower layers in a DNN are capable of learning (domain-independent) general features and higher layers are capable of learning (domain-dependent) specific features, it may be determined how many layers of a pre-trained model need to be tuned. In particular, when target data is small and the DNN has a large number of parameters, more layers are frozen to prevent overfitting issues. On the other hand, when the target data is large and the DNN has a small number of parameters, more layers should be trained with new data.

In FIG. 11(a), the weights of all layers are updated during fine-tuning, while in FIG. 11(b), the weights of only selected layer(s) are updated.
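
The two fine-tuning methods can be sketched as follows. This is a minimal, framework-free illustration rather than the implementation of the present disclosure: the `Layer` class, the function names, and the plain gradient-descent step are all hypothetical, and the choice of freezing the lowest layers follows only the general observation above that lower layers tend to learn domain-independent features.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Layer:
    """Hypothetical layer holding a flat list of weights."""
    name: str
    weights: List[float]
    trainable: bool = True


def init_from_pretrained(pretrained: List[Layer]) -> List[Layer]:
    """(a) Weight initialization: copy all pre-trained weights, keep every layer trainable."""
    return [Layer(layer.name, list(layer.weights), True) for layer in pretrained]


def freeze_lower_layers(model: List[Layer], num_frozen: int) -> None:
    """(b) Selective fine-tuning: freeze the first num_frozen layers, train the rest."""
    for index, layer in enumerate(model):
        layer.trainable = index >= num_frozen


def sgd_step(model: List[Layer], grads: List[List[float]], lr: float) -> None:
    """Apply one gradient-descent step to the trainable layers only."""
    for layer, layer_grads in zip(model, grads):
        if layer.trainable:
            layer.weights = [w - lr * g for w, g in zip(layer.weights, layer_grads)]


# Usage: start from pre-trained weights, then fine-tune only the last layer.
pretrained = [Layer("conv1", [1.0, 2.0]), Layer("conv2", [3.0]), Layer("fc", [4.0])]
model = init_from_pretrained(pretrained)
freeze_lower_layers(model, num_frozen=2)
sgd_step(model, grads=[[1.0, 1.0], [1.0], [1.0]], lr=0.5)
```

With `num_frozen=0` the same step updates every layer, which corresponds to the case of FIG. 11(a).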

Therefore, if the results learned in the source domain are appropriately selected, the learning speed in the target domain may be accelerated.

Hereinafter, some implementations of the present disclosure regarding channel adaptation based on TL for end-to-end communication will be described.

FIG. 12 is provided to illustrate the concept of end-to-end learning according to some implementations of the present disclosure.

In end-to-end learning according to some implementations of the present disclosure, a receiver may compute and transmit a gradient or weight for a training part of a transmitter as shown in FIG. 12, instead of transmitting backpropagation results for all training symbols to the transmitter as shown in FIG. 8.

FIG. 13 illustrates a conventional end-to-end learning process performed based on backpropagation, whereas FIG. 14 illustrates an example of an end-to-end learning process according to some implementations of the present disclosure.

Referring to FIG. 13, when an NN is updated, a gradient or weight is calculated on a mini-batch basis. The gradient of the mini-batch is obtained from the average of gradient results for training symbols. The gradient obtained for each training symbol at a receiver is transmitted to a transmitter. As a result, the conventional end-to-end learning shown in FIG. 13 may lead to high feedback frequency as well as significant signaling overhead.
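
The mini-batch averaging described above may be written as follows, using the ∇ notation of FIG. 8. The symbols P (number of training symbols per mini-batch) and η (learning rate), as well as the plain gradient-descent update, are assumptions introduced only for this illustration.

```latex
\nabla_t = \frac{1}{P} \sum_{p=1}^{P} \nabla_{t,p},
\qquad
w_{t+1} = w_t - \eta \, \nabla_t
```

Only the averaged value ∇_t is needed for the weight update, whereas the conventional process feeds back a result for each of the P training symbols.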

In the conventional end-to-end learning, for example, the gradient for each training symbol is transmitted from the receiver to the transmitter. When the transmitter intends to update the weight, the transmitter averages the gradients of the training symbols and performs the update based on the average.

In some implementations of the present disclosure, to compute and transmit the gradient or weight for the training part of the transmitter, the receiver may replicate a transmitter NN (Tx NN) and then perform backpropagation to the transmitter as shown in the example of FIG. 14.

In some implementations of the present disclosure, to enable the replicated Tx NN to operate at the receiver, the transmitter may transmit to the receiver information on the Tx NN including the configuration and initial state (e.g., initial value of NN weights) of the Tx NN and/or configurations for generating the training symbols (signal 1 in FIG. 14). Thereafter, based on the received Tx NN information, the receiver may start tracking weights and feedforward values, calculate the gradient or weight on a mini-batch basis, and transmit the gradient or weight to the transmitter (signal 2 in FIG. 14). In some implementations of the present disclosure, transmitting the gradient or weight on a mini-batch basis may reduce the frequency of feedback and signaling overhead from the receiver to the transmitter, compared to transmitting backpropagation results.
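
The reduction in feedback frequency can be illustrated with a simple count. The sketch below is a rough, hypothetical model: it counts only the number of receiver-to-transmitter feedback transmissions and ignores the size of each transmission, and the function name and the example numbers are not from the present disclosure.

```python
def feedback_messages(num_batches: int, symbols_per_batch: int, per_mini_batch: bool) -> int:
    """Count receiver-to-transmitter feedback transmissions.

    per_mini_batch=False models feedback for every training symbol;
    per_mini_batch=True models one gradient/weight feedback per mini-batch.
    """
    if per_mini_batch:
        return num_batches
    return num_batches * symbols_per_batch


# Example with assumed numbers: 10 mini-batches of 500 training symbols each.
conventional = feedback_messages(10, 500, per_mini_batch=False)  # 5000 transmissions
proposed = feedback_messages(10, 500, per_mini_batch=True)       # 10 transmissions
```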

However, when the receiver directly calculates the gradient or weight by replicating the Tx NN, if the receiver replicates the entirety of the Tx NN, it may significantly increase signaling overhead as well as computational complexity at the receiver. Therefore, in some implementations of the present disclosure, transfer learning (TL) is applied to mitigate increases in the signaling overhead and reduce the computational complexity at the receiver. For example, when TL is applied, only information on a part of the replicated Tx NN where learning is required is transferred to the receiver. The receiver may perform computations only on the corresponding part, and thus learning may be performed more efficiently than the structure shown in FIG. 12. According to some implementations of the present disclosure, when TL is applied to end-to-end learning, only a specific layer of the Tx NN may be selected and trained depending on the channel environment in a source domain and the channel environment in a target domain, thereby reducing the training part of the Tx NN. In addition, the signaling overhead and computational complexity at the receiver may be reduced. In particular, as the training part of the Tx NN gets closer to the front-end of the Tx NN (e.g., closer to a transmission antenna), the number of learning layers to be considered decreases, thereby reducing the signaling required for training and the computational complexity at the Rx NN. When TL is applied, signal 1 in the example of FIG. 14 may include the configuration and initial state of the Tx training part as well as information on a pre-trained part of the transmitter if necessary, and/or information on a training signal. In the example of FIG. 14, signal 2 represents a gradient and/or weight for the Tx training part. For instance, in the example of FIG. 14, signal 2 may represent an averaged or updated gradient obtained by averaging over the training symbols in the Tx training part.

Some implementations of the present disclosure may be represented by the following two methods (A and B).

A. No backpropagation feedback: Instead of performing backpropagation feedback to a transmitter, a receiver calculates the gradient of a Tx NN and feeds the calculated gradient back to the transmitter.

- A-1. To enable the receiver to compute the gradient of the Tx NN, the transmitter may provide information on the Tx NN (e.g., signal 1 in FIG. 14) through initial signaling. For example, the transmitter may provide a Tx NN configuration, an initial state, training symbol generation information (e.g., reference signal information), and so on to the receiver. In this case, the initial signaling occurs for the transmission of the information on the Tx NN.
- A-2. The receiver may track the state of the Tx NN (e.g., weight(s) and/or feedforward value(s) necessary for weight calculation) and compute weights. In this case, the computational complexity at the receiver increases.
- A-3. The receiver feeds the gradients/weights of the Tx NN back to the transmitter at mini-batch intervals (e.g., weight update intervals) (e.g., signal 2 in FIG. 14).
- A-4. A mini-batch consists of several hundred or thousand training symbols. Thus, it is necessary to reduce the frequency and overhead of signal 2.

B. End-to-end learning to which TL is applied: TL may be applied to mitigate increases in computational complexity at the receiver and reduce signaling overhead.

- B-1. Fine-tuning pre-trained TL: The sizes and structures of pre-trained and training parts may be selected in consideration of a channel environment.
- B-2. Signal 1: Tx training part configuration and/or initial state (if necessary, information on the Tx pre-trained part and training symbol generation information may be included).
- B-3. Signal 2: Gradient(s)/weight(s) of the Tx training part.
- B-4. The size of the Tx training part may decrease by applying TL. As a result, the overhead of signal 1 and/or signal 2 may also be reduced. In implementations where a receiver directly computes the gradient(s)/weight(s) of a Tx NN and provides the gradient(s)/weight(s) to a transmitter, the increases in the computational complexity at the receiver may be mitigated.
- B-5. As the location of the Tx training part gets closer to the receiver, the overhead of signal 1 and/or signal 2 may be reduced, and the increase in the computational complexity at the receiver may also be mitigated.
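
The receiver-side gradient computation outlined in methods A and B can be sketched with a toy model. Everything in the block is an assumption made for illustration: the Tx training part is a single scalar weight w with x = tanh(w·s), the channel is a noiseless real gain h known at the receiver, the receiver applies a fixed scalar v, and the loss is the squared error (v·y − s)². The class and function names are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def tx_forward(w: float, s: float) -> float:
    """Assumed Tx training part: one scalar weight followed by tanh."""
    return math.tanh(w * s)


class TxReplicaAtReceiver:
    """Receiver-side replica of the Tx training part, set up from signal 1."""

    def __init__(self, w_init: float, lr: float) -> None:
        self.w = w_init  # tracked Tx weight (initial state from signal 1)
        self.lr = lr

    def minibatch_gradient(self, symbols: Sequence[float], received: Sequence[float],
                           h: float, v: float) -> float:
        """Average the per-symbol chain-rule gradients over one mini-batch."""
        total = 0.0
        for s, y in zip(symbols, received):
            x = tx_forward(self.w, s)          # tracked feedforward value
            dloss_dy = 2.0 * (v * y - s) * v   # receiver-side term
            dy_dx = h                          # channel term
            dx_dw = (1.0 - x * x) * s          # transmitter-side term
            total += dloss_dy * dy_dx * dx_dw
        return total / len(symbols)

    def apply(self, grad: float) -> None:
        """Keep the replica in step with the transmitter's own update."""
        self.w -= self.lr * grad


def run_minibatch(w_tx: float, replica: TxReplicaAtReceiver, symbols: List[float],
                  h: float, v: float) -> Tuple[float, float]:
    """One mini-batch: training symbols go over the channel, one gradient comes back."""
    received = [h * tx_forward(w_tx, s) for s in symbols]        # noiseless channel
    grad = replica.minibatch_gradient(symbols, received, h, v)   # content of signal 2
    replica.apply(grad)
    return w_tx - replica.lr * grad, grad


# Usage with assumed values: the transmitter and the replica stay synchronized.
replica = TxReplicaAtReceiver(w_init=0.5, lr=0.1)
w_new, grad = run_minibatch(0.5, replica, symbols=[1.0, -1.0], h=1.0, v=1.0)
```

In this sketch a single averaged gradient crosses the link per mini-batch, and the receiver needs the tracked weight w to evaluate the transmitter-side term, which is why the initial state is conveyed in signal 1.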

In some implementations of the present disclosure, in the case of online learning of end-to-end communication, a receiver may compute the gradient of a Tx NN and feed the computed gradient back to a transmitter.

In some implementations of the present disclosure, in the case of online learning of end-to-end communication, when a receiver intends to compute the gradient of a Tx NN and feed the gradient back to a transmitter, information on the Tx NN may be transmitted to the receiver to enable the receiver to compute the gradient of the transmitter.

In some implementations of the present disclosure, in the case of online learning of end-to-end communication, when a receiver intends to compute the gradient of a Tx NN and feed the gradient back to a transmitter, TL may be applied to mitigate increases in computational complexity at the receiver and reduce signaling overhead.

In some implementations of the present disclosure, in the case of online learning of end-to-end communication, when a receiver intends to compute the gradient of a Tx NN and feed the gradient back to a transmitter, if TL is applied to mitigate increases in computational complexity at the receiver and reduce signaling overhead, information on a Tx training part may be transmitted to the receiver to enable the receiver to compute the gradient of the training part of the Tx NN.

In some implementations of the present disclosure, in the case of online learning of end-to-end communication, when a receiver intends to compute the gradient of a Tx NN and feed the gradient back to a transmitter, if TL is applied to mitigate increases in computational complexity at the receiver and reduce signaling overhead, a part of the NN of the transmitter closer to the front-end of the transmitter may be determined as a training part to alleviate the computational complexity at the receiver.

A receiving device may perform operations related to end-to-end learning according to some implementations of the present disclosure. The receiving device may include: at least one transceiver; at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions that, when executed, cause the at least one processor to perform the operations according to some implementations of the present disclosure. A processing device for the receiving device may include: at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions that, when executed, cause the at least one processor to perform the operations according to some implementations of the present disclosure. A computer-readable (non-transitory) storage medium may be configured to store at least one computer program including instructions that, when executed by at least one processor, cause the at least one processor to perform the operations according to some implementations of the present disclosure. A computer program or computer program product may include instructions stored on at least one computer-readable (non-transitory) storage medium and, when executed, cause (at least one processor) to perform the operations according to some implementations of the present disclosure.

For the receiving device, processing device, computer-readable (non-volatile) storage medium, and/or computer program product, the operations may include: receiving transmission NN information including a configuration of a transmission NN from a transmitting device; receiving a plurality of training symbols for the transmission NN from the transmitting device; determining a gradient for the transmission NN based on the transmission NN information and the plurality of training symbols; and feeding the gradient back to the transmitting device.

In some implementations, determining the gradient for the transmission NN based on the transmission NN information and the plurality of training symbols may include: determining a plurality of gradient values of the transmission NN based on the plurality of training symbols, respectively; and determining the gradient for the transmission NN by averaging the plurality of gradient values.

In some implementations, determining the gradient for the transmission NN based on the transmission NN information and the plurality of training symbols may include determining a gradient of a training part of the transmission NN.

In some implementations, the operations may include receiving information on the training part of the transmission NN from the transmitting device.

In some implementations, the transmission NN information may include information on an initial state of the transmission NN.

In some implementations, the transmission NN information may include information on generation of training symbols in the transmission NN.
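
The information exchanged in the operations above can be pictured as two simple records. The following is only a sketch: the field names, the use of a seed as training symbol generation information, and the flat-list representation of weights and gradients are hypothetical and are not defined in the present disclosure.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TxNNInfo:
    """Hypothetical container for the transmission NN information."""
    layer_sizes: List[int]                            # configuration of the transmission NN
    initial_weights: Optional[List[float]] = None     # initial state, if provided
    training_symbol_seed: Optional[int] = None        # assumed form of symbol generation info
    training_part_layers: Optional[List[int]] = None  # indices of layers to be trained, if any


@dataclass
class GradientFeedback:
    """Hypothetical container for the gradient fed back to the transmitting device."""
    batch_index: int
    gradient: List[float]


def average_gradients(batch_index: int, per_symbol: List[List[float]]) -> GradientFeedback:
    """Average per-training-symbol gradient vectors element-wise into one feedback record."""
    count = len(per_symbol)
    averaged = [sum(column) / count for column in zip(*per_symbol)]
    return GradientFeedback(batch_index, averaged)


# Usage: two training symbols, two trainable weights.
info = TxNNInfo(layer_sizes=[4, 2], initial_weights=[0.0, 0.0], training_part_layers=[1])
feedback = average_gradients(0, [[1.0, 2.0], [3.0, 4.0]])
```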

A transmitting device may perform operations related to end-to-end learning according to some implementations of the present disclosure. The transmitting device may include: at least one transceiver; at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions that, when executed, cause the at least one processor to perform the operations according to some implementations of the present disclosure. A processing device for the transmitting device may include: at least one processor; and at least one computer memory operably connected to the at least one processor and configured to store instructions that, when executed, cause the at least one processor to perform the operations according to some implementations of the present disclosure. A computer-readable (non-transitory) storage medium may be configured to store at least one computer program including instructions that, when executed by at least one processor, cause the at least one processor to perform the operations according to some implementations of the present disclosure. A computer program or computer program product may include instructions stored on at least one computer-readable (non-transitory) storage medium and, when executed, cause (at least one processor) to perform the operations according to some implementations of the present disclosure.

For the transmitting device, processing device, computer-readable (non-volatile) storage medium, and/or computer program product, regarding the operations, updating the weight of the transmission NN based on the gradient may include updating a weight of a training part of the transmission NN based on the gradient.

In some implementations, the operations may include: determining the training part of the transmission NN; and transmitting information on the training part to the receiving device.

In some implementations, determining the training part of the transmission NN may include determining a front end of the transmission NN as the training part.

In some implementations, the transmission NN information may include information on an initial state of the transmission NN.

In some implementations, the transmission NN information may include information on generation of training symbols in the transmission NN.

In some implementations, the transmitting device or receiving device may be a network device (e.g., BS, server, etc.). In some implementations, the transmitting device or receiving device may be a UE.

The examples of the present disclosure as described above have been presented to enable any person of ordinary skill in the art to implement and practice the present disclosure. Although the present disclosure has been described with reference to the examples, those skilled in the art may make various modifications and variations in the examples of the present disclosure. Thus, the present disclosure is not intended to be limited to the examples set forth herein, but is to be accorded the broadest scope consistent with the principles and features disclosed herein.

The implementations of the present disclosure may be used in a BS, a UE, or other equipment in a wireless communication system.
