DISTRIBUTED COMPUTATION OFFLOADING METHOD BASED ON COMPUTATION-NETWORK COLLABORATION IN STOCHASTIC NETWORK

Information

  • Patent Application
  • 20230199061
  • Publication Number
    20230199061
  • Date Filed
    November 05, 2021
    3 years ago
  • Date Published
    June 22, 2023
    a year ago
Abstract
A distributed computation offloading method based on computation-network collaboration in a stochastic network is provided. The distributed computation offloading method includes: building a device revenue maximization problem model and a MEC server revenue maximization problem model based on a local computing model and an edge cloud computing model; building composite sets of scenarios for dwell time and waiting latency based on a random movement of a user and burst computation demands; compensating for a game strategy by using a posteriori recourse action and building a game-based stochastic programming model between the device and the MEC server; transforming a multi-stage stochastic regularization problem for both the device and the MEC server into a DEP problem by constructing a scenario tree, and solving the DEP problem to obtain an optimal task strategy for the offloading from the MEC server and an optimal offering strategy of the MEC server to the device.
Description

The present application claims priority to Chinese Patent Application No. 202111095551.7, titled “DISTRIBUTED COMPUTATION OFFLOADING METHOD BASED ON COMPUTATION-NETWORK COLLABORATION IN STOCHASTIC NETWORK”, filed on Sep. 17, 2021 with the Chinese Patent Office, which is incorporated herein by reference in its entirety.


FIELD

The present disclosure relates to the technical field of mobile communications, and in particular to a distributed computation offloading method based on computation-network collaboration in a stochastic network.


BACKGROUND

With the rapid development and integration of mobile Internet and Internet of Things, mobile devices (MDs) (such as smartphones, wearable devices, and automatic driving vehicles) and cloud-oriented applications (such as virtual reality (VR), augmented reality (AR), and online games) are widely used, driving computing power and storage resources to the edge of the network to reduce transmission delays and congestions. With the multi-access edge computing (MEC) technology, part or all of computing tasks of devices or applications may be offloaded to an edge-cloud server to enhance the user's service experience. In addition, in order to meet requirements of a next generation wireless communication networks, small base stations (SBSs) with low powers and short distances are arranged in coverage of macro base stations (MBSs) in the ultra-dense network (UDN), effectively improving network throughput and access capacity. As shown in FIG. 1, MEC and UDN are integrated, so that the users can enjoy ubiquitous network computing services anywhere and anytime, thereby facilitating the continuity of task computing offload.


However, the actual UDN system supporting MEC has the following problems. (1) With the rapid growth of cloud-oriented devices, applications and data in the era of the Internet of Things, the conventional centralized optimization framework requires accurate information (such as traffic features, network loads and channel state information (CSI)) about system states, which is inefficient for the complex and time-varying UDN supporting MEC with many heterogeneous Internet of Things applications. (2) Due to random movement, mobile devices frequently switch between different wireless edge-cloud access points, increasing the cost of task re-offloading and resource reallocation and seriously reducing the computation offloading performance. According to the conventional technology, the following results are achieved. (1) Mobility aware and dynamic migration services for the internet of vehicles (referring to: I. Labriji, F. Meneghello, D. Cecchinato, S. Sesia, E. Perraud, E. C. Strinati and M. Rossi, “Mobility Aware and Dynamic Migration of MEC Services for the Internet of Vehicles,” IEEE Transactions on Network and Service Management, vol. 18, no. 1, pp. 570-584, March 2021) are achieved. According to the algorithm, by using an MEC server, dwell time is predicted and proportional computation offloading is performed. (2) Energy-efficient and delay-guaranteed workload distribution algorithm applied in IoT-Edge-Cloud computing systems (referring to: M. Guo, L. Li and Q. Guan, “Energy-Efficient and Delay-Guaranteed Workload Allocation in IoT-Edge-Cloud Computing Systems,” IEEE Access, vol. 7, pp. 78685-78697, 2019.). According to the algorithm, there are many kinds of lightweight edge-cloud servers having different function features which cannot respond to sudden computation requirements quickly and timely, and there is an uncertain waiting delay for offloading a task in the server (including task queue, decompression, security analysis, and the like), which indicates that offloading and resource allocation decisions should consider a network edge (a communication factor) and should further consider a service edge (an actual available computing power factor), especially for the delay-sensitive applications. However, requirements for task offloading demands in stochastic and time-varying network environments are lacking in the conventional technology.


SUMMARY

In order to meet the growing requirements for large-scale Internet of Things applications in a stochastic and time-varying network environment, a distributed computation offloading method based on computation-network collaboration in a stochastic network is provided according to a first aspect of the present disclosure. The distributed computation offloading method may include:


establishing a local computing model and an edge-cloud computing model based on differentiated service quality requirements, task processing costs, and the type of a device in a UDN environment;


establishing, based on the local computing model and the edge-cloud computing model, a device revenue maximization problem model by using a Lyapunov optimization theory and a minimum drift decreasing utility function at the device;


establishing, based on the local computing model and the edge cloud computing model, a MEC server revenue maximization problem model by using a maximum value theory at an MEC server;


respectively establishing, based on a random movement of a user and a burst computation requirement, a composite scenario set based on a dwell time period and a composite scenario set based on a waiting delay;


performing compensation on a game strategy by using a posteriori recourse action, and establishing a game-based stochastic programming model for the device and the MEC server;


establishing a scenario tree to transform a multi-stage stochastic programming problem of the device and the MEC server to a DEP problem;


solving an optimization problem of the device and an optimization problem of the MEC server based on a Lagrangian multiplier and a KKT condition to obtain an optimal task offloading strategy of the device to the MEC server and an optimal quotation strategy of the MEC server to the device; and


in a case that each of the optimal task offloading strategy of the device to the MEC server and the optimal quotation strategy of the MEC server to the device meets a Stackelberg equilibrium solution, offloading, by the device based on the optimal task offloading strategy, a task to the MEC server.


In some embodiments, the establishing a device revenue maximization problem model includes:









max



f

i
,
L


(
t
)

,


D

i
,
k


(
t
)





Y

b
i


(
t
)


=



V
i

·


U

b
i


[


D
i

(
t
)

]


+





k


(

L
,
M
,
S



}





Q
i

(
t
)

·


D

i
,
k


(
t
)



-



Q
i

(
t
)

·


A
i

(
t
)




,




and constraints include:









T

i
,
L

pt

(
t
)



τ
i
d


,








f

i
,
L

min




f

i
,
L


(
t
)



f

i
,
L

max


,

t

T

,







0



D
i

(
t
)




Q
i

(
t
)


,

t

T

,
and










Q
i



(
t
)


_

=



lim

T


+





sup


1
T






t
=
0


T
-
1



E


{


Q
i

(
t
)

}







+




;




where fi,L(t) represents a CPU clock frequency set of an i-th device, Di,k(t) represents an offloading task set in a time slot t, Ybi(t) represents a long-term revenue of the i-th device, Vi represents a non-negative control parameter in the Lyapunov optimization theory, Di,k(t) represents an offloading task for an i-th mobile device, Ai(t) represents a task arriving at the i-th mobile device in the time slot t, T represents a time slot index and T={0, 1, . . . }, Ubi[Di(t)] represents a total revenue of the i-th device in the time slot t; k∈{L,M,S} represents a current task type, L represents a non-offloading task, M represents a task offloaded to a macro base station, S represents a task offloaded to a small base station; Ti,Lpt(t) represents a time period in which a local server processes an offloaded task, τid represents a maximum computation delay constraint of Di(t), fi,Lmin represents a minimum CPU clock frequency allocated for the i-th mobile device in the time slot t, fi,L(t) represents a CPU clock frequency allocated for the i-th mobile device in the time slot t, fi,Lmax represents a maximum CPU clock frequency allocated for the i-th mobile device in the time slot t, Di(t) represents a task processing decision at a beginning of the time slot t, Qi(t) represents a task queue backlog at the beginning of the time slot t, and custom-character(t) represents an average queue backlog.


In some embodiments, in a case that a task is offloaded to a macro base station, the MEC server revenue maximization problem model is expressed as:









max



p
M

(
t
)

,

β

i
,

l
M







Y

s
M


(
t
)


=



u
M

[


p
M

(
t
)

]

-





i




e

i
,
M


[



D

i
,
M


(
t
)

,


T

i
,
M

wt

(
t
)


]




,




and constraints include:






p
i,M(t)≥0,t∈T,






T
i,M
co(t)≤τid,t∈T, and





βlMmin≤βi,lM(t)≤βlMmax,t∈T;


where pM(t) represents a pricing set of the macro base station in a time slot t, βi,lM represents a computing capacity set allocated to the macro base station, YsM(t) represents a revenue of the macro base station, uM[pM(t)] represents a service utility of the macro base station, ei,M[Di,M(t),Ti,Mwt(t)] represents a task computing energy consumption of the macro base station, pi,M(t) represents a payment cost of an i-th device in the time slot t to the macro base station, T represents a time slot index and T={0, 1, . . . }, Ti,Mco(t) represents a total computation offloading time period of the macro base station in the time slot t, τid represents a maximum computation delay constraint of Di(t), βlMmin represents a minimum frequency of each of CPU cores in the macro base station, βi,lM(t) represents a frequency of an i-th CPU core in the macro base station, and βlMmax represents a maximum frequency of each of the CPU cores in the macro base station.


In some embodiments, in a case that a task is offloaded to a small base station, the MEC server revenue maximization problem model is expressed as:











?


(
t
)


=



u
S

[


p
S

(
t
)

]

-





i




e

i
,
S


[



D

i
,
S


(
t
)

,


T

i
,
S

st

(
t
)

,


T

i
,
S

wt

(
t
)


]




,








?

indicates text missing or illegible when filed




and constraints include:






p
i,S(t)≥0,t∈T,






T
i,S
co(t)≤τid,t∈T, and





βlSmin≤βi,lS(t)≤βlSmax,t∈T;


where pS(t) represents a pricing set of the small base station in a time slot t, βi,lS represents a computing capacity set allocated to the small base station, YsS(t) represents a revenue of the small base station, uS[pS(t)] represents a service utility of the small base station, ei,S[Di,S(t),Ti,Sst(t),Ti,Swt(t)] represents a task computing energy consumption of the small base station, pi,S(t) represents a payment cost of an i-th device in the time slot t to the small base station, T represents a time slot index and T={0, 1, . . . }, Ti,Sco(t) represents a total computation offloading time period of the small base station in the time slot t, τid represents a maximum computation delay constraint of Di(t), βlSmin represents a minimum frequency of each of CPU cores in the small base station, βi,lS(t) represents a frequency of an i-th CPU core in the small base station, and βlSmax represents a maximum frequency of each of the CPU cores in the small base station.


In some embodiments, the establishing a composite scenario set based on a dwell time period and a composite scenario set based on a waiting delay includes:


in a case that a possible dwell time period and a possible waiting time period of a mobile device in a small base station are known, obtaining, based on a Cartesian product, a composite scenario based on dwell time periods of all mobile devices and a composite scenario based on waiting delays of all the mobile devices, wherein a composite scenario set based on the dwell time periods of all the mobile devices is expressed as:









Ω
S

(
t
)

=



Ω
S
st

(
t
)

×


Ω
S
wt

(
t
)



,









Ω
S
st

(
t
)

=





i
=
1

m



Ω

i
,
S

st

(
t
)


=



Ω

1
,
S

st

(
T
)

×

×


Ω

m
,
S

st

(
t
)




,
and









Ω
k
wt

(
t
)

=





i
=
1

m



Ω

i
,
k

wt

(
t
)


=



Ω

1
,
k

wt

(
t
)

×

×


Ω

m
,
k

wt

(
t
)




;




where m represents the number of the mobile devices, Ωi,Sst(t) represent a composite scenario based on a dwell time period of an i-th mobile device, and Ωi,kwt(t) represents a composite scenario based on a waiting delay of the i-th mobile device.


In some embodiments, compensation is performed on the game strategy by using the posteriori recourse action, that is, a two-stage optimal programming model is adopted, parameters are obtained based on the device revenue maximization problem model and the MEC server revenue maximization problem model in a first stage, compensation is performed on the parameters obtained in the first stage by using the posteriori recourse action in a second stage, and the device revenue maximization problem model after the compensation performed by using the posterior recourse action is expressed as:














max



f

i
,
L


(
t
)

,


D

i
,
k


(
t
)





Y

b
i


(
t
)


=


V
i



{



u
i

[


D
i

(
t
)

]

-


s
i

[


D
i

(
t
)

]

-











?


{


c
i

(



D

i
,
t


(
t
)

,



T

i
,
S

st

(
t
)





T

i
,
S

st

(
t
)




Ω

i
,
S

st

(
t
)




)

}


-









?


{


e

i
,
L


(



D

i
,
t


(
t
)

,



T

i
,
S

st

(
t
)

|



T

i
,
S

st

(
t
)




Ω

i
,
S

st

(
t
)




)

}


}

+










k


{

L
,
M
,
S

}






Q
i

(
t
)




D

i
,
k


(
t
)



-



Q
i

(
t
)




A
i

(
t
)






,








?

indicates text missing or illegible when filed




and constraints include:









T

i
,
L

pt

(
t
)



τ
i
d


,








f

i
,
L

min




f

i
,
L


(
t
)



f

i
,
L

max


,

t

T

,







0



D
i

(
t
)




Q
i

(
t
)


,

t

T

,
and










Q
i



(
t
)


_

=



lim

T


+





sup


1
T






t
=
0


T
-
1



E


{


Q
i

(
t
)

}







+




;




in a case that a task is offloaded to a small base station, the MEC server revenue maximization problem model after the compensation performed by using the posterior recourse action is expressed as:









max



p
S

(
t
)

,

β

i
,

l
S







Y

s
S


(
t
)


=



u
S

[


p
S

(
t
)

]

-


E


Ω
S

(
t
)


[

R

(



β

i
,

l
S



(
t
)

,


Ω
S

(
t
)


)

]



,




and constraints include:






p
i,S(t)≥0,t∈T,






T
i,S
co(t)≤τid,t∈T, and





βlSmin≤βi,lS(t)≤βlSmax,t∈T;


where fi,L(t) represents a CPU clock frequency set of an i-th device, Di,k(t) represents an offloading task set, Ybi(t) represents a long-term revenue of the i-th device, Vi represents a non-negative control parameter in the Lyapunov optimization theory, ui[Di(t)] represents a utility of the i-th device, si[Di(t)] represents a computation service cost of the i-th device, EΩi,Sst(t){ci(Di,t(t),Ti,Sst(t)|Ti,Sst(t)∈Ωi,Sst(t))} represents an average communication cost in a composite scenario Ωi,Sst(t),







E


Ω

i
,
S

st

(
t
)




{


e

i
,
L


(



D

i
,
t


(
t
)

,



T

i
,
S

st

(
t
)

|



T

i
,
S

st

(
t
)




Ω

i
,
S

st

(
t
)




)

}





represents an average energy consumption cost in the composite scenario Ωi,Sst(t), Di,k(t) represents an offloading task for the i-th mobile device, Ai(t) represents a task arriving at the i-th mobile device in a time slot t, T represents a time slot index and T={0, 1, . . . }, Ubi[Di(t)] represents a total revenue of the i-th device in the time slot t, k∈{L,M,S} represents a current task offloading position, L indicates that a task is offloaded locally, M indicates that a task is offloaded to a macro base station, S indicates that a task is offloaded to a small base station, Ti,Lpt(t) represents a time period in which a local server processes an offloaded task, τid represents a maximum computation delay constraint of Di(t), fi,Lmin represents a minimum CPU clock frequency allocated for the i-th mobile device in the time slot t, fi,L(t) represents a CPU clock frequency allocated for the i-th mobile device in the time slot t, fi,Lmax represents a maximum CPU clock frequency allocated for the i-th mobile device in the time slot t, Di(t) represents a task processing decision at a beginning of the time slot t, Qi(t) represents a task queue backlog at the beginning of the time slot t, custom-character(t) represents an average queue backlog, YsS(t) represents a revenue of a small base station, uS[pS(t)] represents a service utility of the small base station, EΩS(t)[R(βi,lS(t),ΩS(t))] represents an average recourse in a composite scenario Ωi,Sst(t), pi,M(t) represents a payment cost of the i-th device to the macro base station in the time slot t, T represents a time slot index and T={0, 1, . . . }, Ti,Mco(t) represents a total computation offloading time period of the macro base station in the time slot t, τid represents a maximum computation delay constraint of Di(t), βlMmin represents a minimum frequency of each of CPU cores in the macro base station, βi,lM(t) represents a frequency of an i-th CPU core in the macro base station, and βlMmax represents a maximum frequency of each of the CPU cores in the macro base station.


In some embodiments, a recourse function R(βi,lS(t),ΩS(t)) is expressed as:







R

(



β

i
,

l
S



(
t
)

,


Ω
S

(
t
)


)

=





i




e

i
,
S


[



D

i
,
S


(
t
)

,


T

i
,
S

st

(


ω
S

(
t
)

)

,


T

i
,
S

wt

(


ω
S

(
t
)

)


]






where ei,S[Di,S(t),Ti,SstS(t)),Ti,SwtS(t))] represents an energy consumption cost of the small base station.


In some embodiments, a task offloading process is divided into H sub-time slots, a distributed programming model with 2H stages is obtained, and the device revenue maximization problem model is expressed as:














?


Y

b
i




{



D
i

(

τ
1

)

,


,


D
i

(

τ
H

)

,


f

i
,
L


(

τ
1

)

,


,


f

i
,
L


(

τ
H

)


}


=







V
i



{



u
i

[


D
i

(

τ
1

)

]

-


c
i

[


D
i

(

τ
1

)

]

-


e

i
,
L


[


D
i

(

τ
1

)

]

-


s
i

[


D
i

(

τ
1

)

]

+










?





h
=
2

H


{



u
i

[


D
i

(

τ
h

)

]

-


s
i

[


D
i

(

τ
h

)

]


}



-








?





h
=
2

H



c
i

[



D
i

(

τ
h

)

,


T

i
,
S

st

(

τ
h

)


]



-









?





h
=
2

H



e

i
,
L


[



D
i

(

τ
h

)

,


T

i
,
S

st

(

τ
h

)


]



}

+









Q
i

(

τ
1

)




D
i

(

τ
1

)


+


?





h
=
2

H




Q
i

(

τ
h

)




D
i

(

τ
h

)




-









Q
i

(

τ
1

)




A
i

(

τ
1

)


-


?





h
=
2

H




Q
i

(

τ
h

)




A
i

(

τ
h

)








,








?

indicates text missing or illegible when filed




and constraints include:








f

i
,
L

min




f

i
,
L


(

τ
h

)



f

i
,
L

max


,


τ
h


H

,











h
=
1

H



D
i

(

τ
h

)


=


D
i

(

τ
h

)


,


τ
h


H

,
and







0





h
=
1

H



D
i

(

τ
h

)





Q
i

(

τ
h

)


,



τ
h


H

;





in the case that the task is offloaded to the small base station, the MEC server revenue maximization problem model is expressed as:














?


{



p

i
,
S




(

τ
1

)


,


,


p

i
,
S




(

τ
H

)


,


β

i
,
S




(

τ
1

)


,


,


β

i
,
S




(

τ
H

)



}


=








u
S

[


p
S

(

τ
1

)

]

-




i
=
1

m


{



e

i
,
S


[


D

i
,
S


(

τ
1

)

]

+


p

i
,
S

f

[


D

i
,
S


(

τ
1

)

]


}


+








?





h
=
2

H



u
S

[


p
S

(

τ
h

)

]



-







?





i
=
1

m





h
=
2

H



e

i
,
S


[



D

i
,
S


(

τ
h

)

,


T

i
,
S

st

(

τ
h

)

,


T

i
,
S

wt

(

τ
h

)


]







,








?

indicates text missing or illegible when filed




and constraints include:









p

i
,
S


(

τ
h

)


0

,


τ
h


H

,









E

ξ
s




{




h
=
1

H


[




T

i
,
S

up

(

τ
h

)

+

|



T

i
,
S

wt

(

τ
h

)

+





l
k



L
k







D
i

(

τ
h

)



γ
i



β

i
,

l
k









}




τ
i
d


,
and








β

l
s

min




β

i
,

l
s



(

τ
h

)



β

l
s

max


,



τ
h


H

;





where fi,Lh) represents a CPU clock frequency set of an i-th device in a sub-time slot τh, Di,kh) represents a offloading task set of an i-th mobile device in the sub-time slot τh, ui[Di1)] represents a utility of the i-th mobile device in a root node, ci[Di1)] represents a communication cost of the i-th mobile device in the root node, ei,L[Di1)] represents an energy consumption cost of the i-th mobile device in the root node, si[Di1)] represents a computation service cost of the i-th mobile device in the root node, Ti,Ssth) represents a dwell time period of the i-th mobile device in the small base station in the sub-time slot τh, Dih) represents a processed task of the i-th mobile device in the sub-time slot τh, fi,Lh) represent a local computation frequency of the i-th mobile device in the sub-time slot τh; custom-characterh) represents a task queue backlog in the sub-time slot τh, ξi,Ssth) represents a dwell time period in the sub-time slot τh, and Ai1) represents a task arriving at the i-th mobile device in the root node.


In some embodiments, the optimal strategy of the device is obtained based on the Lagrangian multiplier and the KKT condition by using the following equations:









f

i
,
L

*

(
t
)

=


(



ρ
i




A
L

·
ln


2


-
1

)




γ
i


τ
i
d




,

t

T

,
and









D

i
,
k

*

(
t
)

=



ρ
i




A
k

·
ln


2


-
1


,


k



{

M
,
S

}


t


T

;
and





the optimal quotation strategy of the MEC server to the device is obtained based on the Lagrangian multiplier and the KKT condition by using the following equation:









p

i
,
k

*

(
t
)

=


2


λ
k



k

l
k







D

i
,
k

*

(
t
)



γ
i
2



T

i
,
k

pt



-



D

i
,
k

*

(
t
)


Θ
k




;




where, fi,L*(t) represents an optimal local computation frequency, ρi represents a utility weight parameter of an i-th mobile user, AL represents a locally arriving task set, γi represents a computation density in cycles/bit obtained by performing an offline measurement, τid represents a maximum computation delay constraint of Di(t), Di,k*(t) represents an optimal offloading strategy, pi,k*(t) represents an optimal quotation strategy of a base station, λk represents a unit energy consumption, klk represents an effective switching capacitance in the MEC server, Di,k*(t) represents an optimal offloading strategy, Ti,kpt represents a processing time period for offloading a task, and








Θ
k

=





D

i
,
k

*

(
t
)






p

i
,
k


(
t
)




,




where k∈{M,S}, represents a first-order derivative of the optimal offloading strategy to a quotation.


With the extended game theory method based on the Lyapunov optimization theory according to the present disclosure, dynamic task offloading and adaptive computation power management in a time-varying environment are performed. Further, considering the uncertainty of computation and networks caused by the movements of the users and the limited edge resources, a distributed two-stage stochastic programming algorithm and a distributed multi-stage stochastic programming algorithm in a condition that multiple objectives are uncertain are provided. Furthermore, posteriori remedies are performed to compensate for inaccurate predicted network information according to the present disclosure.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows a typical UDN environment supporting MEC;



FIG. 2 shows a flowchart of a distributed computing power and network collaboration method based on a ultra-dense network under multi-objective uncertainty according to the present disclosure;



FIG. 3 shows a task processing and time slot model according to the present disclosure;



FIG. 4 shows a scenario tree based on dwell time periods of an i-th mobile device according to the present disclosure;



FIG. 5 is a schematic diagram showing a recourse and compensation process according to the present disclosure;



FIG. 6 is a schematic diagram showing changes of a price with the number of iterations according to the present disclosure;



FIG. 7 is a schematic diagram showing changes of offloaded tasks with the number of iterations according to the present disclosure;



FIG. 8 is a schematic diagram showing changes of a calculation offloading performance with time (in which V is equal to 500 and represents a non-negative control parameter in a drift decreasing utility function) according to the present disclosure; and



FIG. 9 is a schematic diagram showing changes of a computation offloading performance with V according to the present disclosure.





DETAILED DESCRIPTION OF THE EMBODIMENTS

The technical solutions in the embodiments of the present disclosure will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present disclosure. It is apparent that the embodiments described are only a part of the embodiments of the present disclosure rather than all of the embodiments. Based on the embodiments of the present disclosure, all other embodiments obtained by those skilled in the art without creative efforts fall within the scope of protection of the present disclosure.



FIG. 2 shows a flowchart of a distributed computation offloading method based on computation-network collaboration in a stochastic network according to the present disclosure. The method includes the following steps:


establishing a local computing model and an edge-cloud computing model based on differentiated service quality requirements, task processing costs, and the type of a device in a UDN environment;


establishing, based on the local computing model and the edge-cloud computing model, a device revenue maximization problem model by using a Lyapunov optimization theory and a minimum drift decreasing utility function at the device;


establishing, based on the local computing model and the edge cloud computing model, a MEC server revenue maximization problem model by using a maximum value theory at an MEC server;


respectively establishing, based on a random movement of a user and a burst computation requirement, a composite scenario set based on a dwell time period and a composite scenario set based on a waiting delay;


performing compensation on a game strategy by using a posteriori recourse action, and establishing a game-based stochastic programming model for the device and the MEC server;


establishing a scenario tree to transform a multi-stage stochastic programming problem of the device and the MEC server to a DEP problem;


solving an optimization problem of the device and an optimization problem of the MEC server based on a Lagrangian multiplier and a KKT condition to obtain an optimal task offloading strategy of the device to the MEC server and an optimal quotation strategy of the MEC server to the device; and


in a case that both the optimal task offloading strategy of the device to the MEC server and the optimal quotation strategy of the MEC server to the device meet a Stackelberg equilibrium solution, offloading, by the device based on the optimal task offloading strategy, a task to the MEC server.


First Embodiment

In the embodiment, the process of establishing a local computing model and an edge-cloud computing model based on differentiated service quality requirements, task processing costs, and the type of a device in a UDN environment according to the present disclosure is described in detail.


In the embodiment, as shown in FIG. 1, a UDN system that supports MEC and operates in discrete time is considered. It is assumed that T={0, 1, . . . }, and τ∈T, where T represents a time slot index and τ represents a time length of each of time slots, the system includes a macro base station (MBS) and n small base stations (SBS), N=(1, . . . , n) and N represents a set of n SBSs, M represents the MBS, S∈N and S represents an S-th SBS, M=(1, . . . , m) and M represents m mobile devices (MDs), each of base stations is arranged with an MEC server where the MEC server has computing power and may provide computing power to MDs in a coverage region of the base station, each of the MDs may process tasks locally or offload a task to a MEC server of a base station, and each of the MDs may be connected to a MBS and a nearby SBS by using a dual connection in a 5G network or a CoMP technology.


It is assumed that MDi represents an i-th mobile device, a task requested to be processed by MDi may be represented by a quaternion Λi(t)=custom-charactercustom-character(t),Di(t),τidicustom-character, where Qi(t) represents a task queue backlog at a beginning of a time slot t and Di(t) represents a task processing decision at the beginning of the time slot t, τid represents a maximum computation delay constraint of Di(t), and γi represents a computation density in cycles/bit that may be obtained by performing an offline measurements.


It is assumed that Ai(t) represents a task arriving at MDi (i∈M) and A(t)={A1(t), . . . , Am(t)} represents a set of all MDs in the time slot t. Since the tasks arriving in one time slot are limited, 0≤Ai(t)≤Aimax, t∈T, where Aimax represents a maximum number of arriving tasks. It is assumed that task arrival rates of all the MDs meet an independent and identical distribution. An update equation for the queue backlog of MD i may be expressed as:






custom-character(t+1=[custom-character(t)−Di(t)]++Ai(t),i∈M,i∈T


where [x]+=max(x,0),









D
i

(
t
)

=



D

i
,
L


(
t
)

+




k


{

M
,
S

}





D

i
,
k


(
t
)




,


D

i
,
L


(
t
)





represents a task backlog processed locally, Di,M(t) represents a task backlog processed by the MBS, and Di,S(t) represent a task backlog processed by a SBS.


At a beginning of each of time slots, it is required for each of mobile devices to determine a task offloading strategy to determine the number of non-offloading tasks Di,L(t) and the number of offloading tasks Di,k(t), where k∈{M,S}. As shown in FIG. 3, an offloading process includes the following three stages.


In a first stage, each of the MDs uploads a computation task Di,k(t) to a BS through a wireless channel, where Ti,kup(t) represents a task upload delay.


In a second stage, the task uploaded by each of the MDs waits in a server for a time period which is referred to as a waiting delay Ti,kwt(t).


In a third stage, the offloaded task is to be processed by the server, and Ti,kpt(t) represents a processing time period.


It is required to process the offloaded task in a constraint time length τid. In a case that the offloaded task is not processed in the constraint time length τid, it is determined that the task is failed to be processed.


Since a capacity of a battery of a mobile device is limited, in order to save energy in limited time, a MD may process a task at a CPU clock frequency determined based on a dynamic voltage and frequency scaling (DVFS) technology. It is assumed that








D

i
,
L


(
t
)

=





T

i
,
L

pt

(
t
)






f

i
,
L


(
t
)


γ
i



dt






represents a relationship between processed tasks and local computing resources, where Ti,Lpt(t)≤τid and represents a local execution time period, fi,L(t)(fi,Lmin≤fi,L(t)≤fi,Lmax) represents a CPU clock frequency (cycle/s) allocated in a time slot, fi,Lmin represents a minimum CPU clock frequency of MD i, and fi,Lmax represents a maximum CPU clock frequency of MD i.


For each of the MDs, it is assumed in the present disclosure that the energy of the device is mainly consumed by the operation of the CPU. Generally,








E

i
,
L

exe

(
t
)

=


k
i







T

i
,
L

pt

(
t
)





(


f

i
,
L


(
t
)

)

2


dt







represents energy consumption due to operation of a CPU is, where ki represents an effective energy factor, is related to a chip structure of the device and may be obtained by performing an offline measurements.


In the present disclosure, it is assumed that: (1) in a case that a MD is located both in a coverage region of a MBS and in coverage regions of SBSs, the MD may simultaneously communicate with the MBS and a nearby SBS in each of time slots, that is, the MD may offload a task to both the MBS and the nearby SBS; (2) due to random movements of the MDs and burst computation requirements, a dwell time periods Ti,Sst(t), i∈M of a MD in a SBS and waiting delays Ti,kwt(t), k∈{M,S} of the MD in the different BSs are random and meet an independent identical distribution; and (3) in a case that a MD leaves a wireless coverage region of a SBS while the MD uploading a task to the SBS (where the uploaded task is represented by Di,Str(t)), it is required to firstly upload remaining tasks (Di,S(t)−Di,Str(t)) to the MBS, and then the MBS forwards the remaining tasks to another SBS via a wired optical network.


For a delay-sensitive application, an uploading delay of a task should be less than a constraint time length, that is, Ti,kup(t)<τid. In a case that uploading delay of the task is not less than the constraint time length, it is certain that the task is to be executed unsuccessfully. Since calculation results are usually much smaller than inputted tasks of most applications, time cost for downloading calculation results from base stations is ignored in the present disclosure. Once a MD decides to offload a task to the MBS and a nearby SBS, the task uploading delay and communication energy cost are calculated as follows.


(1) Offloading a Task to a MEC Server of the MBS


For each of the MDs, it is required to transmit input bits of a task to the MBS through a wireless channel. It should be noted that an interference environment of the UDN becomes complex as the number of the users increases. Therefore, for a offloading decision, power interferences between devices should be considered. According to a Shannon-Hartley formula, an upload data rate is obtained by using the following equation:








r

i
,
M


(
t
)

=


ω
M




log
2

(

1
+




P
i

(
t
)




g

i
,
M


(
t
)





σ
2

(
t
)

+





x


M

\

x


=
i




P
x





g

x
,
M


(
t
)

·
1



(


D

x
,
M


(
t
)

)






)






where ωM represents a radio bandwidth, gi,M(t) represents a channel gain of the MBS, σ2(t) represents an average background noise power, Pi(t) represents a communication power of a MD i in a time slot t, and 1(⋅) represents an indicator function.


Therefore, for the MBS, the task uploading delay is obtained by using the following equation:









T

i
,
M

up

(
t
)

=



D

i
,
M


(
t
)



r

i
,
M


(
t
)



,

i

M

,




and


the communication energy consumption is obtained by using the following equation:









E

i
,
M

up

(
t
)

=



P
i

(
t
)





D

i
,
M


(
t
)



r

i
,
M


(
t
)




,

i


M
.






(2) Offloading a Ask to a MEC Server of a SBS


In a case that a MD i offloads a task Di,S(t) to a MEC server of a SBS, an uploading data rate is obtained by using the following equation:








r

i
,
S


(
t
)

=


ω
S




log
2

(

1
+




P
i

(
t
)




g

i
,
S


(
t
)





σ
2

(
t
)

+





x


M

\

x


=
i




P
x





g

x
,
S


(
t
)

·
1



(


D

x
,
S


(
t
)

)






)






where ωS represents a wireless bandwidth, and gi,S(t) represents a channel gain of the SBS.


Generally, a wireless coverage region of a SBS is smaller than a wireless coverage region of a MBS. Based on the assumptions mentioned above, in a case that a MD leaves a wireless coverage region of a SBS while the MD uploading a task, the remaining tasks are firstly uploaded to the MBS and then are forwarded to another SBS. Therefore, an uploading delay of a SBS is obtained by using the following equation:








T

i
,
S

up

(
t
)

=

{







T

i
,
S

st

(
t
)

+


T

i
,
M

up

^

+

T

i
,
M

f


,


if




T

i
,
S

st

(
t
)


<



D

i
,
S


(
t
)



r

i
,
S


(
t
)












D

i
,
S


(
t
)



r

i
,
S


(
t
)


,


if




T

i
,
S

st

(
t
)






D

i
,
S


(
t
)



r

i
,
S


(
t
)












where Ti,Sst(t) represents a dwell time period of the MD i in the SBS,








T

i
,
M

up

^

=




D

i
,
S


(
t
)

-



T

i
,
S

st

(
t
)




r

i
,
S


(
t
)





r

i
,
M


(
t
)






represents an uploading delay for the remaining tasks,







T

i
,
M

f

=





D

i
,
S


(
t
)

-



T

i
,
S

st

(
t
)




r

i
,
S


(
t
)



c

.





represents a forwarding time period of the remaining tasks, and c represents a data rate of an optical link.


Therefore, the communication energy consumption of the MD i uploading Di,S(t) to the SBS is obtained by using the following equation:








E

i
,
S

up

(
t
)

=

{








P
i

(
t
)



(



T

i
,
S

st

(
t
)

+

T

i
,
M

up


)


,



if




T

i
,
S

st

(
t
)


<



D

i
,
S


(
t
)



r

i
,
S


(
t
)












P
i

(
t
)





D

i
,
S


(
t
)



r

i
,
S


(
t
)



,



if




T

i
,
S

st

(
t
)






D

i
,
S


(
t
)



r

i
,
S


(
t
)







.






A total energy consumption of the MD i in the time slot t is obtained by using the following equation:








E

i
,
L


(
t
)

=



E

i
,
L

exe

(
t
)

+




k


{

M
,
S

}







E

i
,
k

up

(
t
)

.







A total computation offloading time period of the MBS/SBS is obtained by using the following equation:






T
i,k
co(t)=Ti,kup(t)=Ti,kwt(t)=Ti,kpt(t),k∈{M,S}.


To ensure that offloaded tasks are processed timely, a total computation offloading time period Ti,kco(t)(t) (including an uploading time period, a waiting delay and a processing time period) does not exceed a constrained time period in each of time slots (that is, Ti,kco(t)≤τid).


It is assumed that the MEC server is arranged with a CPU having Lk cores, a core set of a CPU is represented as Lk=(1, . . . , Lk), k∈{M,S}. A processing energy consumption of the MEC server for processing a task Di,k(t) is obtained by using the following equation:









E

i
,
k


(
t
)

=





l
k



L
k








k

l
k


(


β

i
,

L
k



(
t
)

)

2




T

i
,
k

pt

(
t
)




,

k



{

M
,
S

}

.






where klk represents an effective switching capacitance in the MEC server, βi,lk(t) (βikmin≤βi,lk(t)≤βikmax) represents a CPU clock frequency of a lk-th CPU core in the MEC server, βikmin represents a minimum CPU clock frequency of each of CPU cores, and βikmax represents a maximum CPU clock frequency of each of the CPU cores.


Second Embodiment

In the embodiment, resource cost to be consumed for processing a task is calculated based on the local computing model and the edge-cloud computing model established in the first embodiment.


Based on the local computing model and the edge-cloud computing model established in the first embodiment, it can be seen that the computation offload delay strongly relies on the communication delay, the waiting delay and the processing delay of the task. In order to ensure that an offloaded task is processed in the constrained time period τid, computation factors (including a computation waiting time period Ti,kwt(t), a computation power fi,L and βi,lk(t)) and network factors (including a dwell time period Ti,Sst(t) and a communication time period Ti,kup(t)) are required to be considered.


In order to evaluate a task processing performance, a task processing utility function and a task processing cost function are defined in the present disclosure. For each of the MDs, in order to evaluate a revenue for processing a task in each of the time slots, a logarithmic utility function is adopted in the present disclosure, which is widely used in the field of wireless communications and mobile computations. An utility of the MD i is obtained by using the following equation:









u

i



[


D
i

(
t
)

]

=




k


{

L
,
M
,
S

}






ρ
i




log

[

1
+

D





i
,

k
^



(
t
)



]




,

t

T





where ρi is a utility weight parameter of MD i.


In a case that a transmission task of the MBS are expressed as:








D

i
,
M

tr

(
t
)

=

{








D

i
,
M


(
t
)

+


D

i
,
S


(
t
)

-


T

i
,
S

st




r

i
,
S


(
t
)



,



if




T

i
,
S

st

(
t
)


<



D

i
,
S


(
t
)



r

i
,
S


(
t
)











D

i
,
M


(
t
)

,



if




T

i
,
S

st

(
t
)






D

i
,
S


(
t
)



r

i
,
S


(
t
)







,






and a transmission task of a SBS is expressed as:








D

i
,
S

tr

(
t
)

=

{






T

i
,
S

st



r

i
,
S




(
t
)


,



if



T

i
,
S

st



(
t
)


<



D

i
,
S


(
t
)



r

i
,
S


(
t
)











D

i
,
S




(
t
)


,



if



T

i
,
S

st



(
t
)






D

i
,
S


(
t
)



r

i
,
S


(
t
)












communication cost of MD i is obtained by using the following equation:






c
i
[D
i(t)]=θiDi,Mtr(t)+ηiDi,Str(t),t∈T


where θi represents communication cost for a unit of bit of MD i in the MBS, and ηi represents communication cost for a unit of bit of MD i in the SBS.


To simplify the analysis, it is assumed in the embodiment that a unit energy consumption cost is represented as








λ



k
^




0

,


k
^




{

L
,
M
,
S

}

.






Therefore, the energy consumption cost is obtained by using the following equation:








e





i
,

k
^



[


D
i

(
t
)

]


=

λ




k
^


E





i
,

k
^



(
t
)



,

t


T
.






In order to improve the user's experience for the computation offload service, a cloud server provider has to consume costs. Apparently, offload services are not free, and an edge-cloud server usually requires compensations to share resources. It is assumed that pi,k(t), k∈{M,S} ($/bit) represents a unit payment cost of MD i for BS k in a time slot t, the compute service cost of MD i is obtained by using the following equation:









s
i

[


D
i

(
t
)

]

=




k


{

M
,
S

}







p

i
,
k


(
t
)




D

i
,
k


(
t
)




,

i

M

,




and the service utility of the is obtained by using the following equation:








u
k

[


p
k

(
t
)

]

=






i

M







p

i
,
k


(
t
)




D

i
,
k


(
t
)





"\[LeftBracketingBar]"


,

k


{

M
,
S

}


,








where pk(t)custom-character[p1,k(t), . . . , pm,k(t)] represents a price strategy set of the MBS or a price strategy set of the SBS.


Third Embodiment

Based on the local computing model and the edge-cloud computing model established in the first embodiment and the resource cost to be consumed for offloading tasks obtained in the second embodiment, a device revenue maximization problem model and a MEC server revenue maximization problem model are established based on a Lyapunov optimization theory and a minimum drift decreasing utility function in the embodiment.


In the present disclosure, the task offloading and computation resource allocation problem is described as a deterministic optimization problem, and it is assumed that the dwell time period Ti,Sst(t) S∈N and the computation waiting time period Ti,kwt(t), k∈{M,S} are known in advance.


In the present disclosure, it is assumed that mobile devices are always rational, and seek an optimal offloading decision to maximize the long-term revenue. For an optimal decision of each of the MDs/buyers, an offloaded task, communication cost, energy cost and payment cost should be considered. Based on the above analysis, an objective function of a MD/buyer i in a time slot t is expressed as:






U
b

i

[D
i(t)]=ui[Di(t)]−ci[Di(t),Ti,Sst(t)]−ei,L[Di(t),Ti,Sst(t)]−si[Di(t)].


To ensure the computation performance in the long term evolution, for the MD/buyer i, the device revenue maximization problem model is expressed as:










P

1

-

buyer
:













max





f

i
,
L


(
t
)

,


D

i
,
k


(
t
)






U

b
i


_


=


lim

T


+







1
T


E





t
=
0


T
-
1




{


U

b
i


[


D
i



(
t
)


]

}
















s
.
t
.







T

i
,
L

pi

(
t
)




τ
i
d


,








f

i
,
L

min




f

i
,
L


(
t
)



f

i
,
L

max


,

t

T

,







0



D
i

(
t
)




Q
i

(
t
)


,

t

T

,

and










Q
i



(
t
)


_

=



lim

T


+






sup


1
T






t
=
0


T
-
1



E


{


Q
i

(
t
)

}







+


.







A Lyapunov function for a computation task queue of the MD i is represented as:








L
i

(


Q
i

(
t
)

)

=


1
2




{


Q
i

(
t
)

}

2






where Li(custom-character(t))≥0. Thus, a conditional Lyapunov drift is expressed as:





Δ(custom-character(t))=E{Li(custom-character)−Li(custom-character(t))|custom-character(t)}.


Based on an online optimal decision, an upper bound of a drift decreasing utility function is minimized, which is defined as: Δ(custom-character(t))−ViE{Ui[Di(t)]|custom-character(t)}, where Vi≥0 is a non-negative control parameter. To obtain the upper bound of the drift decreasing utility function, based on control parameters Vi≥0 and Ai(t)∈[0, Aimax] provided according to any possible decision, the following inequality may be obtained:







Δ


(


Q
i

(
t
)

)


-


V
i


E


{



U
i

[


D
i

(
t
)

]





"\[LeftBracketingBar]"



Q
i

(
t
)



}











B
i

+



Q
i

(
t
)




A
i

(
t
)


-

E


{



Q
i

(
t
)




D
i

(
t
)





"\[LeftBracketingBar]"



Q
i

(
t
)



}


-


V
i


E


{



U
i

[


D
i

(
t
)

]





"\[LeftBracketingBar]"



Q
i

(
t
)



}










where







B
i


=


1
2




{



(



D
i

(
t
)

max

)

2

+


(



A
i

(
t
)

max

)

2


}

.






The upper bound of the drift decreasing utility function may be obtained by minimizing the right-hand side (RHS) of the above inequality. Thus, the optimization problem P1-buyer may be transformed to:










P

2

-

buyer
:













max






f

i
,
L


(
t
)

,


D

i
,
k


(
t
)






Y

b
i




(
t
)


=



V
i

·


U

b
i


[


D
i



(
t
)


]


+




k


{

L
,
M
,
S

}





Q
i




(
t
)

·

D

i
,
k





(
t
)



-


Q
i




(
t
)

·

A
i




(
t
)

















s
.
t
.







T

i
,
L


p

t


(
t
)




τ
i
d


,











f

i
,
L

min




f

i
,
L


(
t
)



f

i
,
L

max


,

t

T

,










0



D
i

(
t
)




Q
i

(
t
)


,


t

T

,












Q
i



(
t
)


_

=



lim

T


+





sup


1
T






t
=
0


T
-
1



E


{


Q
i

(
t
)

}







+


.








For each of BSs (base stations)/sellers, based on








e





i
,

k
^



[


D
i

(
t
)

]


=

λ



k
^


E



i
,

k
^




(
t
)



,

t

T

,









s
i

[


D
i

(
t
)

]

=




k


{

M
,
S

}






p

i
,
k


(
t
)




D

i
,
k


(
t
)




,

i


M


and











u
k

[


p
k

(
t
)

]

=






i

M






p

i
,
k


(
t
)




D

i
,
k


(
t
)




,

k


{

M
,
S

}


,




revenues Uskk∈{M,S} of the MBS/SBS are respectively defined as:









P

1



-

seller



(
MBS
)

:












max



p

i
,
M


(
t
)

,


β

i
,

t
M



(
t
)





U

s
M


_


=


lim

T


+







1
T



E
[




t
=
0


T
-
1



{



u
M

[


p
M

(
t
)

]

-





i




e

i
,
M


[



D

i
,
M


(
t
)

,


T

i
,
M


w

t


(
t
)


]





]
















s
.
t
.











p

i
,
M


(
t
)


0

,


t

T

,
















T

i
,
M


c

o


(
t
)



τ
i
d


,

t

T

,











β

l
M

min




β

i
,

l
M



(
t
)



β

l
M

max


,

t

T

,










P

1

-

seller



(
SBS
)

:









max



p

i
,
S


(
t
)

,


β

i
,

l
S



(
t
)






U

s
S


_


=


lim

T


+







1
T


E





t
=
0


T
-
1



{



u
S

[


p
S

(
t
)

]

-





i




e

i
,
S


[



D

i
,
S


(
t
)

,


T

i
,
S


s

t


(
t
)

,


T

i
,
S


w

t


(
t
)


]


















s
.
t
.



p

i
,
S


(
t
)



0

,

t

T

,












T

i
,
S


c

o


(
t
)



τ
i
d


,


t

T

,











β

l
s

min




β

i
,

l
s



(
t
)



β

l
s

max


,


t

T






The revenues of the MBS and the SBS are related to a pricing set pi,k(t), an allocated computational power βi,lk(k) and a computation waiting delay Ti,kwt(t) in the time slot t. In order to ensure that the offloaded tasks are processed within the constrained time period, it is required for the SBS S to consider the waiting time period Ti,Sst(t) of MD i. Furthermore, based on a maximum theory, the problem P1-seller may be transformed to the following problem:









P

2

-

seller



(
MBS
)

:









max



p
M

(
t
)

,

β

i
,

l
M







Y

s
M


(
t
)


=



u
M

[


p
M

(
t
)

]

-





i




e

i
,
M


[



D

i
,
M


(
t
)

,


T

i
,
M

wt

(
t
)


]














s
.
t
.








p

i
,
M




(
t
)



0

,

t

T

,














T

i
,
M


c

o


(
t
)



τ
i
d


,


t

T

,








β

l
M

min




β

i
,

l
M



(
t
)



β

l
M

max


,


t

T










P

2



-

seller






(
SBS
)


:








max



p
S

(
t
)

,

β

i
,

l
S







Y

s
S


(
t
)


=



u
S

[


p
S

(
t
)

]

-





i




e

i
,
S


[



D

i
,
S


(
t
)

,


T

i
,
S


s

t


(
t
)

,


T

i
,
S


w

t


(
t
)


]














s
.
t
.







p

i
,
S




(
t
)



0

,

t

T

,














T

i
,
S


c

o


(
t
)



τ
i
d


,


t

T

,








β

l
s

min




β

i
,

l
s



(
t
)



β

l
s

max


,


t


T
.






Based on P2-seller, each of the sellers periodically announces a latest market prices pk(t) based on the user's requirement Di,k(t) and a current network state (such as the calculated waiting time period Ti,kwt(t) and the calculated dwell time period Ti,Sst(t)) in the time slot t. With the changes of the requirement and the network state, the seller dynamically adjusts the pricing strategy until market equilibrium is reached, and the allocated computing power βi,lk(t) is adaptively adjusted.


Fourth Embodiment

In the third embodiment, it is assumed in the present disclosure that the dwell time period Ti,Sst(t) and the waiting delay Ti,kwt(t),∈{M,S} are known. However, due to the random movements of the users and burst computation requirements, the dwell time period and the waiting delay are uncertain at a beginning of each of time slots. Decisions may be made by using average historical values of the dwell time period and the waiting delay or predicting the dwell time period and the waiting delay. However, in a real time-varying environment, it is difficult to perform high-precision predictions. Imprecise results affect computation performance and offloading success rate. For example, in a case that a predicted dwell time period and a predicted waiting delay are respectively greater than an actual dwell time period and an actual waiting delay, the computation offloading cost increases, or even the offloading process fails. Therefore, a two-stage stochastic programming method is performed according to the present disclosure to perform posteriori remedy to compensate for previous inaccurate predictions.


In order to deal with an uncertain dwell time period Ti,Sst(t) and an uncertain waiting delay Ti,kwt(t), a scenario set having uncertain parameters is considered in the present disclosure. Ωi,Sst(t) represents a scenario set based on possible dwell time periods of the MD i in a SBS, and Ωi,kwt(t) represents a scenario set based on possible waiting delays of the MD i in a BS k, k∈{M, S}. Based on a Cartesian product, a composite scenario ΩSst(t) based on dwell time periods of all MDs and a composite scenario Ωkwt(t) based on waiting delays of all MDs may be respectively expressed as:









Ω
S
st

(
t
)

=





i
=
1

m



Ω

i
,
S

st

(
t
)


=



Ω

1
,
S

st

(
t
)

×

×


Ω

m
,
S

st

(
t
)




,

and








Ω
k
wt

(
t
)

=





i
=
1

m



Ω

i
,
k

wt

(
t
)


=



Ω

1
,
k

wt

(
t
)

×

×



Ω

m
,
k

wt

(
t
)

.







It is noted that based on Ubi[Di(t)]=ui[Di(t)]−ci[Di(t),Ti,Sst(t)]−ei,L[Di(t),Ti,Sst(t)]−si[Di(t)], a revenue of a MD is related to a scenario Ωi,Sst(t) based on a dwell time period. Therefore, the revenue of the MBS is related to the composite scenario ΩMwt(t) and the revenues of the BSs are related to the composite scenario ΩS(Q)=ΩSst(t)×ΩSwt(t), where ΩS(t) represents a composite scenario based on composite time (that is, dwell time period and waiting delays in the SBSs). Hereafter, a game-based distributed two-stage stochastic programming model is analyzed in the present disclosure.


To simplify the analysis, analysis is performed based on the SBSs in the present disclosure. The analysis based on MBS may be easily derived from the analysis based on the SBSs.


In an uncertain and time-varying network environment, actual values of stochastic variables are obtained after performing processes, that is, the dwell time period Ti,Sst(t) and the waiting delay Ti,kwt(t) are obtained after performing the game and uploading offloaded tasks to the cloud. However, after observing actual situations, compensation may be performed on the game strategy by performing a posterior recourse action. Based on the stochastic programming theory, a decision set includes the following two groups according to the present disclosure.


For a decision group in a first stage (game stage), it is required to adopt a strategy for offloading a task Di(t) and a strategy for announcing a price pi,S(t) before obtaining Ti,Sst(t) and Ti,Swt(t) through the game. This stage is referred to as a first stage or a game stage.


For a decision group in a second stage (recourse stage), allocation of computing power βi,lS(t) may be performed after observing the realizations of Ti,Sst(t) and Ti,Swt(t), which is referred to as a second stage decision. This stage is referred to as a second stage or a recourse stage.


Ti,Sst(t)∈Ωi,Sst(t) represents a realization of a dwell time period in a time slot t, and ωS(t)=(T1,Sst, . . . , Tm,Sst(t), Ω1,Sqt, . . . , Ωm,Sqt) represents a composite realization of the dwell time period in the time slot t. p[Ti,Sst(t)]∈[0,1] and p[ωS(t)]∈[0,1] represent probabilities. Based on the stochastic programming theory, an optimal problem of P2-buyer may be expressed as a two-stage stochastic programming problem:









P

3

-

buyer



(

two
-
stage

)

:












max



f

i
,
L


(
t
)

,


D

i
,
k


(
t
)






Y

b
i


(
t
)


=



V
i



{



u
i

[


D
i

(
t
)

]

-


s
i

[


D
i

(
t
)

]











-


E


Ω

i
,
S

st

(
t
)





{


c
i

(



D

i
,
t


(
t
)

,



T

i
,
S


s

t


(
t
)





"\[LeftBracketingBar]"




T

i
,
S


s

t


(
t
)




Ω

i
,
S

st

(
t
)





)

}









-


E


Ω

i
,
S

st

(
t
)





{


e

i
,
L


(



D

i
,
t


(
t
)

,



T

i
,
S


s

t


(
t
)





"\[LeftBracketingBar]"




T

i
,
S


s

t


(
t
)




Ω

i
,
S


s

t


(
t
)





)

}


}







+





k


{

L
,
M
,
S

}






Q
i

(
t
)




D

i
,
k


(
t
)




-



Q
i

(
t
)




A
i

(
t
)
















s
.
t
.



T

i
,
L


p

t


(
t
)




τ
i
d


,











f

i
,
L

min




f

i
,
L


(
t
)



f

i
,
L

max


,

t

T

,











0



D
i

(
t
)




Q
i

(
t
)


,

t

T

,

and












Q
i



(
t
)


_

=



lim

T


+





sup


1
T






t
=
0


T
-
1



E


{


Q
i

(
t
)

}







+


.








Similarly, an optimal problem of P2-seller(SBS) may be expressed as:








P3
-

seller



(

two
-
stage

)

:










max





p
S

(
t
)

,


β

i
,

l
S



(
t
)






Y

s
S


(
t
)


=



u
S

[


p
S

(
t
)

]

-


E


Ω
S

(
t
)


[

R

(



β

i
,

l
S



(
t
)

,


Ω
S

(
t
)


)

]











s
.
t
.



p

i
,
S


(
t
)



0

,

t

T

,









T

i
,
S


c

o


(
t
)



τ
i
d


,

t

T

,

and








β

l
s

min




β

i
,

l
s



(
t
)



β

l
s

max


,

t

T

,








where







R

(



β

i
,

l
s



(
t
)

,


Ω
S

(
t
)


)


=





i





e

i
,
S


[



D

i
,
S


(
t
)

,


T

i
,
S

st

(


ω
S

(
t
)

)

,


T

i
,
S

wt

(


ω
S

(
t
)

)


]



,






and



R

(



β

i
,

l
S



(
t
)

,


Ω
S

(
t
)


)





represents a recourse function.


Expressions of an optimal problem for a macro base station may be derived by those skilled in the art based on the expressions for the small base stations, and not repeated in the embodiment.


Fifth Embodiment

As shown in FIG. 1, each of the MDs may move randomly between two or more SBSs. However, it is only required to perform a decision once in each of the time slots by using the game strategy based on two-stage stochastic programming, thus a sub-optimal solution may be obtained. Therefore, statistical features of Ti,Sst(t) and Ti,Swt(t) may be accurately captured, and the offloading strategy, the pricing strategy and the computation power allocation strategy may be accurately defined, and the game strategy may be further developed by using multi-stage stochastic programming. In the multi-stage stochastic programming, a computation offloading process is divided into H sub-slices (H□{τ1, τ2, . . . , τH}). In a sub-slice τh, multi-stage stochastic programming is performed, and then the task offloading Dih), τh∈H, the price pi,SH) and the computation power allocation βi,lS(t) are optimal. ξi,Ssth) represents a possible dwell time period in the sub-slice τh, and ξSh) represents possible composite time in the sub-slice τh. The dwell time periods in all composite scenarios in all sub-time slots are expressed as:








ξ

i
,
S


s

t


=





H


h
=
1




ξ

i
,
S


s

t


(

τ
h

)


=



ξ

i
,
S


s

t


(

τ
1

)

×

×


ξ

i
,
S


s

t


(

τ
H

)




,




and


the composite time in all composite scenarios in all sub-time slots is expressed as:







ξ
S

=





h
=
1

H



ξ
S

(

τ
h

)


=



ξ
S

(

τ
1

)

×

×



ξ
S

(

τ
H

)

.







After dividing a task time period t of an offloading task into H sub-time slots, a distributed stochastic programming model having 2 H stages is obtained as follows.


A task offloading optimization problem of a MD/buyer i may be transformed to the following optimization problem of P3-buyer(multi-stage):







P

3

-

buyer



(

multi
-
stage

)

:












max



f

i
,
L


(

τ
h

)

,


D

i
,
k


(

τ
h

)




Y

b
i




{



D
i

(

τ
1

)

,


,


D
i

(

τ
H

)

,


f

i
,
L


(

τ
1

)

,


,


f

i
,
L


(

τ
H

)


}









=


V
i




{



u
i

[


D
i

(

τ
1

)

]

-


c
i

[


D
i

(

τ
1

)

]

-


e

i
,
L


[


D
i

(

τ
1

)

]

-


s
i

[


D
i

(

τ
1

)

]














+

E



ξ

i
,
S

st

(

τ
h

)

|


ξ

i
,
S

st

(

τ

h
-
1


)









h
=
2

H


{



u
i

[


D
i

(

τ
h

)

]

-


s
i

[


D
i

(

τ
h

)

]


}












-

E



ξ

i
,
S

st

(

τ
h

)

|


ξ

i
,
S

st

(

τ

h
-
1


)









h
=
2

H



c
i

[



D
i

(

τ
h

)

,


T

i
,
S


s

t




(

τ
h

)



]












-


E





ξ

i
,
S

st

(

τ
h

)





"\[LeftBracketingBar]"



ξ

i
,
S

st

(

τ

h
-
1


)










h
=
2

H



e

i
,
L


[



D
i

(

τ
h

)

,


T

i
,
S

st

(

τ
h

)


]



}










+


Q
i

(

τ
1

)





D
i

(

τ
1

)


+


E



ξ

i
,
S

st

(

τ
h

)





"\[LeftBracketingBar]"



ξ

i
,
S

st

(

τ

h
-
1


)









h
=
2

H




Q
i

(

τ
h

)




D
i

(

τ
h

)















-


Q
i

(

τ
1

)





A
i

(

τ
1

)


-


E



ξ

i
,
S

st

(

τ
h

)

|


ξ

i
,
S

st

(

τ

h
-
1


)








h
=
2

H




Q
i

(

τ
h

)




A
i

(

τ
h

)


















s
.
t
.


f

i
,
L

min





f

i
,
L


(

τ
h

)



f

i
,
L

max


,


τ
h


H

,














h
=
1

H



D
i

(

τ
h

)


=


D
i

(

τ
h

)


,


τ
h


H

,

and










0





h
=
1

H



D
i

(

τ
h

)





Q
i

(

τ
h

)


,


τ
h


H

,





where Dih) represents processed tasks of MD i in the sub-time slot τh, and fi,Lh) represents a local computation frequency of MD i in the sub-time slot τh.


Similarly, an optimization problem of a SBS/seller is transformed to:







P

3

-

seller



(

multi
-
stage

)

:












max



p

i
,
S


(

τ
h

)

,


β

i
,
S


(

τ
h

)




Y

s
S




{



p

i
,
S




(

τ
1

)


,


,


p

i
,
S


(

τ
H

)

,


β

i
,
S




(

τ
1

)


,


,



β

i
,
S


(

τ
H

)


}









=



u
S

[


p
S



(

τ
1

)


]

-




i
=
1

m


{



e

i
,
S


[


D

i
,
S




(

τ
1

)


]

+


p

i
,
S

f

[


D

i
,
S


(

τ
1

)

]


}













+

E



ξ
S

(

τ
h

)


|



ξ
S

(

τ

h
-
1


)



"\[LeftBracketingBar]"











h
=
2

H



u
S

[


p
S



(

τ
h

)


]












-

E



ξ
S

(

τ
h

)


|



ξ
S

(

τ

h
-
1


)



"\[LeftBracketingBar]"











j
=
1

m





h
=
2

H



e

i
,
S


[



D

i
,
S




(

τ
h

)


,


T

i
,
S


s

t




(

τ
h

)


,


T

i
,
S


w

t




(

τ
h

)



]

















s
.
t
.








p

i
,
S




(

τ
h

)



0

,


τ
h


H

,














E

ξ
s




{




h
=
1

H


[



T

i
,
S


u

p


(

τ
h

)

+


T

i
,
S


w

t


(

τ
h

)

+





l
k



L
k







D
i

(

τ
h

)



γ
i



β

i
,

l
k








}




τ
i
d


,

and









β

l
S

min




β

i
,

l
S



(

τ
h

)



β

l
S

max


,


τ
h


H

,

where








E

ξ
s




{




h
-
1

H


[



T

i
,
S


u

p


(

τ
h

)

+


T

i
,
S


w

t


(

τ
h

)

+





l
k



L
k







D
i

(

τ
h

)



γ
i



β

i
,

l
k








}




τ

i


d





to ensure that a sum of computation offloading time periods in all sub-slices does not exceed the constraint time length τid.


Similarly, a multi-stage stochastic model for a macro base station may be established by those skilled in the art based on the idea of establishing the multi-stage stochastic model for the small base station, which is not repeated in the present disclosure.


Sixth Embodiment

In the embodiment, a scenario tree is used to solve the optimization problem of P3-buyer(multi-stage) and the optimization problem of P3-seller(multi-stage), transforming the stochastic programming problem to deterministic equality programming (DEP).


πi,Sstξi,Sst is defined as an implementation of ξi,Sst, and πS∈ξS is defined as an implementation of ξS. The scenario tree is branched according to the implementations of ξi,Ssth) and ξSh), τh∈H FIG. 4 shows a typical scenario tree based on dwell time periods of a MD/Buyer i, in which the evolution of ξi,Ssth), τh∈H is described based on two implementations. In the scenario tree based on the dwell time periods, a root node is associated with a first decision stage in which no dwell time period is observed. The root node is connected to child nodes, and the child nodes are associated with a next stage. Each of the nodes is connected to an associated child node in the next stage until to a leaf node. Each of the child nodes has two implementations associated with two random dwell time periods Ti,Ssth):Ti,Sst,1h),Ti,Sst,2h).


After establishing the scenario tree, the stochastic programming problem between the buyer P3-buyer(multi-stage) and the seller P3-seller(multi-stage) is transformed to a DEP problem according to the present disclosure.


PTπi,Sst represents a path from the root node to a leaf node in the scenario tree based on the dwell time periods. For a determined scenario πi,Sst∈ξi,Sst, PTπi,Sst is determined. Di1) represents a task offloading decision at the root node, and Dih) represents a offloading decision at a node in a 2 h-th stage in the path PTπi,Sst. The multi-stage stochastic programming of the MD/buyer i may be transformed to the following DEP problem:











max



f

i
,
L


(

τ
h

)

,


D

i
,
S


(

τ
h

)





Y

b
i


(
t
)


=


V
i



{



u
i

[


D
i

(

τ
1

)

]

-


c
i

[


D
i

(

τ
1

)

]

-


e

i
,
L


[


D
i

(

τ
1

)

]

-


s
i

[


D
i

(

τ
1

)

]










+





π

i
,
S

st



ξ

i
,
S

st




p


(

π

i
,
S

st

)






h
=
2

H


{



u
i

[


D
i

π

i
,
S

st


(

τ
h

)

]

-


s
i

[


D
i

π

i
,
S

st


(

τ
h

)

]


}










-





π

i
,
S

st



ξ

i
,
S

st





p

(

π

i
,
S

st

)






h
=
2

H


{


c
i

[



D
i

π

i
,
S

st


(

τ
h

)

,


T

i
,
S


st
,


π



i
,
S

st



(

τ
h

)


]

}











-





π

i
,
S

st



ξ

i
,
S

st





p

(

π

i
,
S

st

)






h
=
2

H


{


e

i
,
L


[



D
i

π

i
,
S

st


(

τ
h

)

,


T

i
,

S
'



st
,

π

i
,
S

st



(

τ
h

)


]

}





}








+

Q
i




(

τ
1

)



D
i

π

i
,
S

st




(

τ
1

)


-


Q
i



(

τ
1

)



A
i



(

τ
1

)








+





π

i
,
S

st



ξ

i
,
S

st




p


(


π

i
,
S

s


f

)






h
=
2

H


{



Q
i



(

τ
h

)




D
i

π

i
,
S

st


(

τ
h

)


-


Q
i



(

τ
h

)



A
i



(

τ
h

)



}


















s
.
t
.






f

i
,
L

min




f

i
,
L




(

τ
h

)




f

i
,
L


max


,


τ
h


H

,






















h
=
1

H



D
i

(

τ
h

)


=


D
i

(

τ
h

)


,


τ
h


H

,










0





h
=
1

H



D
i

(

τ
h

)





Q
i

(

τ
h

)


,


τ
h


H

,

and




















D

i
,

k
ˆ



π

i
,
S

st


(

τ
h

)

=


D

i
,


π

i
,

S



st




(

τ
h

)



,



π

i
,
S


s

t



,



π

i
,
S


s

t




ξ

i
,
S


s

t











π

i
,
S


s

t




π

i
,

S




s

t



,

PT
(



D

i
,

k
ˆ



π

i
,
S

st


(

τ
h

)

=


PTD

i
,


π

i
,

S



st


(

τ
h

)











where p(πi,Sst) represents a probability of a scenario πi,S′st, and the constraint









D

i
,

k
ˆ



π

i
,
S

st


(

τ
h

)

=


D

i
,


π

i
,

S



st


(

τ
h

)


,



π

i
,
S

st


,


π

i
,
S

st




ξ

i
,
S

st






is a unexpected constraint indicating that the








π

i
,
S

st



π

i
,

S



st


,

PT
(



D

i
,

k
ˆ



π

i
,
S

st


(

τ
h

)

=


PTD

i
,


π

i
,

S



st


(

τ
h

)







offloading decision should be equivalent in different paths.


Similarly, PTπS represents a path from a root node to a leaf node in a composite scenario tree. For a determined scenario πS∈ξSh), a DEP model of a SBS/seller is expressed as:











max



p

i
,
S


(

τ
h

)

,


β

i
,
S


(

τ
h

)





Y

s
S


(
t
)


=



u
S

[


p
S

(

τ
1

)

]

-




i
=
1

m



{



e

i
,
S


[


D

i
,
S


(

τ
1

)

]

-


p

i
,
S

f

[


D

i
,
S


(

τ
h

)

]


}









+





π
S



ξ
S






p

(

π
S

)






h
=
2

H




u
2

[


p
S

π
s


(

τ
h

)

]












-




i
=
1



m








π
S



ξ
S





p

(

π
S

)






h
=
2

H



e

i
,
S


[



D

i
,
S


π
s


(

τ
h

)

,


T

i
,
S


st
,

π
s



(

τ
h

)

,


T

i
,
S


qt
,

π
s



(

τ
h

)


]












-




i
=
1



m








π
S



ξ
S




p


(

π
S

)






h
=
2

H



p

i
,
S

f

[



D

i
,
S


π
s


(

τ
h

)

,


T

i
,
S


st
,

π
s



(

τ
h

)

,


T

i
,
S


qt
,

π
s



(

τ
h

)


]


















s
.
t
.








p

i
,
S


(

τ
h

)


0

,


τ
h


H

,




















E

ξ
s




{





h
=
1

H




T

i
,
S

up

(

τ
h

)


+


T

i
,
S

wt

(

τ
h

)

+





l
k



L
k








D
i

(

τ
h

)



γ
i



β

i
,

l
k






}




τ

i


d


,











β

l
S

min




β

i
,

l
S



(

τ
h

)



β

l
S

max


,


τ
h


H

,















D

i
,
S


π
s




(

τ
h

)


=


D

i
,
S


π

s






(

τ
h

)



,



π
S


,


π

S






ξ
S










π
S



π

S




,


PT

(


D

i
,
S


π
s


(

τ
h

)

)

=

PT

(


D

i
,
S


π

s




(

τ
h

)

)










where p(πS) represents a probability of a scenario πS, and DiπSh) represents a task processed in an h-th stage in a path PTπS.


Two stages are included in each of sub-time slots. As shown in FIG. 5, in an even-numbered stage, computation power recourse βi,Sh) is performed on an odd-numbered stage to compensate for the uncertain dwell time period and the uncertain computation waiting delay in the sub-time slot. In a sub-time slot, offloading task recourse Di,Sh) is performed on a previous sub-time slot to perform accurate compensation and ensure that offloaded tasks may be processed in a constraint time period. In general, the optimal strategy for the multi-stage stochastic programming is related to the number of stages, that is, more divided stages indicate more optimal strategy and more complex solution process.


Seventh Embodiment

In the embodiment, the resource allocation strategy described above in the present disclosure is analyzed. To simplify the analysis, an optimal game strategy in one stage is analyzed in the present disclosure, which may be easily extended to multiple stages in the same way.


1. Analysis of Optimal Game


(1) Analysis of the Optimal Strategy for MDs













Y

b
i


(
t
)






D

i
,
L


(
t
)



=



V
i



{


ρ
i



(

1
+


D

i
,
L


(
t
)


)



ln


2


}


+


Q
i

(
t
)
















Y

b
i


(
t
)






D

i
,
M


(
t
)



=


V


{



ρ
i



(

1
+


D

i
,
M


(
t
)


)



ln


2


-

θ
i

-



λ
i




P
i

(
t
)




r

i
,
M


(
t
)


-


p

i
,
M


(
t
)


}


+


Q
i

(
t
)














Y

b
i


(
t
)






D

i
,
S


(
t
)



=

{






V


{



ρ
i



(

1
+


D

i
,
S


(
t
)


)



ln


2


-

θ
i

-



λ
i




P
i

(
t
)




r

i
,
M


(
t
)


-


p

i
,
S




(
t
)



}


+


Q
i

(
t
)


,





if



T

i
,
S


s

t



<



λ
i




P
i

(
t
)




r

i
,
M


(
t
)










V


{



ρ
i



(

1
+


D

i
,
S


(
t
)


)



ln


2


-

η
i

-



λ
i




P
i

(
t
)




r

i
,
S


(
t
)


-


p

i
,
S




(
t
)



}


+


Q
i

(
t
)


,





if



T

i
,
S


s

t



<



λ
i




P
i

(
t
)




r

i
,
S


(
t
)











According to P2-buyer, the above first-order partial derivatives











Y

b
i


(
t
)






D

i
,
L


(
t
)



,






Y

b
i


(
t
)






D

i
,
M


(
t
)





and







Y

b
i


(
t
)






D

i
,
S


(
t
)








and may be obtained. In addition, second-order partial derivatives











2



Y

b
i


(
t
)






(


D

i
,
L


(
t
)

)

2



<=
0

,





2



Y

b
i


(
t
)






(


D

i
,
M


(
t
)

)

2



<

0


and






2



Y

b
i


(
t
)






(


D

i
,
S


(
t
)

)

2




<
0





may be obtained. Since








f

i
,
L

min




f

i
,
L


(
t
)



f

i
,
L

max


,

t

T

,

0



D
i

(
t
)




Q
i

(
t
)


,


t


T


and





Q
i



(
t
)


_



=



lim

T


+





sup


1
T


E





t
=
0


T
-
1



{


Q
i

(
t
)

}






+








are affine functions, Ybi(t) is a convex function for Di(t). The optimization problem for the buyer/MDs may be solved by Lagrangian multipliers and Karush-Kuhn-Tucker (KKT) conditions, and the optimal strategy is expressed as:









f

i
,
L

*

(
t
)

=


(



ρ
i




A
L

·
ln


2


-
1

)




γ
i


τ
i
d




,

t

T










D

i
,
k

*

(
t
)

=



ρ
i




A
k

·
ln


2


-
1


,

k


{

M
,
S

}


,

t

T






where







A
L

=

-



Q
i

(
t
)


V
i




;








A
M

=



p

i
,
M


(
t
)

+

θ
i

+



λ
i




P
i

(
t
)




r

i
,
M


(
t
)


-



Q
i

(
t
)


V
i




;
and







A
S

=

{







θ
i

+



λ
i




P
i

(
t
)




r

i
,
M


(
t
)


+


p

i
,
S


(
t
)

-



Q
i

(
t
)


V
i



,



if



T

i
,
S


s

t



<



λ
i




P
i

(
t
)




r

i
,
M


(
t
)











η
i

+



λ
i




P
i

(
t
)




r

i
,
M


(
t
)


+


p

i
,
S


(
t
)

-



Q
i

(
t
)


V
i



,



if



T

i
,
S


s

t







λ
i




P
i

(
t
)




r

i
,
M


(
t
)







.






(2) Analysis of the Optimal Strategy for BSs


According to P2-seller(SBS), the first-order partial derivative of Ysk(t) to pi,k(t) may be obtained as follows:











Y

s
k


(
t
)






p

i
,
k


(
t
)



=



D

i
,
k


(
t
)

+



p

i
,
k


(
t
)







D

i
,
k


(
t
)






p

i
,
k


(
t
)




-



2


λ
k



k
k




D

i
,
k


(
t
)



γ
i
2



T

i
,
k


p

t









D

i
,
k


(
t
)






p

i
,
k


(
t
)












-

δ

i
,
k

f









D

i
,
k


(
t
)






p

i
,
k


(
t
)



·
1



{


T

i
,
k


c

o


,


(
t
)



τ
i
d



}





In a case that a trading price in the market meets pi,k(t)≥0, it is determined that











2



Y

s
k


(
t
)






(


p

i
,
k




k

(
t
)


)

2



<
0

,

k



{

M
,
S

}

.






In addition, since pi,M(t)≥0, t∈T, Ti,Mco(t)≤τid, t∈T, βlMmin≤βi,lM(t)≤βlMmax, t∈T, pi,S(t)≥0, t∈T, Ti,Sco(t)≤τid, t∈T, and βlSmin≤βi,lS(t)≤βlSmax, t∈T are affine functions and Ysk(i) is a convex function for pi,k(t), the optimal problem for the BSs/sellers may be solved based on Lagrangian multipliers and KKT, and the optimal strategy is expressed as:








P

i
,
k

*

(
t
)

=


2


λ
k



k

l
k







D

i
,
k

*

(
t
)



γ
i
2



T

i
,
k


p

t




-



D

i
,
k

*

(
t
)


Θ
k











where



Θ
k


=





D

i
,
k

*

(
t
)






p

i
,
k


(
t
)




,

k



{

M
,
S

}

.






Definition 1: in a case that a price pi,k(t) of a seller k is determined, Di,kSE(t) meets









Y

b
i


(


D

i
,
k


S

E


(
t
)

)

=


sup



D

i
,
k

min

(
t
)




D

i
,
k


(
t
)




D

i
,
k

max

(
t
)





{


Y

b
i


(


D

i
,
k


(
t
)

)

}



,




t

T


;





and in a case that an offloading task Di,k(t) of a buyer i is determined, pi,kSE(t) meets









Y

s
k


(


p

i
,
k


S

E


(
t
)

)

=


sup



p

i
,
k


(
t
)


0




{


Y

s
k


(


p

i
,
k


(
t
)

)

}



,



t


T
.







Hereinafter, it is proved that the optimal solution (Di,k*(t), pi,k*(t)) is (Di,kSE(t), pi,kSE(t)) based on the following three lemmas according to the present disclosure.


Lemma 1: in a case that a price pi,k(t) of a BS/seller k is determined, a revenue function Ybi(Di,k(t)) of a MD/buyer reaches a maximum value at Di,k*(t).


Proof: based on the above analysis, it is known that Ybi is a convex function for Di,k(t). Therefore, the revenue function Ybi(Di,k (t)) reaches a maximum value at Di,k*(t). According to Definition 1, Di,k*(t) is an SE solution Di,kSE(t).


Lemma 2: for a buyer, an optimal offloading task Di,k*(t) decreases as a price pi,k(t) of a seller increases.


Proof: based on









f

i
,
L

*

(
t
)

=


(



ρ
i




A
L

·
ln


2


-
1

)




γ
i


τ
i
d




,

t

T

,




it may be obtained that:











D

i
,
k

*

(
t
)






p

i
,
k


(
t
)



=



-


ρ
i


A
k
2




ln

2

<
0.





Therefore, it can be obtained that Di,k*(t) is a monotonically decreasing function for pi,k(t). That is, a higher transaction price indicates fewer tasks to be offloaded by the buyer, thus little revenue or no revenue may be obtained by the seller. Therefore, the seller should provide an appropriate price to maximize the revenue. The optimal price of the seller may be obtained by solving











Y

s
k


(


p

i
,
k


(
t
)

)






p

i
,
k


(
t
)



=
0.




Lemma 3: in a case that an optimal offloading task Di,k*(t) of a MD/buyer i is constant, Ysk(pi,k(t)) reaches a maximum value at pi,k*(t).


Proof: it has been proved in the present disclosure that the revenue Ysk of the seller is a convex function for pi,k(t). Therefore, according to Definition 1, Ysk(pi,k(t)) reaches a maximum value at pi,k*(t), and pi,k*(t) is an SE solution pi,kSE(t).


In short, (Di,k*(t), pi,k*(t)) is the optimal task offloading and pricing decision, and is the SE solution (Di,kSE(t), pi,kSE(t)).


It can be seen from FIGS. 6 to 7 that price is not to be reduced with the iterations, and the price gradually be converged to an optimal pricing strategy as the number of iterations increases. In FIG. 6, referring to a point at which the number of iterations is 50, four curves from top to bottom respectively represent a second stage of a MBS game, a first stage of the MBS game, a first stage of a SBS game, and a second stage of the SBS game. In FIG. 7, referring to a point at which the number of iterations is 50, four curves from top to bottom respectively represent a second stage of a SBS game, a first stage of the SBS game, a first stage of a MBS game and a second stage of the MBS game. In addition, the number of unloaded tasks decreases with the increase of price. When the price is no longer increased, the unloading strategy is stable, verifying the effectiveness of the method. It can be seen from FIGS. 8 to 9 that the transaction price in the market gradually increases with the backlog of the task queue of the buyers, and the revenues of the buyers gradually decreases with the increase of the offloading cost. It can be seen from FIG. 9 that there is a trade-off [O(1/V),O(V)] between an average queue backlog and the revenue, verifying the rationality of the method.


In the embodiments according to the present disclosure, a subscript S represents a parameter associated with a small base station, a subscript M represents a parameter associated with a macro base station, and a subscript i represents a parameter associated with a mobile device.


Although the embodiments of the present disclosure have been shown and described, it should be understood by those skilled in the art that a variety of variations, modifications, replacements and variants can be made based on these embodiments without departing from the principles and spirit of the present disclosure, and the scope of the present disclosure is limited by the appended claims and their equivalents.

Claims
  • 1. A distributed computation offloading method based on computation-network collaboration in a stochastic network, comprising: establishing a local computing model and an edge-cloud computing model based on differentiated service quality requirements, task processing costs, and the type of a device in a ultra-dense network UDN environment;establishing, based on the local computing model and the edge-cloud computing model, a device revenue maximization problem model by using a Lyapunov optimization theory and a minimum drift decreasing utility function at the device;establishing, based on the local computing model and the edge cloud computing model, a MEC server revenue maximization problem model by using a maximum value theory at a multi-access edge computing MEC server;respectively establishing, based on a random movement of a user and a burst computation requirement, a composite scenario set based on a dwell time period and a composite scenario set based on a waiting delay;performing compensation on a game strategy by performing a posteriori recourse action, and establishing a game-based stochastic programming model for the device and the MEC server;establishing a scenario tree to transform a multi-stage stochastic programming problem of the device and the MEC server to a DEP problem;solving an optimization problem of the device and an optimization problem of the MEC server based on a Lagrangian multiplier and a KKT condition to obtain an optimal task offloading strategy of the device to the MEC server and an optimal quotation strategy of the MEC server to the device; andin a case that both the optimal task offloading strategy of the device to the MEC server and the optimal quotation strategy of the MEC server to the device meet a Stackelberg equilibrium solution, offloading, by the device based on the optimal task offloading strategy, a task to the MEC server.
  • 2. The distributed computation offloading method according to claim 1, wherein the establishing a device revenue maximization problem model comprises:
  • 3. The distributed computation offloading method according to claim 1, wherein in a case that a task is offloaded to a macro base station, the MEC server revenue maximization problem model is expressed as:
  • 4. The distributed computation offloading method according to claim 1, wherein in a case that a task is offloaded to a small base station, the MEC server revenue maximization problem model is expressed as:
  • 5. The distributed computation offloading method according to claim 1, wherein the establishing a composite scenario set based on a dwell time period and a composite scenario set based on a waiting delay comprises: in a case that a possible dwell time period and a possible waiting time period of a mobile device in a small base station are known, obtaining, based on a Cartesian product, a composite scenario based on dwell time periods of all mobile devices and a composite scenario based on waiting delays of all the mobile devices, wherein a composite scenario set based on the dwell time periods of all the mobile devices is expressed as:
  • 6. The distributed computation offloading method according to claim 1, wherein compensation is performed on the game strategy by using the posteriori recourse action, that is, a two-stage optimal programming model is adopted, parameters are obtained based on the device revenue maximization problem model and the MEC server revenue maximization problem model in a first stage, compensation is performed on the parameters obtained in the first stage by using the posteriori recourse action in a second stage, and the device revenue maximization problem model after the compensation performed by using the posterior recourse action is expressed as:
  • 7. The distributed computation offloading method according to claim 6, wherein a recourse function R(βi,lS(t),ΩS(t)) is expressed as:
  • 8. The distributed computation offloading method according to claim 6, wherein a task offloading process is divided into H sub-time slots, a distributed programming model with 2H stages is obtained, and the device revenue maximization problem model is expressed as:
  • 9. The distributed computation offloading method according to claim 1, wherein the optimal strategy of the device is obtained based on the Lagrangian multiplier and the KKT condition by using the following equations:
Priority Claims (1)
Number Date Country Kind
202111095551.7 Sep 2021 CN national
PCT Information
Filing Document Filing Date Country Kind
PCT/CN2021/128920 11/5/2021 WO