The disclosure relates to apparatuses and methods for selecting candidate end points in a fixed communication network.
As network operators develop new and better technologies, they have to adapt the current running fixed communication network with the latest technologies. This adaptation to the latest technologies is called migration from an existing technology to a newer technology.
To optimize the migration of the fixed communication network and prioritize the areas of the network where the needs are the highest, an operator wants to know where the migration should be done first. To know this, the operator has to identify the end points of the network where a higher bandwidth is or would be necessary.
One way to achieve this is to test the current end points of the network with a higher bandwidth.
However, since many fixed access networks, for example Passive Optical Networks and cable networks, rely on a shared medium, this can only be done for a few end points at a time to limit the overbooking factor and maintain the speed test success probability in an acceptable range. Operators are currently looking for a solution to identify, in each network branch, the end points that are the most likely to use a higher bandwidth. One solution is to offer them a short time trial period so they can test whether they need and/or will need a higher bandwidth in the future.
As the number of end points able to support the higher bandwidth of the short time trial period in each network branch at the same time is limited, it is critical to have a recommendation engine to identify, with maximum reliability (highest hit rate), the end points that are likely to need the higher services after the short time trial period.
However, for now, there is no objective method to identify the end points with higher needs other than predicting them through educated guesses, such as simply monitoring their current telemetry history and recommending the end points associated with the subscribers having the highest data consumption. Moreover, even these simple methods have not been integrated and automated at the BSS/OSS level nor in the access network controller.
Thus, there is a need for apparatuses and methods for selecting candidate end points.
Apparatuses and methods for selecting candidate end points in a fixed communication network are disclosed.
In some embodiments, the disclosure provides a method for selecting candidate end points comprising:
Thanks to these features, the method makes it possible to select the end points of the fixed communication network with the highest probability of needing, or that require, a bandwidth increment for a long time period.
This long time period is longer than the short time period. The long time period can range, for example, from one month to one or several years. The short time period can range, for example, from one week to one month.
The method may also comprise one or more of the following features.
In an embodiment, the method further comprises:
Thanks to these features, two different prediction models are trained and can be compared. Any number of prediction models can be trained based on the same training dataset and success list. A prediction model trained with the first training dataset and the first success list is a prediction model of first generation.
Training various different prediction models can be useful to best cover or explore the fixed communication network. In addition, one model can be more explainable than the other. This more explainable model can be used to gain insights into the reasons why an end point has a higher probability of needing a bandwidth increment for a long time period.
In an embodiment, the method further comprises:
Thanks to these features, a candidate list more accurate than the first candidate list is selected thanks to a second supervised training.
In an embodiment, the method further comprises:
Thanks to these features, in a similar way to the various prediction models of first generation, training various different prediction models of second generation can be useful to best cover or explore the fixed communication network. In addition, one model can be more explainable than the other. This more explainable model can be used to gain insights into the reasons why an end point has a higher probability of needing a bandwidth increment for a long time period.
In an embodiment, performing supervised training of a second success prediction model comprises performing supervised re-training of the first success prediction model for predicting the second success list from the second training dataset, wherein the second success prediction model is the re-trained first success prediction model.
Thanks to these features, the same prediction model is retrained and is more accurate. In other words, it is possible to improve from one generation to another the same prediction model.
In an embodiment, the first candidate list further identifies first candidate end points (O1) outside the first predicted list and/or outside the another first predicted list.
Thanks to these features, the risk of seeing the performance of this model degrade over time is reduced. Indeed, these features allow continuously exploring the entire end point base of the network, to detect new trends as soon as they appear in the network.
In an embodiment, selecting a first candidate list further comprises:
Thanks to these features, the exploration of the end point base of the network is set empirically and randomly. It is then possible to compare the accuracy of the prediction model with the end points selected thanks to methods different from implementing a prediction model.
In an embodiment, the method further comprises determining a success rate of the first predicted list as a function of the second success list and determining a success rate of the first candidate end points outside the first predicted list as a function of the second success list.
Thanks to these features, it is possible to evaluate and monitor the performance of the different methods of determining or selecting the end points.
In an embodiment, the method further comprises:
Thanks to these features, a third prediction model has been trained with the results of the first and second prediction models. Therefore, it is expected to be more accurate than the prediction model(s) of first generation and the prediction model(s) of second generation.
In an embodiment, the method further comprises:
Thanks to these features, a prediction model that is more accurate according to its success rate is responsible for a larger part of the next candidate list, while a smaller part comes from a different prediction model to continue exploring the end point base.
In an embodiment, performing supervised training of a third success prediction model comprises:
Thanks to these features, it is possible to improve from one generation to another the same prediction model.
In an embodiment, the first success prediction model, the second success prediction model and/or the third success prediction model is one of the following:
Thanks to these features, either a simple model is used (logistic regression, naïve Bayes, decision tree, . . . ), in which case the importance of every input parameter can be recovered from the trained model parameters, or a more complex supervised learning model is chosen (random forest, neural network), preventing the direct interpretation of the model parameters. In this latter case, many techniques are available to still determine the contribution of each input parameter to the model decision. These techniques include for example Permutation Feature Importance, Partial Dependence Plots, Individual Conditional Expectations, LIME and SHAP.
In an embodiment, the method according to the disclosure is implemented by a network access manager of the fixed communication network.
In an embodiment, the fixed communication network is an optical distribution network (ODN), for example a Gigabit-capable Passive Optical Network (GPON).
In some embodiments, the disclosure provides an apparatus comprising means for:
In some example embodiments, the means in the apparatus further comprises:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
The at least one memory and the computer program code may be configured to, with the at least one processor, cause the apparatus to:
In some example embodiments, the disclosure also provides an apparatus comprising:
In an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
According to an embodiment, the apparatus further comprises:
For a more complete understanding of example embodiments of the present invention, reference is now made to the following descriptions taken in connection with the accompanying drawings in which:
Example embodiments of the present application are described herein in detail and shown by way of example in the drawings. It should be understood that, although specific embodiments are discussed herein, there is no intent to limit the scope of the invention to such embodiments. To the contrary, it should be understood that the embodiments discussed herein are for illustrative purposes, and that modified and alternative embodiments may be implemented without departing from the scope of the invention as defined in the claims. The sequence of method steps is not limited to the specific embodiments; the method steps may be performed in other possible sequences. Similarly, specific structural and functional details disclosed herein are merely representative for purposes of describing the embodiments. The invention described herein, however, may be embodied in many alternate forms and should not be construed as limited to only the embodiments set forth herein.
A proposition is to leverage the fact that multiple successive campaigns consisting of short time trial periods are needed to cover all the end points of a fixed communication network.
The fixed communication network can be a land mobile network, a land computer network, an FTTH network or even a Gigabit-capable Passive Optical Network.
The following described methods and all the described embodiments can be implemented by the network access manager of the fixed communication network.
A proposition is to use a combination of white-boxed/educated-guesses algorithms and supervised learning models to recommend a list of end points eligible for a short time trial period. After each short time trial period, the supervised learning models are retrained and a reinforcement learning algorithm is used to allocate more or less weight to the different educated-guesses algorithms and supervised learning models.
As illustrated at
Thus, in order to build a first prediction model M1, a first training dataset T1 is provided at step 10 of
A simple example of such an algorithm could consist in recommending for the trial the n subscribers having the highest data consumption in every network branch, n being a network constraint parameter representing the maximum number of upgradable subscribers per branch. To that end, the data consumption of every subscriber can be measured using the network telemetry.
However, as this algorithm is only an educated guess, it is wise to benchmark it against a plain random sampling algorithm, i.e., selecting n subscribers in each network branch at random.
As it is expected that the educated-guess algorithm will perform better than the random sampling, it is possible to allocate more “weight” to the educated-guess algorithm. The allocation is initially set empirically: for example, 80% of the end points will use the educated-guess algorithm and 20% of the end points will use the random sampling.
However, any proportion between the weights of the various methods used to determine the first list of end points T1 can be used. For example, it is possible to determine the first list of end points T1 with 100% of educated-guess end points or 100% of random end points.
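As a purely illustrative, non-limiting sketch of this allocation step, the following Python code mixes an educated-guess selection (highest data consumption per branch) with random sampling according to a configurable share. The data representation (dictionaries with 'id', 'branch_id' and 'consumption' keys) and the 80/20 default split are assumptions made for illustration only.

```python
import random
from collections import defaultdict

def build_first_trial_list(end_points, n_per_branch, guess_share=0.8, seed=0):
    """Sketch: pick up to n end points per branch, mixing an educated-guess rule
    (highest data consumption) with plain random sampling.
    `end_points` is assumed to be a list of dicts with illustrative keys
    'id', 'branch_id' and 'consumption' (e.g. bytes over the last month)."""
    rng = random.Random(seed)
    by_branch = defaultdict(list)
    for ep in end_points:
        by_branch[ep["branch_id"]].append(ep)

    selected = []
    for branch, eps in by_branch.items():
        n = min(n_per_branch, len(eps))
        n_guess = round(n * guess_share)            # e.g. 80% educated guess
        # Educated guess: highest consumers first.
        ranked = sorted(eps, key=lambda ep: ep["consumption"], reverse=True)
        picked = ranked[:n_guess]
        # Random exploration among the remaining end points of the branch.
        remaining = [ep for ep in eps if ep not in picked]
        picked += rng.sample(remaining, min(n - n_guess, len(remaining)))
        selected.extend(ep["id"] for ep in picked)
    return selected
```

The list returned by such a routine would play the role of the first list of end points T1 in the method described above.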
This training dataset T1 further comprises for each training candidate end point at least:
Then, the end points of the list T1 are allocated a bandwidth increment for a short time period.
At the end of the short time period illustrated at step 11, it is possible to provide a first training success list S1 identifying successful training candidate end points to be allocated a bandwidth increment for a long time period.
By definition, the successful training candidate end points of the first success list S1 are a subset of the first training candidate end points comprised in the first list T1.
Then, thanks to the first training dataset T1 and the first success list S1, a supervised training of a first success prediction model M1 for predicting the first training success list from the first training dataset is performed at step 12.
It is possible to combine the first training dataset T1 and the first success list S1 with any other information that the operator can collect about the end points, either from the network telemetry (traffic consumption over time, number and types of Ethernet and Wi-Fi devices connected to the home gateway) or from any other sources, within the limits of the applicable data privacy regulation (address, professional activity, family composition, social media activity, . . . ). For example, at least one of the following inputs can be used to perform the supervised training of the first success prediction model M1:
The first prediction model M1 can be one of the following:
An advantage allowed by the invention is that supervised learning models can always be interpreted (or explained) such that one can recover from the prediction model what are the main reasons for predicting one value or the other.
The choice of model is not critical for this application. Either a simple model is used (logistic regression, naïve Bayes, decision tree, . . . ), in which case the importance of every input parameter can be recovered from the trained model parameters, or a more complex supervised learning model is chosen (random forest, neural network), preventing the direct interpretation of the model parameters; in the latter case, many techniques are available to still determine the contribution of each input parameter to the model decision. These model-agnostic methods include for example Permutation Feature Importance, Partial Dependence Plots, Individual Conditional Expectation, LIME (Local Interpretable Model-Agnostic Explanations) and SHAP (Shapley Additive Explanations).
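The following Python sketch illustrates, on synthetic placeholder data and with illustrative feature names (which are assumptions, not part of the disclosure), how a simple first prediction model M1 (here a logistic regression from scikit-learn) can be trained on the pair (T1, S1) and how the contribution of its input parameters can be recovered, either directly from its coefficients or with the model-agnostic Permutation Feature Importance technique.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.inspection import permutation_importance

# Illustrative feature matrix for the end points of T1: one row per end point,
# columns such as the current bandwidth allocation and aggregated traffic history.
feature_names = ["allocated_bw_mbps", "mean_daily_gb", "peak_hour_gb", "nb_wifi_devices"]
X = np.random.rand(200, len(feature_names))   # placeholder standing in for real telemetry
y = np.random.randint(0, 2, size=200)         # 1 = end point appears in the success list S1

model_m1 = LogisticRegression(max_iter=1000).fit(X, y)

# Simple model: coefficients are directly interpretable.
for name, coef in zip(feature_names, model_m1.coef_[0]):
    print(f"{name}: {coef:+.3f}")

# Model-agnostic alternative (also usable with random forests or neural networks).
pfi = permutation_importance(model_m1, X, y, n_repeats=10, random_state=0)
for name, imp in zip(feature_names, pfi.importances_mean):
    print(f"{name}: permutation importance {imp:.3f}")
```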
This model M1 can then be used for a next short time trial as an alternative or complement to the educated guess algorithm(s).
So, at step 13, the model M1 is implemented on the first reservoir dataset R1 comprising for each first potential candidate end point at least:
Thus, a first predicted list P1 is determined, identifying first potential candidate end points belonging to the fixed communications network. The first predicted list P1 identifies candidate end points as a subset of the first potential candidate end points comprised in the first reservoir dataset R1.
Then, at step 14, it is possible to define a first candidate list C1 identifying first candidate end points to be allocated a bandwidth increment, wherein the first candidate end points belong to the fixed communications network.
Thus, the first candidate list C1 includes part or all of the first predicted list P1.
This first candidate list C1 can be used by the network operator to identify the end points of the network where a higher bandwidth is or would be necessary.
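A minimal sketch of step 13 and of the derivation of the first predicted list P1 is given below. The reservoir representation (a list of dictionaries keyed by 'id', 'branch_id' and the feature names) and the per-branch top-n selection rule are assumptions made for illustration; any trained model exposing predicted success probabilities could be used.

```python
import numpy as np
from collections import defaultdict

def predict_list(model, reservoir, feature_names, n_per_branch):
    """Sketch: score every potential candidate end point of the reservoir R1
    with the trained model M1 and keep, per branch, the n end points with the
    highest predicted probability of long-term success.
    `reservoir` is assumed to be a list of dicts with illustrative keys
    'id', 'branch_id' and one entry per feature name."""
    X = np.array([[ep[f] for f in feature_names] for ep in reservoir])
    proba = model.predict_proba(X)[:, 1]          # estimated P(success) per end point

    by_branch = defaultdict(list)
    for ep, p in zip(reservoir, proba):
        by_branch[ep["branch_id"]].append((p, ep["id"]))

    predicted = []
    for branch, scored in by_branch.items():
        scored.sort(key=lambda t: t[0], reverse=True)   # highest probability first
        predicted.extend(ep_id for _, ep_id in scored[:n_per_branch])
    return predicted                               # plays the role of P1
```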
In an alternative embodiment illustrated at
The first prediction model M1 is trained as explained above. In addition, at step 21, another first prediction model M1′ is trained for predicting the first training success list S1 from the first training dataset T1.
The another first prediction model M1′ is a model of first generation, as it is trained with the same first training dataset T1 and the same first success list S1 as the first prediction model M1. However, the another first prediction model M1′ is different from the first prediction model M1. For example, it can be a different type of model among the following:
For example, the model M1 is a logistic regression based model and the model M1′ is a naïve Bayes based model.
It is also possible that the two prediction models M1 and M1′ are of the same type but with a different architecture.
The two prediction models M1 and M1′ are prediction models of first generation since they are trained with the first training dataset T1 and the first success list S1.
Any number of different prediction models of first generation can be trained.
Then, at step 22, in addition to step 13 relative to the implementation of the first prediction model M1, the another first prediction model M1′ is implemented on the first reservoir dataset R1 to determine another first predicted list P1′. This another first predicted list P1′ identifies candidate end points as a subset of the first potential candidate end points.
Then, the first candidate list C1 is selected with part of both the first predicted list P1 and the another first predicted list P1′ as illustrated at
In an embodiment illustrated at
These candidate end points O1 can be selected outside the first predicted list P1 as a function of at least a position of the first candidate end point in the fixed communications network, a bandwidth allocation of the first candidate end point, and a traffic consumption history of the first candidate end point and/or randomly selected outside the first predicted list.
For example, the candidate end points O1 selected outside the first predicted list P1 can be selected based on an educated-guess algorithm or randomly selected in the fixed communication network.
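The following sketch illustrates one possible way of composing the first candidate list C1 from one or more predicted lists (such as P1 and P1′) and from exploration end points O1 picked at random outside those lists. The weight values and the helper name are illustrative assumptions only; the actual proportions would be set as described above.

```python
import random

def compose_candidate_list(predicted_lists, reservoir_ids, weights, list_size, seed=0):
    """Sketch: build a candidate list (e.g. C1) by drawing a weighted share of end
    points from each predicted list (e.g. P1, P1') and filling the rest with
    exploration end points O1 picked at random outside those lists.
    `weights` are the shares given to the predicted lists, e.g. [0.4, 0.4] with
    the remaining 0.2 left for random exploration (illustrative values)."""
    rng = random.Random(seed)
    candidates, used = [], set()

    for plist, w in zip(predicted_lists, weights):
        take = int(list_size * w)
        picked = [ep for ep in plist if ep not in used][:take]
        candidates.extend(picked)
        used.update(picked)

    # Exploration share O1: end points outside every predicted list.
    outside = [ep for ep in reservoir_ids if ep not in used]
    rng.shuffle(outside)
    candidates.extend(outside[: list_size - len(candidates)])
    return candidates
```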
After the second short time trial period, some new supervised learning models M2 can be trained, or the existing ones (M1 or M1′) can be retrained to improve the prediction accuracy. As the time passes, training data will increase, and it is expected that the supervised learning models become better and better.
In other words, it is also possible to train a second prediction model based on the result of the first prediction model M1 (or even the result of both first prediction model M1 and another first prediction model M1′) i.e., based on the candidate list C1.
To that end, at step 31 of
At
Thus, after a second short time trial, at step 32, a second success list S2 identifying successful second candidate end points to be allocated a bandwidth increment for a long time period is provided.
The successful first candidate end points are a subset of the second candidate end points.
Then, at step 33, a second prediction model M2 is trained for predicting the second success list S2 from the second training dataset T2.
The second prediction model M2 can be the first prediction model re-trained with the second training dataset T2 and the second success list S2. In other words, performing supervised training of the second success prediction model M2 can comprise performing supervised re-training of the first success prediction model M1 for predicting the second success list S2 from the second training dataset T2. In this case, the second success prediction model M2 is the re-trained first success prediction model M1.
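A minimal sketch of this re-training variant is given below, using placeholder data in place of the real (T1, S1) and (T2, S2) datasets. Whether the data of the previous trial is kept or dropped when re-fitting is a design choice that the disclosure does not impose; stacking both datasets is only one assumed option.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Placeholder data standing in for (T1, S1) and (T2, S2); in practice these are
# the telemetry features and success labels collected after each trial period.
X1, y1 = np.random.rand(200, 4), np.random.randint(0, 2, size=200)
X2, y2 = np.random.rand(150, 4), np.random.randint(0, 2, size=150)

model_m1 = LogisticRegression(max_iter=1000).fit(X1, y1)

# Re-training variant: M2 is the same model fitted again, here on the stacked
# data of both trials (keeping or dropping the older data is a design choice).
model_m2 = model_m1.fit(np.vstack([X1, X2]), np.concatenate([y1, y2]))
```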
It is possible to combine the second training dataset T2 and the second success list S2 with any other information that the operator can collect about the end points, either from the network telemetry (traffic consumption over time, number and types of Ethernet and Wi-Fi devices connected to the home gateway) or from any other sources, within the limits of the applicable data privacy regulation (address, professional activity, family composition, social media activity, . . . ). For example, at least one of the following inputs can be used to perform the supervised training of the second success prediction model M2:
This new second prediction model M2 can be used to determine a second predicted list P2 when implemented on a second reservoir dataset R2 comprising, for each second potential candidate end point, a bandwidth allocation of the second potential candidate end point and a traffic consumption history of the second potential candidate end point.
At step 34, the second success prediction model M2 is implemented on the second reservoir dataset R2 identifying second potential candidate end points belonging to the fixed communications network.
Thus, the second predicted list P2 identifies candidate end points as a subset of the second potential candidate end points.
At step 35, a second candidate list C2 identifying second candidate end points to be allocated a bandwidth increment, wherein the second candidate end points belong to the fixed communications network, is selected.
The second candidate list C2 includes part or all of the second predicted list P2.
This second candidate list C2 can be used by the network operator to identify the end points of the network where a higher bandwidth is or would be necessary.
As this second candidate list C2 comprises part or all of the second predicted list P2, it is likely to be more accurate than the first candidate list C1 as the second prediction model M2 is a second generation model. Indeed, it is trained using the prediction of the first generation model M1 and/or M1′.
In an analogous way, any number of prediction models of second generation can be trained. In other words, any number of prediction models can be trained for predicting the second success list S2 from the second training dataset T2.
For example, at step 41 of
Then, at step 42, the another second success prediction model M2′ is implemented on the second reservoir dataset R2 identifying second potential candidate end points belonging to the fixed communications network, to determine another second predicted list P2′.
Thus, the another second predicted list P2′ identifies candidate end points as a subset of the second potential candidate end points, just as the second predicted list P2 does.
Then, it is possible to select the second candidate list C2 as including part of the second predicted list P2 and part of the another second predicted list P2′.
It is possible to evaluate the accuracy of the different methods of determination of the end points to be part of a candidate list C. As explained above, the different methods of determination of the end points can be:
Thus, to evaluate the method of determination consisting in implementing the first prediction model, it is possible to determine a success rate of the first predicted list P1 as a function of the second success list S2 and to determine a success rate of the first candidate end points O1 outside the first predicted list P1 as a function of the second success list S2.
Basically, one way to do it can be to compare the number of end points in the second success list S2 coming from the first predicted list P1 and the number of end points in the second success list S2 coming from the first candidate end points O1 outside the first predicted list.
Then, it is possible to increase or decrease a proportion of the end points determined with one method of determination or another according to their corresponding success rate.
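A minimal sketch of such a success-rate (hit-rate) computation is given below; the variable names used in the usage comment are illustrative assumptions only.

```python
def success_rate(candidate_ids, success_ids):
    """Hit rate of one determination method: share of the end points it put
    on trial that appear in the success list observed after the trial (e.g. S2)."""
    candidate_ids = list(candidate_ids)
    success_set = set(success_ids)
    if not candidate_ids:
        return 0.0
    return sum(1 for ep in candidate_ids if ep in success_set) / len(candidate_ids)

# Usage sketch with illustrative identifiers:
# rate_p1 = success_rate(p1_ids, s2_ids)   # end points coming from the predicted list P1
# rate_o1 = success_rate(o1_ids, s2_ids)   # exploration end points O1 outside P1
```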
An example is depicted at
The first candidate list C1 used to build the second training dataset T2 comprises part of end points of the first predicted list P1 and end points O1 outside the first predicted list P1.
As an example, the proportion of the end points of the predicted list P1 in the candidate list C1 is 50%.
Then, to train a third prediction model M3, the proportion of the end points of the predicted list P2 in the candidate list C2 used to build the third training dataset T3 has increased to 75% because a success rate of the first predicted list P1 is better than a success rate of the first candidate end points O1 outside the first predicted list P1.
If the success rate of the first candidate end points O1 outside the first predicted list P1 had been better than the success rate of the first predicted list P1, the proportion of the first candidate end points O1 outside the first predicted list P1 in the second candidate list would have increased (not illustrated).
This iterative allocation between different models and algorithms is performed in order to maximize a certain reward, here the accuracy of the prediction. Different strategies have been proposed to solve this problem and are usually very simple to implement. Two of those strategies are mentioned below but many refinements are available in the literature and applicable.
In the epsilon-greedy strategy, the best method of determination of the end points is selected with an allocation of 1-Epsilon. The rest is uniformly distributed among the random sampling methods and the other educated guess algorithms and models to be evaluated. A typical value for epsilon is 0.1.
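A minimal sketch of the epsilon-greedy allocation is given below, assuming the observed success rates of the different methods of determination are available; the method names in the example are illustrative only.

```python
def epsilon_greedy_allocation(success_rates, epsilon=0.1):
    """Sketch of the epsilon-greedy strategy: the method with the best observed
    success rate gets a share of 1 - epsilon; the remaining epsilon is spread
    uniformly over the other methods (including random sampling).
    `success_rates` maps a method name to its last observed hit rate."""
    best = max(success_rates, key=success_rates.get)
    others = [m for m in success_rates if m != best]
    allocation = {best: 1.0 - epsilon if others else 1.0}
    for m in others:
        allocation[m] = epsilon / len(others)
    return allocation

# Example: {'model_M1': 0.9, 'educated_guess': 0.05, 'random': 0.05}
print(epsilon_greedy_allocation({"model_M1": 0.6, "educated_guess": 0.4, "random": 0.2}))
```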
In the probability matching strategy, each method of determination receives an allocation which is proportional to its probability of being the best method of determination of end points. This probability can be calculated using the Bayes theorem, knowing the results of each model during the previous short time trial periods.
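A minimal sketch of the probability matching strategy is given below. It estimates, by Monte Carlo sampling from a Beta posterior built from the observed successes and failures, the probability of each method being the best one; this is only one assumed way of applying the Bayes theorem mentioned above.

```python
import random

def probability_matching_allocation(results, n_samples=10000, seed=0):
    """Sketch of probability matching: each method's allocation is proportional
    to its estimated probability of being the best method of determination.
    `results` maps a method name to a (successes, failures) pair observed
    during the previous short time trial periods."""
    rng = random.Random(seed)
    wins = {m: 0 for m in results}
    for _ in range(n_samples):
        draws = {m: rng.betavariate(s + 1, f + 1) for m, (s, f) in results.items()}
        wins[max(draws, key=draws.get)] += 1
    return {m: w / n_samples for m, w in wins.items()}

# Example with illustrative counts observed after a trial period:
print(probability_matching_allocation({"model_M1": (30, 10), "random": (8, 32)}))
```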
Whatever the chosen strategy, the fundamental concept is to find the best trade-off between exploitation and exploration. On one hand, one will want to keep exploiting the best method of determination to the maximum. However, the risk of seeing the performance of this model degrade over time is significant. Therefore, it is preferable to continuously “explore” the entire reservoir of end points, to detect new trends as soon as they appear in the network. That is why a minimum allocation to (at least) the random sampling method is preferably maintained.
Then, based on the third training dataset T3 and a third success list S3 provided after a third short time trial (step 61), it is possible to train the third prediction model M3 for predicting the third success list S3 from the third training dataset T3 (step 62).
It is possible to combine the third training dataset T3 and the third success list S3 with any other information that the operator can collect about the end points, either from the network telemetry (traffic consumption over time, number and types of Ethernet and Wi-Fi devices connected to the home gateway) or from any other sources, within the limits of the applicable data privacy regulation (address, professional activity, family composition, social media activity, . . . ). For example, at least one of the following inputs can be used to perform the supervised training of the third success prediction model M3:
Similarly to the prediction model M2, the third prediction model M3 can be the second prediction model M2 re-trained with the third training dataset T3 and the third success list S3. In other words, performing supervised training of a third success prediction model can comprise:
Then, at step 63, the third prediction model M3 can be implemented on a third reservoir dataset R3 (comprising for each third potential candidate end point at least a bandwidth allocation of the third potential candidate end point and a traffic consumption history of the third potential candidate end point) to determine a third predicted list P3 identifying third potential candidate end points belonging to the fixed communications network, wherein the third predicted list identifies candidate end points as a subset of the third potential candidate end points.
Similarly to the reservoir datasets R1 and R2, the third reservoir dataset R3 comprises for each third potential candidate end point at least a bandwidth allocation of the third potential candidate end point and a traffic consumption history of the third potential candidate end point.
Then, at step 64, a third candidate list C3 identifying third candidate end points to be allocated a bandwidth increment is selected or defined, wherein the third candidate end points belong to the fixed communications network.
The third candidate list C3 includes part or all of the third predicted list P3.
It is possible to evaluate the accuracy of two different prediction models M and then adjust the proportion of the corresponding predicted lists P in a new candidate list C.
For example, at
A first candidate list C1 comprising part of both predicted list P1 and P1′ has been selected and then used to provide a second training dataset T2.
As an example, the predicted list P1 and the another predicted list P1′ both have a proportion of 50% in the first candidate list C1.
This second training dataset T2 has been used with a second success list S2 to train two prediction models of second generation: the second prediction model M2 and the another second prediction model M2′.
Those two prediction models M2 and M2′ have been implemented to obtain the second predicted list P2 and the another second predicted list P2′ as explained above. Preferably, the second prediction model M2 is the re-trained first prediction model M1 and the another second prediction model M2′ is the re-trained another first prediction model M1′.
Here, for example, a success rate of the first predicted list P1 is greater than a success rate of the another first predicted list P1′. So, the second candidate list C2 is selected so that a proportion of the second predicted list P2 within the second candidate list C2 is higher than a proportion of the another second predicted list P2′ within the second candidate list C2.
Here, for example, at step 71, the proportion of the second predicted list P2 within the second candidate list C2 is 75% whereas the proportion of the another second predicted list P2′ within the second candidate list C2 is 25%.
In the example of
The memory 1160 stores computer program instructions 1120 which when loaded into the processor 1110 control the operation of the apparatus 1200 according to the different embodiments explained above. In other examples, the apparatus 1200 may comprise more than one memory 1160 or different kinds of storage devices.
Computer program instructions 1120 for enabling implementations of example embodiments of the invention or a part of such computer program instructions may be loaded onto the apparatus 1200 by the manufacturer of the apparatus 1200, by a user of the apparatus 1200, or by the apparatus 1200 itself based on a download program, or the instructions can be pushed to the apparatus 1200 by an external device. The computer program instructions may arrive at the apparatus 1200 via an electromagnetic carrier signal or be copied from a physical entity such as a computer program product, a memory device or a record medium such as a Compact Disc (CD), a Compact Disc Read-Only Memory (CD-ROM), a Digital Versatile Disc (DVD) or a Blu-ray disc.
According to an example embodiment, the apparatus 1200 comprises means, wherein the means comprises at least one processor 1110, at least one memory 1160 including computer program code 1120, the at least one memory 1160 and the computer program code 1120 configured to, with the at least one processor 1110, cause the performance of the apparatus 1200.
Embodiments of the present invention may be implemented in software, hardware, application logic or a combination of software, hardware and application logic. The software, application logic and/or hardware may reside on the apparatus, a separate device or a plurality of devices. If desired, part of the software, application logic and/or hardware may reside on the apparatus, part of the software, application logic and/or hardware may reside on a separate device, and part of the software, application logic and/or hardware may reside on a plurality of devices. In an example embodiment, the application logic, software or an instruction set is maintained on any one of various conventional computer-readable media. In the context of this document, a ‘computer-readable medium’ may be any media or means that can contain, store, communicate, propagate or transport the instructions for use by or in connection with an instruction execution system, apparatus, or device, such as a computer, with an example of a computer described and depicted in
If desired, the different functions discussed herein may be performed in a different order and/or concurrently with each other. Furthermore, if desired, one or more of the above-described functions may be optional or may be combined.
Although various aspects of the invention are set out in the independent claims, other aspects of the invention comprise other combinations of features from the described embodiments and/or the dependent claims with the features of the independent claims, and not solely the combinations explicitly set out in the claims.
It will be obvious to a person skilled in the art that, as the technology advances, the inventive concept can be implemented in various ways. The invention and its embodiments are not limited to the examples described above but may vary within the scope of the claims.