USE CASES
RL, thanks to its versatility, finds applications in numerous scenarios, even very different from each other. Some examples of interest in the telco world are:
• Solving "classic" Machine Learning tasks: the problem is restructured as a decision-making task. An example is time series forecasting. The agent learns to predict future values (action) based on past observations (environment) and an evaluation metric, such as prediction accuracy (reward).
• Routing optimization: based on a metric of interest to be optimized (e.g., bandwidth, delay, etc.), the agent, when a communication request occurs between two nodes, chooses the optimal path to optimize the metric in the long term. This means that optimization will also consider possible future communications following a proactive approach.