Search Menu

Intelligent automation for flapping: working towards autonomous networks

The Autonomous Network Journey (ANJ) is Telefónica’s strategic transformation program to evolve towards autonomous networks, combining automation, Artificial Intelligence and advanced data analysis. The goal is to achieve networks capable of detecting, analyzing, deciding and acting autonomously, reducing human intervention and improving operational efficiency and customer experience.

Subscribe to Telefónica’s blog and find out before anyone else.





As part of this journey, one of the key focuses has been to address technical incidents that have an impact on services, such as the flapping of interfaces in network equipment.

The problem: What is flapping and why is it critical?

A router interface is flapped when it is repeatedly turned on and off in a short period of time. This unstable behavior can lead to:

  • Constant interruptions in connectivity, affecting the availability of services and the customer experience.
  • Packet loss, deteriorating the quality of communications.
  • Increased latency, negatively affecting real-time services such as voice or video.
  • Instability in routing, with potential impact on the entire network.
  • High operational load, as it requires continuous attention from technical teams.

At Telefónica España, this scenario was an operational challenge that generated multiple manual actions to restore the service quickly, demanding a lot of time and coordination between the teams.

Understanding the causes

The technical teams of the ANJ program carried out an exhaustive analysis of historical data, research with the industry and internal tests. The main causes of flapping identified were:

  1. Physical problems: faulty cables, loose connectors, or hardware failures.
  2. Configuration errors: Poorly defined parameters or IP conflicts.
  3. Software failures: Errors in the router’s operating system or drivers.
  4. Link problems: interference, desynchronization or incorrect speed negotiation.
  5. Traffic overload, which can lead to instability in the interface.

The solution: The “Shutteador”

Telefónica Spain designed an innovative solution based on advanced data analysis. Using clustering techniques, common patterns of flapping were identified, and a specific algorithm was developed to classify and treat them in a differentiated way.

This is how the “Shutteador” was born, an algorithm that allows:

  • Automatically detect flapping events.
  • Classify them into homogeneous groups according to their pattern of behavior.
  • Execute automatic corrective actions tailored to each type of incident.

This Closed Loop operating model allows action on the network without human intervention, closing the complete cycle: detection → analysis → decision → action.

“The Shutteador is helping us to ensure the optimal quality of the network in the face of complex and intermittent events, helping in the ability to diagnose and solve immediately without manual intervention,” says Juan Luis Vázquez, Connectivity Services Support Manager.

Real impact: 70% reduction in service disruption

Thanks to this solution, Telefónica Spain has been able to:

  • Reduce the impact of flapping on services by 70%.
  • Eliminate the need for manual action, freeing up technicians’ resources and time.
  • Advance to Level 4 of autonomy in this type of case, within the maturity model of the ANJ.

According to Clara Casas, Director of Network Support and Services: “Our main objective is to provide the service in an excellent way. Developments such as the Shutteador minimize the impact on our customers in the event of a breakdown, as well as focusing on the E2E automation strategy of the resolution process.”

Future: scaling the solution

This innovation is the beginning of a wider rollout. Telefónica is already working on extending the Shutteador to the entire IP network, prioritizing areas with the greatest potential for impact. In parallel, virtual and predictive action mechanisms are explored, to further anticipate problems before they affect the service.

The flapping approach is a clear example of how Telefónica applies automation and intelligence within the framework of the ANJ to solve complex problems and ensure a more robust and reliable network.

With solutions such as the Shutteador, the company not only improves operational efficiency, but also takes a firm step towards the autonomous network of the future.

Share it on your social networks


Communication

Contact our communication department or requests additional material.

Exit mobile version