Telecom Fault Management System (NetAICE)
Large telecommunications networks comprise a variety of systems and components: routers, switches, wireless access points, network interface controllers, and modems. A major telco's network contains hundreds of thousands of network elements. The cabling that connects them is long enough to wrap around the Earth several times.
This diagram illustrates the CyberVision NetAICE alarm correlation process and the approximate results at each stage.
The challenge is to efficiently isolate the specific fault that is at the root of the alarm storm. The size and complexity of today's networks make the level of human intervention required for this task prohibitively high. Instead, telecom companies are increasingly turning to powerful Fault Management systems to perform root cause analysis.
For over a decade, CyberVision has been supplying solutions to the telecom industry. CyberVision's NetAICE (Artificial Intelligence Correlation Engine) system takes an innovative approach to alarm correlation and root cause analysis. Built on our Artificial Intelligence-based correlation engine, NetAICE delivers superior root cause analysis, offering the following benefits:
A significant advantage of CyberVision's NetAICE is our "Enhanced Impact Analysis Module" (EIAM). When faults occur, this module calculates the possible consequences and predicts the future state of network elements. EIAM also helps estimate a problem's severity and its location in the topology, and helps plan the steps for resolving it. EIAM is especially useful for identifying possible SLA violations.
The core of an industrial Fault Management system is its correlation engine, which is responsible for associating alarm dependencies and for filtering and sorting out spurious alarms. When it performs ideally, a correlation engine sets the stage for fast and accurate determination of root cause. More typically, a correlation engine generates hundreds of misleading reports that obscure the actual problem. In that case, finding the root cause often requires reviewing every alarm or testing manually.
The majority of existing Fault Management systems on the market use only a few alarm correlation methods, correlating on average only 15% of incoming alarms. The advantage of CyberVision's NetAICE solution is that it combines the four most effective correlation methods, reducing the number of alarms by 70% to 90%, and it achieves these results while processing as many as 100 alarms per second.
The four most effective correlation methods:
Each method is activated on an as-needed basis, depending on the type of alarm, its severity level, its uncertainty level, and so on.
During non-topology correlation, CyberVision's NetAICE compiles a preliminary list of alarms by discarding alarms that are deemed irrelevant or unessential. The remaining alarms are sorted and aggregated according to their parameters and rule sets.
Bayesian Belief Networks are a mathematical technique for representing probabilistic relationships between network faults and their possible sources. The technique takes the relationships between network elements into account to calculate the most probable root cause of a problem.
Finally, the Neural Network approach to alarm correlation is a particularly powerful feature of CyberVision's NetAICE solution. Artificial Intelligence has proven to be essential for managing next-generation networks, especially in situations that involve a near-infinite range of scenarios and a changing network architecture. Neural Networks offer flexibility and can be trained to perform a variety of tasks. When rule-based analysis fails, Neural Networks can identify root cause alarms from incomplete information and can learn new alarm patterns after the network topology is modified.
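As an illustration of the Bayesian approach described above, the following Python sketch ranks candidate root causes for a set of observed alarms. It is not NetAICE's implementation: it assumes the simplest possible belief network (a single fault node with one child node per alarm type, which is equivalent to naive Bayes) and exactly one active fault. The fault names, alarm names, and all probabilities are hypothetical values chosen for the example.

```python
from math import prod

# Hypothetical prior probability of each candidate fault (illustrative values).
PRIORS = {"fiber_cut": 0.01, "router_down": 0.03, "power_loss": 0.005}

# Hypothetical P(alarm is raised | fault) for each alarm type (illustrative values).
LIKELIHOODS = {
    "fiber_cut": {"LOS": 0.95, "LINK_DOWN": 0.90, "NODE_UNREACHABLE": 0.30},
    "router_down": {"LOS": 0.10, "LINK_DOWN": 0.85, "NODE_UNREACHABLE": 0.95},
    "power_loss": {"LOS": 0.60, "LINK_DOWN": 0.90, "NODE_UNREACHABLE": 0.90},
}


def rank_root_causes(observed, priors=PRIORS, likelihoods=LIKELIHOODS):
    """Return (fault, posterior) pairs, most probable first.

    Assumes exactly one fault is active and that alarms are conditionally
    independent given the fault. An alarm type missing from a fault's table
    is treated as never raised by that fault.
    """
    alarm_types = sorted({a for table in likelihoods.values() for a in table})
    scores = {}
    for fault, prior in priors.items():
        table = likelihoods[fault]
        # Raised alarms contribute P(alarm | fault); silent ones 1 - P(alarm | fault).
        scores[fault] = prior * prod(
            table.get(a, 0.0) if a in observed else 1.0 - table.get(a, 0.0)
            for a in alarm_types
        )
    total = sum(scores.values())
    if total == 0:
        return []  # no candidate fault can explain the observation
    ranked = ((fault, score / total) for fault, score in scores.items())
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for fault, posterior in rank_root_causes({"LOS", "LINK_DOWN"}):
        print(f"{fault}: {posterior:.3f}")
```

With LOS and LINK_DOWN raised and no NODE_UNREACHABLE alarm, the sketch ranks fiber_cut first with a posterior of roughly 0.94, because the other two faults would usually make the node unreachable as well. A production belief network would also model dependencies between network elements, which this single-fault simplification leaves out.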