ebook img

Alarms, KPIs, and Measurements PDF

585 Pages·2013·2.36 MB·English
by  
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Alarms, KPIs, and Measurements

® EAGLE XG Diameter Signaling Router Alarms, KPIs, and Measurements Reference Guide 910-6827-001 Revision A October 2013 Copyright 2013 Tekelec. All Rights Reserved. Printed in USA. Legal Information can be accessed from the Main Menu of the optical disc or on the Tekelec Customer Support web site in the Legal Information folder of the Product Support tab. Table of Contents Chapter 1: Introduction...............................................................................32 Overview...............................................................................................................................................33 Scope and Audience............................................................................................................................33 Manual Organization...........................................................................................................................33 Documentation Admonishments.......................................................................................................34 Related Publications............................................................................................................................34 Customer Care Center.........................................................................................................................35 Emergency Response...........................................................................................................................37 Locate Product Documentation on the Customer Support Site....................................................38 Chapter 2: Alarms and Events, KPIs, and Measurements Overview....................................................................................................39 Displaying the file list..........................................................................................................................40 Data Export...........................................................................................................................................40 Data Export elements...............................................................................................................40 Configuring data export .........................................................................................................42 Tasks.......................................................................................................................................................43 Active Tasks..............................................................................................................................43 Scheduled Tasks.......................................................................................................................46 Chapter 3: Alarms and Events...................................................................49 General alarms and events information...........................................................................................50 Alarms and events overview..................................................................................................50 Alarm and event ID ranges ....................................................................................................52 Alarm and event types............................................................................................................52 Viewing active alarms.............................................................................................................54 Active alarms data export elements .....................................................................................54 Exporting active alarms...........................................................................................................55 Generating a report of active alarms.....................................................................................56 Viewing alarm and event history..........................................................................................57 Historical events data export elements ................................................................................57 Exporting alarm and event history........................................................................................58 Generating a report of historical alarms and events...........................................................59 910-6827-001 Revision A, October 2013 ii IP Front End, IPFE (5000-5999)...........................................................................................................60 5001 - IPFE Backend Unavailable..........................................................................................60 5002 - IPFE address configuration error...............................................................................60 5003 - IPFE state sync run error..............................................................................................61 5005 - IPFE Backend In Stasis.................................................................................................62 5007 - Out of Balance: Low.....................................................................................................62 5008 - Out of Balance: High....................................................................................................62 5009 - No available servers in target set................................................................................63 5010 - Unknown Linux iptables command error ................................................................64 5011 - System or platform error prohibiting operation......................................................64 5012 - Signaling interface heartbeat timeout........................................................................65 5013 - Throttling traffic............................................................................................................65 5100 - Traffic overload.............................................................................................................66 OAM (10000-10999)..............................................................................................................................66 Alarms formatting information..............................................................................................66 10000 - Incompatible database version.................................................................................67 10001 - Database backup started............................................................................................67 10002 - Database backup completed.....................................................................................67 10003 - Database backup failed..............................................................................................67 10004 - Database restoration started......................................................................................68 10005 - Database restoration completed...............................................................................68 10006 - Database restoration failed........................................................................................68 10008 - Database provisioning manually disabled .............................................................69 10009 - Config and Prov db not yet synchronized .............................................................69 10010 - Stateful db from mate not yet synchronized...........................................................69 10011 - Cannot monitor table..................................................................................................70 10012 - Table change responder failed .................................................................................70 10013 - Application restart in progress ................................................................................70 10020 - Backup failure .............................................................................................................71 10074 - Standby server degraded while mate server stabilizes.........................................71 10075 - Application processes have been manually stopped............................................72 10078 - Application not restarted on standby server due to disabled failure cleanup mode ....................................................................................................................................72 10100 - Log export started.......................................................................................................72 10101 - Log export successful.................................................................................................73 10102 - Log export failed.........................................................................................................73 10103 - Log export already in progress.................................................................................73 10104 - Log export file transfer failed....................................................................................74 10105 - Log export cancelled - user request..........................................................................74 10106 - Log export cancelled - duplicate request.................................................................74 10107 - Log export cancelled - queue full.............................................................................75 910-6827-001 Revision A, October 2013 iii 10108 - Duplicate scheduled log export task........................................................................75 10109 - Log export queue is full.............................................................................................76 10151 - Login successful..........................................................................................................76 10152 - Login failed..................................................................................................................76 10153 - Logout successful........................................................................................................77 10154 - User Account Disabled..............................................................................................77 10200 - Remote database reinitialization in progress..........................................................77 Session Binding Repository, SBR (12000-12999)..............................................................................78 12003 - SBR Congestion State.................................................................................................78 12007 - SBR Active Sess Binding Threshold.........................................................................79 12010 - SBR Proc Term.............................................................................................................79 Communication Agent, ComAgent (19800-19909)..........................................................................80 19800 - Communication Agent Connection Down.............................................................80 19801 - Communication Agent Connection Locally Blocked............................................81 19802 - Communication Agent Connection Remotely Blocked........................................82 19803 - Communication Agent stack event queue utilization...........................................82 19804 - Communication Agent configured connection waiting for remote client to establish connection...........................................................................................................83 19805 - Communication Agent Failed To Align Connection.............................................84 19806 - Communication Agent CommMessage mempool utilization.............................85 19807 - Communication Agent User Data FIFO Queue utilization..................................86 19808 - Communication Agent Connection FIFO Queue utilization...............................86 19810 - Communication Agent Egress Message Discarded...............................................87 19811 - Communication Agent Ingress Message Discarded..............................................88 19814 - Communication Agent Peer has not responded to heartbeat..............................88 19816 - Communication Agent Connection State Changed...............................................89 19817 - Communication Agent DB Responder detected a change in configurable control option parameter...................................................................................................89 19820 - Communication Agent Routed Service Unavailable.............................................89 19821 - Communication Agent Routed Service Degraded................................................90 19822 - Communication Agent Routed Service Congested...............................................91 19823 - Communication Agent Routed Service Using Low-Priority Connection Group...................................................................................................................................91 19824 - Communication Agent Pending Transaction Utilization.....................................92 19825 - Communication Agent Transaction Failure Rate...................................................92 19826 - Communication Agent Connection Congested......................................................93 19830 - Communication Agent Service Registration State Change..................................94 19831 - Communication Agent Service Operational State Changed................................94 19832 - Communication Agent Reliable Transaction Failed..............................................94 19833 - Communication Agent Service Egress Message Discarded.................................95 19842 - Communication Agent Resource-Provider Registered.........................................95 910-6827-001 Revision A, October 2013 iv 19843 - Communication Agent Resource-Provider Resource State Changed.................96 19844 - Communication Agent Resource-Provider Stale Status Received......................96 19845 - Communication Agent Resource-Provider Deregistered.....................................96 19846 - Communication Agent Resource Degraded...........................................................96 19847 - Communication Agent Resource Unavailable.......................................................97 19848 - Communication Agent Resource Error...................................................................98 19850 - Communication Agent Resource-User Registered................................................98 19851 - Communication Agent Resource-User Deregistered............................................98 19852 - Communication Agent Resource Routing State Changed....................................99 19853 - Communication Agent Resource Egress Message Discarded..............................99 19854 - Communication Agent Resource-Provider Tracking Table Audit Results ...............................................................................................................................................99 19855 - Communication Agent Resource Has Multiple Actives.....................................100 19857 - Communication Agent Service Provider Operational State Changed..............100 19860 - Communication Agent Configuration Daemon Table Monitoring Failure.....100 19861 - Communication Agent Configuration Daemon Script Failure..........................101 19862 - Communication Agent Service Provider Registration State Changed.............102 19900 - Process CPU Utilization...........................................................................................102 19901 - CFG-DB Validation Error........................................................................................102 19902 - CFG-DB Update Failure...........................................................................................103 19903 - CFG-DB post-update Error......................................................................................103 19904 - CFG-DB post-update Failure...................................................................................104 19905 - Measurement Initialization Failure........................................................................104 Diameter Signaling Router (DSR) Diagnostics (19910-19999).....................................................105 19910 - Message Discarded at Test Connection.................................................................105 19911 - Test message discarded ...........................................................................................105 Diameter Signaling Router, DSR (22000-22999).............................................................................106 Diameter Alarms and Events...............................................................................................106 Range Based Address Resolution (RBAR) Alarms and Events.......................................142 Application Alarms and Events...........................................................................................145 Full Address Based Resolution (FABR) Alarms and Events...........................................150 Policy DRA (PDRA) Alarms and Events............................................................................155 Policy SBR (pSBR) Alarms and Events...............................................................................160 Charging Proxy Application (CPA) Alarms and Events..................................................165 Tekelec Virtual Operating Environment, TVOE (24400-24499)..................................................170 24400 - TVOE libvirtd is down ............................................................................................170 24401 - TVOE libvirtd is hung .............................................................................................170 24402 - all TVOE libvirtd connections are in use ..............................................................171 Computer Aided Policy Making, CAPM (25000-25499)...............................................................171 25000 - Rule Template failed to be updated.......................................................................171 25001 - Action failed within the Rule Template ...............................................................172 910-6827-001 Revision A, October 2013 v 25002 - Stop Rule Template processing after action failure.............................................172 25003 - Exit Trigger point after action failure....................................................................172 OAM Alarm Management (25500-25899).......................................................................................173 25500 - No DA-MP Leader Detected Alarm.......................................................................173 25510 - Multiple DA-MP Leader Detected Alarm.............................................................173 Platform (31000-32700)......................................................................................................................173 Alarms formatting information............................................................................................174 31000 - S/W fault....................................................................................................................174 31001 - S/W status.................................................................................................................174 31002 - Process watchdog failure.........................................................................................174 31003 - Tab thread watchdog failure...................................................................................175 31100 - Database replication fault........................................................................................175 31101 - Database replication to slave failure......................................................................175 31102 - Database replication from master failure..............................................................175 31103- DB Replication update fault.....................................................................................176 31104 - DB Replication latency over threshold..................................................................176 31105 - Database merge fault................................................................................................176 31106 - Database merge to parent failure...........................................................................177 31107 - Database merge from child failure.........................................................................177 31108 - Database merge latency over threshold................................................................177 31109 - Topology config error...............................................................................................177 31110 - Database audit fault..................................................................................................178 31111 - Database merge audit in progress..........................................................................178 31112 - Stateful db synchronization from mate server ....................................................178 31113 - DB replication manually disabled..........................................................................178 31114 - DB replication over SOAP has failed.....................................................................179 31115 - Database service fault...............................................................................................179 31116 - Excessive shared memory........................................................................................179 31117 - Low disk free.............................................................................................................179 31118 - Database disk store fault..........................................................................................180 31119 - Database updatelog overrun...................................................................................180 31120 - Database updatelog write fault...............................................................................180 31121 - Low disk free early warning...................................................................................180 31122 - Excessive shared memory early warning..............................................................181 31123 - Database replication audit command complete...................................................181 31124 - Database replication audit command error..........................................................181 31125 - Database durability degraded.................................................................................181 31126- Audit blocked.............................................................................................................182 31127 - DB Replication Audit Complete.............................................................................182 31130 - Network health warning..........................................................................................182 31140 - Database perl fault....................................................................................................183 910-6827-001 Revision A, October 2013 vi 31145 - Database SQL fault...................................................................................................183 31146- DB mastership fault...................................................................................................183 31147- DB upsynclog overrun..............................................................................................183 31148- DB lock error detected...............................................................................................184 31200 - Process management fault.......................................................................................184 31201 - Process not running..................................................................................................184 31202 - Unkillable zombie process.......................................................................................184 31206 - Process mgmt monitoring fault..............................................................................185 31207 - Process resource monitoring fault..........................................................................185 31208 - IP port server fault....................................................................................................185 31209 - Hostname lookup failed..........................................................................................185 31213 - Process scheduler fault.............................................................................................186 31214 - Scheduled process fault...........................................................................................186 31215 - Process resources exceeded.....................................................................................186 31216 - SysMetric configuration error.................................................................................186 31220 - HA configuration monitor fault.............................................................................187 31221 - HA alarm monitor fault...........................................................................................187 31222 - HA not configured....................................................................................................187 31223 - HA Heartbeat transmit failure................................................................................187 31224 - HA configuration error............................................................................................188 31225 - HA service start failure............................................................................................188 31226 - HA availability status degraded.............................................................................188 31227 - HA availability status failed....................................................................................188 31228 - HA standby offline...................................................................................................189 31229 - HA score changed.....................................................................................................189 31230 - Recent alarm processing fault.................................................................................189 31231 - Platform alarm agent fault.......................................................................................190 31232- Late heartbeat warning.............................................................................................190 31233 - HA Secondary Path Down......................................................................................190 31240 - Measurements collection fault................................................................................190 31250 - RE port mapping fault..............................................................................................191 31260 - Database SNMP Agent.............................................................................................191 31270 - Logging output..........................................................................................................191 31280 - HA Active to Standby transition............................................................................191 31281 - HA Standby to Active transition............................................................................192 31282- HA Management Fault.............................................................................................192 31283- HA Server Offline......................................................................................................192 31284 - HA Remote Subscriber Heartbeat Warning..........................................................193 31290- HA Process Status......................................................................................................193 31291- HA Election Status.....................................................................................................193 31292- HA Policy Status........................................................................................................193 910-6827-001 Revision A, October 2013 vii 31293- HA Resource Link Status..........................................................................................194 31294- HA Resource Status...................................................................................................194 31295- HA Action Status.......................................................................................................194 31296- HA Monitor Status.....................................................................................................194 31297- HA Resource Agent Info...........................................................................................195 31298- HA Resource Agent Detail.......................................................................................195 31299 - HA Notification Status.............................................................................................195 31300 - HA Control Status.....................................................................................................195 32113 - Uncorrectable ECC memory error..........................................................................196 32114 - SNMP get failure.......................................................................................................196 32115 - TPD NTP Daemon Not Synchronized Failure......................................................196 32116 - TPD Server's Time Has Gone Backwards.............................................................197 32117 - TPD NTP Offset Check Failure...............................................................................197 32300 – Server fan failure......................................................................................................197 32301 - Server internal disk error.........................................................................................198 32302 – Server RAID disk error............................................................................................198 32303 - Server Platform error................................................................................................198 32304 - Server file system error............................................................................................198 32305 - Server Platform process error.................................................................................199 32307 - Server swap space shortage failure........................................................................199 32308 - Server provisioning network error.........................................................................199 32312 - Server disk space shortage error.............................................................................200 32313 - Server default route network error........................................................................200 32314 - Server temperature error.........................................................................................200 32315 – Server mainboard voltage error.............................................................................201 32316 – Server power feed error..........................................................................................201 32317 - Server disk health test error....................................................................................202 32318 - Server disk unavailable error..................................................................................202 32319 – Device error...............................................................................................................202 32320 – Device interface error..............................................................................................202 32321 – Correctable ECC memory error.............................................................................203 32322 – Power Supply A error..............................................................................................203 32323 – Power Supply B error..............................................................................................203 32324 – Breaker panel feed error..........................................................................................203 32325 – Breaker panel breaker error....................................................................................204 32326 – Breaker panel monitoring error.............................................................................207 32327 – Server HA Keepalive error.....................................................................................207 32331 – HP disk problem......................................................................................................208 32332 – HP Smart Array controller problem......................................................................208 32333 – HP hpacucliStatus utility problem........................................................................208 32334 - Multipath device access link problem...................................................................208 910-6827-001 Revision A, October 2013 viii 32335 - Switch link down error............................................................................................209 32336– Half Open TCP Socket Limit...................................................................................209 32337 - E5-APP-B Firmware Flash.......................................................................................209 32338 - E5-APP-B Serial mezzanine seating.......................................................................210 32339 - TPD Max Number Of Running Processes Error..................................................210 32340 - TPD NTP Daemon Not Synchronized Error.........................................................210 32341 - TPD NTP Daemon Never Synchronized Error....................................................210 32342 - TPD NTP Offset Check Error..................................................................................211 32343 - TPD RAID disk problem..........................................................................................211 32344 - TPD RAID controller problem................................................................................211 32403 – PM&C backup failed................................................................................................212 32500 – Server disk space shortage warning......................................................................212 32501 – Server application process error............................................................................212 32502 – Server hardware configuration error....................................................................213 32503 – Server RAM shortage warning...............................................................................213 32505 – Server swap space shortage warning....................................................................213 32506 – Server default router not defined..........................................................................213 32507 – Server temperature warning...................................................................................214 32508 – Server core file detected..........................................................................................214 32509 – Server NTP Daemon not synchronized................................................................214 32510 – CMOS battery voltage low......................................................................................215 32511 – Server disk self test warning...................................................................................215 32512 – Device warning.........................................................................................................215 32513 – Device interface warning........................................................................................215 32514 – Server reboot watchdog initiated...........................................................................216 32515 – Server HA failover inhibited..................................................................................216 32516 – Server HA Active to Standby transition...............................................................216 32517 – Server HA Standby to Active transition...............................................................217 32518 – Platform Health Check failure................................................................................217 32519 – NTP Offset Check failure........................................................................................217 32520 – NTP Stratum Check failure.....................................................................................217 32521 – SAS Presence Sensor Missing.................................................................................218 32522 – SAS Drive Missing...................................................................................................218 32523 – DRBD failover busy.................................................................................................218 32524 – HP disk resync..........................................................................................................218 32525 – Telco Fan Warning...................................................................................................219 32526 – Telco Temperature Warning...................................................................................219 32527 – Telco Power Supply Warning................................................................................219 32528 – Invalid BIOS value...................................................................................................220 32529– Server Kernel Dump File Detected.........................................................................220 32530– TPD Upgrade Failed.................................................................................................220 910-6827-001 Revision A, October 2013 ix 32531– Half Open Socket Warning Limit...........................................................................220 32532– Server Upgrade Pending Accept/Reject................................................................221 32533 - TPD Max Number of Running Processes Warning.............................................221 32534 - TPD NTP Source Is Bad Warning...........................................................................221 32535 - TPD RAID disk resync.............................................................................................222 32603 – PM&C backup to remote server failed..................................................................222 Chapter 4: Key Performance Indicators (KPIs)....................................223 General KPIs information.................................................................................................................224 KPIs overview.........................................................................................................................224 KPIs..........................................................................................................................................224 Viewing KPIs .........................................................................................................................224 KPIs data export elements ...................................................................................................224 Exporting KPIs........................................................................................................................225 KPIs server elements .........................................................................................................................226 Computer Aided Policy Making (CAPM) KPIs.............................................................................227 Charging Proxy Application (CPA) KPIs.......................................................................................227 Communication Agent (ComAgent) KPIs......................................................................................228 Connection Maintenance KPIs.........................................................................................................228 Diameter (DIAM) KPIs......................................................................................................................228 IP Front End (IPFE) KPIs...................................................................................................................229 Full Address Based Resolution (FABR) KPIs.................................................................................230 Policy Diameter Routing Agent (PDRA) KPIs...............................................................................230 Policy SBR (pSBR) KPIs.....................................................................................................................230 Range Based Address Resolution (RBAR) KPIs............................................................................231 Session Binding Repository (SBR) KPIs..........................................................................................232 Chapter 5: Measurements.........................................................................233 General measurements information................................................................................................235 Measurements.........................................................................................................................235 Measurement elements .........................................................................................................236 Generating a measurements report.....................................................................................236 Measurements data export elements ..................................................................................237 Exporting measurements reports.........................................................................................238 Address Resolution Exception measurements..............................................................................239 RxRbarDecodeFailureResol..................................................................................................241 RxFabrInvalidImsiMcc..........................................................................................................241 RxRbarResolFailAll................................................................................................................241 RxRbarResolFailCmdcode....................................................................................................242 RxRbarResolFailDbFail.........................................................................................................242 910-6827-001 Revision A, October 2013 x

Description:
XG Diameter Signaling. Router. Alarms, KPIs . Chapter 2: Alarms and Events, KPIs, and Measurements. Overview. 5011 - System or platform error prohibiting operation64 . 10078 - Application not restarted on standby server due to disabled failure cleanup mode .
See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.