53-1003142-01 27 June 2014 Fabric Watch Administrators Guide Supporting Fabric OS v7.3.
© 2014, Brocade Communications Systems, Inc. All Rights Reserved. Brocade, the B-wing symbol, Brocade Assurance, ADX, AnyIO, DCX, Fabric OS, FastIron, HyperEdge, ICX, MLX, MyBrocade, NetIron, OpenScript, VCS, VDX, and Vyatta are registered trademarks, and The Effortless Network and the On-Demand Data Center are trademarks of Brocade Communications Systems, Inc., in the United States and in other countries. Other brands and product names mentioned may be trademarks of others.
Contents Preface..................................................................................................................................... 7 Document conventions......................................................................................7 Text formatting conventions.................................................................. 7 Command syntax conventions.............................................................. 7 Notes, cautions, and warnings....................................
Fabric Watch Thresholds....................................................................................................... 25 Threshold values...........................................................................................25 In-between buffer values...................................................................25 Threshold triggers......................................................................................... 26 Above event trigger................................................
Performance Monitor class areas....................................................... 55 Performance monitoring setting guidelines......................................... 55 Performance Monitor class default settings........................................ 55 thConfig command.......................................................................................... 56 thConfig command examples..............................................................57 16 Gbps SFP and QSFP monitoring...................
System monitoring using Web Tools...............................................100 Fabric Watch threshold configuration using Web Tools..............................101 Configuring threshold traits............................................................. 101 Configuring threshold alarms.......................................................... 102 Enabling or disabling threshold alarms for individual elements...... 102 Configuring alarms for FRUs using Web Tools.........................................
Preface ● Document conventions......................................................................................................7 ● Brocade resources............................................................................................................ 9 ● Contacting Brocade Technical Support.............................................................................9 ● Document feedback........................................................................................................
Notes, cautions, and warnings Convention Description value In Fibre Channel products, a fixed value provided as input to a command option is printed in plain text, for example, --show WWN. [] Syntax components displayed within square brackets are optional. Default responses to system prompts are enclosed in square brackets. {x|y|z} A choice of required parameters is enclosed in curly brackets separated by vertical bars. You must select one of the options.
Brocade resources Brocade resources Visit the Brocade website to locate related documentation for your product and additional Brocade resources. You can download additional publications supporting your product at www.brocade.com. Select the Brocade Products tab to locate your product, then click the Brocade product name or image to open the individual product page. The user manuals are available in the resources module at the bottom of the page under the Documentation category.
Document feedback • Brocade Supplemental Support augments your existing OEM support contract, providing direct access to Brocade expertise. For more information, contact Brocade or your OEM. • For questions regarding service levels and response times, contact your OEM/Solution Provider. Document feedback To send feedback and report errors in the documentation you can use the feedback form posted with the document or you can e-mail the documentation team.
About This Document ● Supported hardware and software.................................................................................. 11 ● What's new in this document...........................................................................................11 Supported hardware and software Although many different software and hardware configurations are tested and supported by Brocade Communications Systems, Inc. for Fabric OS 7.2.
What's new in this document 12 Fabric Watch Administrators Guide 53-1003142-01
Fabric Watch ● Fabric health................................................................................................................... 13 ● Fabric Watch overview....................................................................................................13 ● Role-based access control..............................................................................................14 ● Fabric Watch licensing...........................................................................................
Role-based access control Fabric Watch provides customizable monitoring thresholds. You can configure Fabric Watch to provide notification before problems arise, such as reporting when network traffic through a port is approaching the bandwidth limit. This information enables you to perform pre-emptive network maintenance, such as trunking or zoning, and avoid potential network failures. Fabric Watch lets you define how often to measure each switch and fabric element and specify notification thresholds.
Alert configuration Alert configuration When Fabric Watch is improperly configured, a large number of error messages can be sent over a short period of time, making it difficult to find those messages that are actually meaningful. If this happens, there are a few simple ways to improve the configuration. When large numbers of unimportant messages are received, examining the source can identify those classes that need to be reconfigured.
Post-processing of messages NOTE For each class area, there are setting guidelines and recommendations for whether you should leave the setting at the default or change the settings. If a change is recommended, the reason for the change and the suggested settings are provided in each of the configuration chapters. The default settings are listed in these chapters as well. Post-processing of messages After you have configured thresholds and alerts, determine to where the messages will be sent.
Security monitoring • EE (end-to-end) Performance Monitor - Monitors RX and TX performance between two devices. • Filter Performance Monitor - Measures the number of frames transmitted through a port that match specific values in the first 64 bytes of the frame. Because the entire Fibre Channel frame header and many of upper protocol's header fall within the first 64 bytes of a frame, filter-based monitoring can measure different types of traffic transmitted through a port.
Port persistence Port persistence The data collected in port monitoring can vary a great deal over short time periods. Therefore, the port can become a source of frequent event messages (the data can exceed the threshold range and return to a value within the threshold range). Fabric Watch uses port persistence for a port event that requires the transition of the port into a marginal status.
Logical switch support Refer to Fabric Watch reports on page 107 for information on viewing the current switch policies using the Switch Status Policy report. Logical switch support Fabric Watch can monitor the switch health on eight logical switches. You can configure thresholds and notifications for ports that belong to a particular logical switch. Each logical switch has its own Fabric Watch configuration and triggers notifications based on its local configuration.
Fabric Watch event settings Fabric Watch event settings Fabric Watch uses two types of settings: factory default settings and user-defined custom settings. • Factory default settings are automatically enabled. These settings vary depending on hardware platform, and cannot be modified. • You can create custom configurations to suit your unique environment.
SNMP trap counters SNMP agent is configured to a specific error message level, error messages at that level trigger SNMP traps.
Data values changes. These events include login failures, zone configuration changes, firmware downloads, and other configuration changes; in other words any critical changes that have a serious effect on the operation and security of the switch. Important information related to event classes is also tracked and made available. For example, you can track changes from an external source by the user name, IP address, or type of management interface used to access the switch.
Fabric Watch The following classes are not supported in Access Gateway mode: • • • • Fabric Security E_Port (Port subclass) VE_Port (Port subclass) Fabric Watch Administrators Guide 53-1003142-01 23
Fabric Watch support in Access Gateway mode 24 Fabric Watch Administrators Guide 53-1003142-01
Fabric Watch Thresholds ● Threshold values.............................................................................................................25 ● Threshold triggers........................................................................................................... 26 ● Time bases......................................................................................................................27 ● Fabric Watch alarm behavior.................................................................
Threshold triggers FIGURE 1 In-between buffer values Threshold triggers This section describes how Fabric Watch compares a fabric element’s data value against a threshold value to determine whether or not to trigger an event. It describes how a specified buffer zone affects event triggering. For Fabric Watch to monitor data values for one of the following conditions, the alarm setting must be set to a nonzero value.
Below event trigger FIGURE 2 Above event trigger with buffer zone Below event trigger The Below event trigger generates an event when a data value becomes less than the low threshold boundary. When a buffer is defined, the event will be triggered only when the value goes below the lower threshold. A second event will not be generated until the value crosses the buffer region set above the lower threshold.
Time base set to other than none The figure below shows a high limit of 65° Celsius placed on a counter measuring temperature. During each sample period, Fabric Watch measures the temperature and compares it to the high threshold. If the measured temperature exceeds the high threshold, it triggers an event. FIGURE 3 Time base set to none Time base set to other than none If you specify a time base value other than none (minute, hour, or day), Fabric Watch does not use the current data value.
Fabric Watch alarm behavior FIGURE 4 Event trigger Fabric Watch alarm behavior Fabric Watch alarm behavior depends on the threshold states associated with the Above, Below and Changed thresholds. Threshold states can be INFORMATIVE, IN_RANGE, and OUT_OF_RANGE. Notifications are generated only for the following transitions: • IN_RANGE to OUT_OF_RANGE • OUT_OF_RANGE to IN_RANGE No alarm is generated for INFORMATIVE to IN_RANGE (or IN_RANGE to INFORMATIVE).
Fabric Watch alarm behavior 30 Fabric Watch Administrators Guide 53-1003142-01
Fabric Watch Threshold Components ● Fabric Watch classes, areas, and elements................................................................... 31 Fabric Watch classes, areas, and elements Fabric Watch uses a hierarchical organization to track the network device information it monitors. There is a class, area, and element associated with every monitored behavior. Classes are the highest level in the system, subdivided into one or more areas. Areas contain one or more elements.
Elements The Resource class and Environment class areas and actions are configured using the sysMonitor command. The FRU class actions are configured using the fwFruCfg command Elements Fabric Watch defines an element as any fabric or switch component that the software monitors. Within each area, the number of elements is equivalent to the number of components being monitored. For instance, on a 64-port switch, each area of the Port class includes 64 elements.
Fabric Watch Threshold Components TABLE 1 Fabric Watch classes (Continued) Class Description Performance Monitor Serves as a tuning tool. The Performance Monitor class groups areas that track the source and destination of traffic. Use the Performance Monitor class thresholds and notifications to determine traffic load and flow and to reallocate resources appropriately. The Performance Monitor class is divided into the following areas: EE (end-to-end) Performance Monitor, and Filter Performance Monitor.
Elements 34 Fabric Watch Administrators Guide 53-1003142-01
Fabric Watch Activation ● Interfaces for activating Fabric Watch.............................................................................35 ● Activating Fabric Watch using a Telnet session..............................................................35 ● Activating Fabric Watch using SNMP............................................................................. 36 ● Activating Fabric Watch using Web Tools.......................................................................
Activating Fabric Watch using SNMP 4. Enter the license key with the licenseAdd key command, where key is the Fabric Watch license key. License keys are case-sensitive, so type the license key exactly as it appears. switch:admin> licenseadd "R9cQ9RcbddUAdRAX" 5. Enter the licenseShow command to verify successful activation. If the license is not listed, verify that you typed the key correctly; if you did not, then repeat step 4.
Fabric Watch Activation In the above figure, the MIB browser populated the left side of the window with a MIB tree that you can navigate. 4. Open Web Tools and select Tasks > Manage > Switch Admin. 5. Click Show Advanced Mode. 6. On the SNMP tab, enter the IP address of the trap receiver and the severity level, and click Apply. NOTE The severity level must be Informational (4) in order to forward threshold alerts. 7.
Activating Fabric Watch using Web Tools 9. Enter the IP address for the switch in the Host field in the MIB browser. Enter the community string in the Community field. To perform set operations, enter the write community in the Write Community field. 10.View and listen for trap details from a MIB browser menu. NOTE Any changes related to Fabric Watch, such as changing the status of the temperature sensor, will generate traps. 11.Expand the tree on the left to find the Fabric Watch OID information.
Fabric Watch Activation 1. Open the Web browser and type the IP address of the device in the Address field. For example, http://10.77.77.77 or https://10.77.77.77 2. Press Enter. A browser window opens to open Web Tools. A login dialog box opens. 3. Enter your user name and password. 4. Select a switch from the Fabric Tree and log in if necessary. 5. Select Tasks > Manage > Fabric Watch.
Activating Fabric Watch using Web Tools 40 Fabric Watch Administrators Guide 53-1003142-01
Fabric Watch Configuration ● Fabric Watch configuration tasks.................................................................................... 41 ● Setting Fabric Watch custom and default values............................................................ 43 ● E-mail notification configuration...................................................................................... 43 ● Notification configuration.................................................................................................
Fabric Watch Configuration TABLE 2 Fabric Watch configuration tasks (Continued) Configuration task Command Location of procedure Set the following parameters for port monitoring: portThConfig Port Monitoring on page 65 • • • • • • • • Port type Area type Time base Threshold level Trigger (boundary level) Action (notification type) Buffer Port fencing Set the port persistence time. fwSet --port -persistence Setting the port persistence time on page 79 Configure port fencing.
Setting Fabric Watch custom and default values Setting Fabric Watch custom and default values Use the following commands to switch between custom and default values. These commands reset all thresholds for all classes: • fwSetToCustom — Sets the boundary and alarm level to custom. • fwSetToDefault — Restores the boundary and alarm level to the default. E-mail notification configuration In environments where it is critical that you are notified about errors quickly, you can use e-mail notifications.
Disabling an e-mail alert Fabric Watch displays e-mail alert information such as: Mail Recipient Information ____________________________________ Email Alert = enabled Mail Recipient = sysadmin@mycompany.com The system returns to the main fwMailCfg menu. Disabling an e-mail alert 1. Enter 2 in the fwMailCfg menu to disable e-mail alerts for a specific class. 2. Select a class for which Fabric Watch should disable e-mail alerts from the Config Show menu.
Setting the recipient e-mail address for an e-mail alert Setting the recipient e-mail address for an e-mail alert 1. Enter 5 in the fwMailCfg menu to specify the recipient to whom Fabric Watch should send the e-mail alert for a class. The Config Show menu displays. 2. Select a class. The following prompt displays: Mail To: [NONE] 3. Enter the e-mail address of the person responsible for the specific class of alerts.
Notification configuration Notification configuration Notifications act as a signal or alert that notifies you when a threshold has been crossed. When you use alarm notifications, error messages are sent to designated locations such as an error log, SNMP trap view, or e-mail. With an error log, you can log in to a particular switch to view the error messages that have been captured for that particular switch. You can parse the log file to make error message searches quicker and easier.
Fabric, Security, SFP, and Performance Monitoring ● Fabric monitoring guidelines and default settings........................................................... 47 ● Security monitoring guidelines and default settings........................................................ 50 ● SFP monitoring guidelines and default settings.............................................................. 52 ● Performance monitoring guidelines and default settings................................................
Fabric monitoring setting guidelines TABLE 3 Fabric class areas (Continued) Area Description Segmentation changes (SC) Tracks the cumulative number of segmentation changes. Segmentation changes occur because of one of the following: • • Zone conflicts. Incompatible link parameters. During E_Port and VE_Port initialization, ports exchange link parameters, and incompatible parameters result in segmentation. This is a rare event. • Domain conflicts.
Fabric, Security, SFP, and Performance Monitoring TABLE 4 Fabric class default settings Area Description Default threshold settings Default alarm settings Threshold state Domain ID changes Monitors forcible DOMAIN ID changes Unit: D_ID Changes Below: 0 Informative Time Base: none Above: 0 Informative Unit: Downs Below: 0 Informative Time Base: none Above: 0 Informative Unit: Logins Below: 0 Informative Time Base: none Above: 0 Informative Unit: Reconfigs Below: 0 Informative Time
Security monitoring guidelines and default settings Security monitoring guidelines and default settings The Security class monitors all attempts to breach your SAN security, helping you fine-tune your security measures. Security class areas The table below lists Fabric Watch areas in the Security class and describes what each area indicates. Although it is recommended that you leave the entire Security class in its default state (no alerts), you can configure the Security class using the thConfig command.
Fabric, Security, SFP, and Performance Monitoring TABLE 6 Security class area default settings Area Description Default threshold settings Default alarm settings Threshold state DCC violations (DV) Monitors DCC violations Unit: Violations Below: 0 Informative Time Base: minute Above: 3 Out_of_range Unit: Violations Below: 0 Informative Time Base: minute Above: 3 Out_of_range Unit: Violations Below: 0 Informative Time Base: minute Above: 3 Out_of_range Below: 0 Informative Above: 3
SFP monitoring guidelines and default settings TABLE 6 Security class area default settings (Continued) Area Description Default threshold settings Default alarm settings Threshold state No-FCS (NF) Monitors No FCS violations Unit: Violations Below: 0 Informative Time Base: minute Above: 3 Out_of_range Unit: Violations Below: 0 Informative Time Base: minute Above: 3 Out_of_range Unit: Violations Below: 0 Informative Time Base: minute Above: 3 Out_of_range Unit: Violations Below: 0
SFP class areas SFP class areas The table below lists Fabric Watch areas in the SFP class and describes each area. Although it is recommended that you leave the entire SFP class in its default state (no alerts), you can configure the SFP class using the thConfig command. NOTE SFPs connected to GbE ports are not monitored. TABLE 7 SFP class areas Area Description Temperature Measures the physical temperature of the SFP, in degrees Celsius.
Performance monitoring guidelines and default settings TABLE 8 SFP class default settings (Continued) Area Description Default threshold settings Default alarm settings Threshold state Receive power (RXP) Monitors receive power Unit: µWatts Below: 1 Out_of_range Time Base: none Above: 1 Out_of_range Unit: µW Below: 1 Out_of_range Time Base: none Above: 1 Out_of_range Unit: mAmps Below: 1 Out_of_range Time Base: none Above: 1 Out_of_range Below: 1 Out_of_range Above: 1 Out_of_rang
Performance Monitor class areas Performance Monitor class areas The table below lists Fabric Watch areas in the Performance Monitor class and describes each area. Although it is recommended that you leave the entire Performance Monitor class in its default state (no alerts), you can configure the Performance class using the thConfig command.
thConfig command TABLE 11 End-to-End Performance Monitor class default settings Area Description Default threshold settings Default alarm settings Threshold state End-to-end receive performance (RX performance) Monitors the receiving traffic between a SID_DID pair in a port Unit: KBps Below: 0 Informative Time Base: none Above: 0 Informative Below: 0 Informative Above: 0 Informative Low: 0 High: 0 Buffer: 0 End-to-end transmit performance (TX performance) Monitors the transmit Unit: KBps
thConfig command examples TABLE 12 Configuration options for thConfig command (Continued) Class name Valid area types Threshold Threshold action Configuration recommendation Security TV - Telnet violations Default or Custom Default or Custom Use the Security class default settings for areas and alarm configuration. There is no reason to alter the default settings. Default or Custom Default or Custom Use the SFP default settings. The traits are SFP-specific and there is no reason to alter them.
Pausing and continuing monitoring NOTE Both the APM license and the Fabric Watch license must be installed on the platform configured in AG mode to use the frame monitoring and EE monitoring capabilities. The APM license provides the counters and the Fabric Watch license provides the monitoring and alert mechanisms for these counters. To set the high threshold of the RX area, enter the thConfig command using the following parameters.
Voltage and temperature monitoring using the QSFP Fabric Watch also monitors the Brocade Quad SFP (QSFP) and, as for 16 Gbps SFPs, if configured thresholds are crossed, Fabric Watch generates an SNMP alarm, a RASLog message, and an e-mail alert for the following SFP areas: • • • • Current Voltage Temperature RXP NOTE On core blades, only 16 Gbps QSFPs can be installed.
Specifying the 16 Gbps SFP type Specifying the 16 Gbps SFP type You can use the --sfpType operand as part of the thconfig command to manage the actions and thresholds for the Current, Voltage, RXP, TXP, and Temperature areas of the 16 Gbps SFPs as shown in the following example.
Displaying the number of 16 Gbps SFP operational hours TABLE 13 16 Gbps and QSFP configurable SFP types (Continued) SFP Type 10GLWL Others Serial Number KD N/A Area Default High Default Low TXP (μW) 1999 125 Current (mA) 10 3 Temperature (°C) 90 -5 Voltage (mV) 3600 2970 RXP (μW) 2230 14 TXP (μW) 2230 60 Current (mA) 95 10 Temperature (°C) 85 -10 Voltage (mV) 3630 2970 RXP (μW) 5000 0 TXP (μW) 5000 0 Current (mA) 50 0 Refer to the Fabric OS Command Reference for
Recommended settings for Fabric, SFP, Performance, and Security classes Recommended settings for Fabric, SFP, Performance, and Security classes The table below lists the recommended settings for the Fabric, SFP, Security, and Performance classes discussed in this document. For all of these classes, it is recommended that you use the default settings.
Fabric, Security, SFP, and Performance Monitoring TABLE 14 Recommended Environment and Resource class settings (Continued) E=Error_Log, S=SNMP_Trap, P=Port_LOG_LOCK, M=EMAIL_ALERT, PF=Port Fence HTTP violations X Violations Minute 1 2 0 X E,S SCC violations X Violations Minute 1 2 0 X E,S DCC violations X Violations Minute 1 2 0 X E,S Login violations X Violations Minute 1 2 0 X E,S SLAP failures X Violations Minute 1 2 0 X E,S TS out-of-sync X Violations Min
Recommended settings for Fabric, SFP, Performance, and Security classes 64 Fabric Watch Administrators Guide 53-1003142-01
Port Monitoring ● Port class areas.............................................................................................................. 65 ● Port class guidelines and default settings....................................................................... 66 ● Port configuration............................................................................................................ 69 ● portThConfig command procedures................................................................................
Port class guidelines and default settings TABLE 15 Port class areas (Continued) Area Description Signal loss The number of times that a signal loss occurs in a port. Signal loss indicates that no data is moving through the port. A loss of signal usually indicates a hardware problem. Sync loss The number of times a synchronization error occurs on the port. Two devices failed to communicate at the same speed. Synchronization errors are always accompanied by a link failure.
Physical port setting guidelines Physical port setting guidelines It is recommended that you use the default settings listed in Port class default settings on page 67 for most Port class areas. Consider the Port class to be a superset containing the E_Port, FOP_Port, and FCU_Port subclasses. If you make a change to a default setting for an area in the Port class, it applies to all of the subclasses.
Port Monitoring TABLE 16 Port class default settings (Continued) Area Description Default threshold settings Default alarm settings Threshold state State changes (ST) Monitors state changes.
Port configuration Port configuration Use the portThConfig command to configure thresholds for Fabric Watch event monitoring for all ports of a specified type and to display the configuration and current port status in real time. The command syntax is detailed in the Fabric OS Command Reference. Before you configure thresholds, you must first identify and select the appropriate class and areas, which are described in Port class areas on page 65.
Port type: physical port Port type: physical port The Port class is a superset containing the E_Port, FOP_Port, and FCU_Port subclasses. In general, use the default settings listed in Port class default settings on page 67, or use the generic Port class to configure an area whose settings are common to one or more of the port subclasses; for example, configuring all physical ports to monitor invalid CRC counts. In most cases, the default settings are adequate for the physical port.
Pausing and continuing monitoring • Set the action to take when a trigger occurs. Here, for the low threshold, only log the event in the RASLog. For the high threshold, log the event in the RASLog and issue an SNMP trap. • Set the buffer to 0 (the default is 100). 2. Apply the new custom settings so they become effective. switch:admin> portthconfig --apply port -area itw -action cust -thresh_level custom 3.
E_Port class default settings frame drops if left unchecked; therefore, set the alarm to fence the port. Refer to Port type: E_Port, FOP_Port, or FCU_Port on page 79 for instructions. • Areas: Receive (Rx) and Transmit (Tx) Performance Rx and Tx Performance areas are used to monitor the bandwidth utilization of the interswitch links (ISLs) in the fabric. Set the high boundary to 75 percent and the alarms to Above and In-Between conditions.
Port Monitoring TABLE 17 E_Port class default settings (Continued) Area Description Default threshold settings Default alarm settings Threshold state Invalid transmission words (ITW) Monitors the number of invalid words transmitted.
FOP_Port and FCU_Port subclass setting guidelines TABLE 17 E_Port class default settings (Continued) Area Description Default threshold settings Default alarm settings Threshold state Received packets (RXP) Monitors the receive rate, by percentage.
FOP_Port and FCU_Port subclass default settings • Areas: Link Failure Count, Loss of Synchronization Count Change the default high boundary from 500 to 15 (per minute) for Link Failure Count and from 500 to 45 (per minute) for Loss of Synchronization Count. Leave the Buffer setting to 0 (the default). Set the alarm configurations to send alerts to both the error log and SNMP. These settings are the same for an HBA port or a storage port.
Port Monitoring • • • • • • Link Failure Count Loss of Synchronization Count Primitive Sequence Protocol Error Invalid Transmission Word Invalid CRC Count Class 3 Discards The only ports which fall into the FCU_Port subclass are internal copper-based ports on embedded switches. Copper-based ICL ports are not part of the FCU_Port subclass, and optical ICL ports are not part of the FOP_Port subclass.
Port Monitoring TABLE 18 FOP_Port and FCU_Port subclass default settings (Continued) Area Description Default threshold settings Default alarm settings Threshold state Sync loss Monitors the number of loss of synchronization errors.
VE_Port class default settings TABLE 18 FOP_Port and FCU_Port subclass default settings (Continued) Area Description Default threshold settings Default alarm settings Threshold state Link reset Monitors the number of link resets sent by a given port (LR-Out) and received on a given port (LR-In).
Port type: E_Port, FOP_Port, or FCU_Port usually found at less than one percent. Now there is support for configuring packet loss percentages in decimals. To set the high threshold for packet loss for a VE port, enter the portThConfig command using the following parameters: switch:admin> portthconfig --set ve-port -area PKTLOSS -highthreshold -value 0.
Port fencing configuration TABLE 20 Port fencing class and subclass areas Port type Areas supported for port fencing Physical ports Cyclic Redundancy Checks (CRC) Invalid Transmission Words (ITW) Link Reset (LR) Protocol Error (PE) State Change (ST) Class 3 Discard Frames (C3TXO) FOP_Ports Cyclic Redundancy Checks (CRC) Invalid Transmission Words (ITW) Link Reset (LR) Protocol Error (PE) State Change (ST) Class 3 Discard Frames (C3TXO) E_Ports Cyclic Redundancy Checks (CRC) EX_Ports Invalid Transm
Port fencing recommended area settings Port fencing recommended area settings Cyclic redundancy check (CRC) errors and invalid transmission words (ITW) can occur on normal links. They have also been known to occur during certain transitions such as server reboots. When these errors occur more frequently, they can cause a severe impact. While most systems can tolerate infrequent CRC errors or invalid words, other environments can be sensitive to even infrequent instances.
Disabling port fencing The following example configures port fencing on an FOP_Port for the Class 3 discard frame area. portFencing --enable fop-port --area C3TX_TO Disabling port fencing Use the portFencing --disable command to disable port fencing for the specified areas on all ports of the specified port types. You can use the portFencing --show command to display the configuration.
Port health and CRC monitoring Refer to the Brocade Network Advisor User Manual for detailed instructions on how to add, edit, view, and remove thresholds. Port health and CRC monitoring There are two types of CRC errors that can be logged on a switch; taken together they can assist in determining which link introduced the error into the fabric. The two types are plain CRCs, which have bad end-of-frame (EOF) markers and CRCs with good EOF (crc g_eof) markers.
Port Monitoring TABLE 22 Recommended configuration for the port class (Continued) E=Error_Log, S=SNMP_Trap, P=Port_LOG_LOCK, M=EMAIL_ALERT, F=Port Fence E_Port State Changes X Changes Minute 0 50 0 X Link Reset X Errors Minute 0 500 50 X C3 Discard X Errors Minute 0 5 0 X Link Loss X Errors Minute 0 0 0 X Sync Loss X Errors Minute 0 45 0 X E,S Signal Loss X Errors Minute 0 45 0 X E,S Errors Minute 0 5 0 Protocol Error X X Invalid Words X Errors
Port Monitoring TABLE 22 Recommended configuration for the port class (Continued) E=Error_Log, S=SNMP_Trap, P=Port_LOG_LOCK, M=EMAIL_ALERT, F=Port Fence RX Performance X Percentage Minute 0 85 0 X E TX Performance X Percentage Minute 0 85 0 X E State Changes X Changes Minute 0 5 0 X Link Reset X Errors Minute 0 500 50 X E C3 Discard X Errors Minute 0 5 0 X E Trunk Util X Percentage Minute 0 100 0 X FOP_Port and FCU_Port Link Loss X Errors Minute 0 15
Recommended port configuration settings 86 Fabric Watch Administrators Guide 53-1003142-01
System Monitoring ● Environment monitoring.................................................................................................. 87 ● Resource class settings.................................................................................................. 88 ● System monitoring using the sysMonitor command........................................................89 ● Recommended environment and resource monitoring settings......................................92 ● Switch monitoring................
Resource class settings NOTE Fabric Watch no longer supports fan monitoring. Event Manager (EM) now manages fan monitoring and the switch status is calculated based on the fan status reported by EM.
Resource class default settings TABLE 25 Resource class area Area Description Flash Monitors the compact flash space available by calculating the percentage of flash space consumed and comparing it with the configured high threshold value.
Using the nosave command System monitoring is disabled by default. You must run both the --config -mem and the --config -cpu commands to enable both memory and CPU system monitoring. Using the nosave command The nosave command prevents the configuration changes from being saved persistently. This option allows you to make and view changes without overwriting the saved configuration. CAUTION When you use --config with the --nosave option and the switch reboots, your changes are lost.
Resource class settings --config env -area temp -highth -value 99 -trigger above -action raslog 2. Apply the changes: switch:admin> sysmonitor --apply env -area temp -action_level cust -thresh_level cust Resource class settings The flash area of the Resource class monitors the percentage of compact flash memory used on the system.
Displaying the current memory usage threshold Displaying the current memory usage threshold To display the current memory usage threshold, enter the sysMonitor --show mem command.
Switch status policy planning Switch status policy planning Fabric Watch monitors the health of the switch under various classes. The table below lists the current overall switch status policy parameters in a switch and identifies the factors that affect their health. Note that not all switches use the listed monitors. Use the switchStatusPolicySet command to manually change the policy setting. Refer to the Fabric OS Command Reference for more information.
Brocade 6505 default policy NOTE The presence of four or more FS8-18 encryption blades in the DCX Data Center Backbone causes the Fabric Watch switch status policy for power supplies to assume a policy setting of 2,1. Brocade 6505 default policy The default Fabric Watch policy for the Brocade 6505 is one power supply in the left bay with an optional configuration of two power supplies in both the left and right bays.
FRU monitoring • Flash — Flash thresholds monitor flash memory. • Marginal Ports — Ports that move into the marginal state for reasons such as insufficient buffer credits. • Port Persistence Time — Fabric Watch waits for the port persistence time duration before it declares the port to be in the MARGINAL state when it crosses the high threshold. • Faulty Ports — Ports that are faulty because of hardware faults, such as invalid SFPs. • Missing SFPs — Monitors the number of ports without SFPs.
Specifying triggers for FRU alarms NOTE The Off state is applicable only to fans on some platforms, such as the Brocade DCX and Brocade DCX-4S. The Off state is not applicable to the power supply, slot, or WWN FRUs. 1. Establish a Telnet or PuTTY connection to the switch. 2. Log in using administrative privileges. 3. Enter the fwFruCfg command at the command prompt. The fwFruCfg command displays your current FRU configuration. The types of FRUs are different for the various platforms. 4.
Recommended FRU settings Recommended FRU settings TABLE 30 Recommended FRU settings E=ERROR_LOG, M=EMAIL_ALERT Class Area Absent FRU Slot X X Power Supply X X Fan X X WWN X X Fabric Watch Administrators Guide 53-1003142-01 Inserted On Off Faulty Below Above 97
Recommended FRU settings 98 Fabric Watch Administrators Guide 53-1003142-01
Fabric Watch Configuration Using Web Tools ● Using Web Tools to configure Fabric Watch...................................................................99 ● Fabric Watch threshold configuration using Web Tools................................................101 ● Configuring alarms for FRUs using Web Tools.............................................................103 ● Configuring alarm filters using Web Tools....................................................................
System monitoring using Web Tools FIGURE 7 Fabric Watch Window The Fabric Watch Explorer pane on the left side of the window displays the available classes. Not all classes are available for all switches. The status bar at the bottom of the window provides you with a summary of recent actions, and the date and time the module was last updated. System monitoring using Web Tools The Fabric Watch license must be installed to view and modify the System Monitor details.
Fabric Watch threshold configuration using Web Tools Fabric Watch threshold configuration using Web Tools The Threshold Configuration tab enables you to configure event conditions. From this tab, you configure threshold traits, alarms, and e-mail configurations. NOTE Use the procedures in this section to configure threshold traits for all classes except for the FRU class. Use the procedure described in Configuring alarms for FRUs using Web Tools on page 103 for the FRU class.
Configuring threshold alarms Configuring threshold alarms After you update the threshold information, use the Alarm Configuration subtab to customize the notification settings for each event setting.
Configuring alarms for FRUs using Web Tools 1. Open the Fabric Watch window. 2. In the Fabric Watch Explorer pane, select a class. You can set alarms for information on a switch only if that information is monitored by Fabric Watch for that switch; not all alarm options are available for all switches. 3. Select the Threshold Configuration tab. 4. Under Area Selection, select the area with the alarms that you want to enable or disable. 5. Select the Element Configuration subtab. 6.
Configuring alarm filters using Web Tools Configuring alarm filters using Web Tools The Fabric Watch window provides GUI support for the CLI command fwalarmsfilterset. This option is used to configure the alarm filtering for Fabric Watch. By disabling the alarms, all non-environment and non-resource class alarms are suppressed. By enabling the alarms, all class alarms are generated. To configure the alarm filter, perform the following steps: 1. Open the Fabric Watch window. 2.
E-mail notification using Web Tools NOTE For the FRU class, only the Name, State, and Time columns are displayed. In addition, if the FRU area is Fan, the Name column refers to either a fan or a fan FRU, depending on the switch model.
Enabling the e-mail alert 5. In the Domain Name field, enter the domain name (between 4 and 32 characters). 6. Click Apply. Enabling the e-mail alert You can set a different e-mail alert configuration for each FRU class. For example, you can set one email notification for SFPs and another for E_Ports. Before configuring e-mail alert recipients, you must set up the e-mail notification recipient’s DNS server and domain name. Refer to Configuring the e-mail server on a switch on page 105. Fabric OS 7.3.
Fabric Watch Reports ● Fabric Watch reports ....................................................................................................107 ● Switch Availability Monitor report.................................................................................. 108 ● Switch Health report......................................................................................................108 ● Switch Status Policy report....................................................................................
Switch Availability Monitor report Switch Availability Monitor report The Switch Availability Monitor (SAM) report lets you see the uptime and downtime for each port. It also enables you to check if a particular port is failing more often than the others. NOTE SAM report details do not display the health status of GbE ports. Fabric Watch only monitors and reports the status for physical and virtual FC ports. You can run reporting commands in Fabric Watch to get instant access to switch information.
Generating a Switch Health report • Current health of each port, based on the currently configured policy settings. • High-level state of the switch, the power supplies, and temperature monitor. • All ports that are in an abnormal state and the current health state of each port. The switch health report is available even without Fabric Watch, but for licensed Fabric Watch users, the marginal and faulty ports are included in the report.
Port Detail report Down PowerSupplies 2 Temperatures 2 Fans 2 Flash 0 MarginalPorts 6.15% FaultyPorts 16.50% MissingSFPs 20.00% ErrorPorts 20.10% Number of Ports: 512 Marginal 1 1 1 1 2.25% 12.19% 10.89% 20.96% Port Detail report If the Switch Health report shows marginal throughput or decreased performance, use the Port Detail report to see statistics on each port. The Port Detail report is a Fabric Watch licensed product. You can also see port details by health.
Fabric Watch Reports 215 216 217 G HEALTHY 000:00 VE HEALTHY 061:19 VE HEALTHY 061:19 (output truncated) - - - - - - - - - - - - - NOTE Output of the Port Detail report depends on the ports that belong to the current Admin Domain context. If a port does not belong to the current Admin Domain, nothing other than the port number is displayed for that port; for example: 000 ----------------Not a member of current Admin Domain-------------------.
Generating a Port Detail report 112 Fabric Watch Administrators Guide 53-1003142-01
Index A command dnsConfig 20 errShow 21 fmConfig 16 fwclassinit 35 fwFruCfg 15, 95 fwMailCfg 20 licenseAdd 35 portThConfig 17, 65, 69 above event triggers 26 access gateway mode, restrictions in Fabric Watch 22 action configuration guidelines 15 activating Fabric Watch using a Telnet session 35 using SNMP 36 using Web Tools 38 alarm behavior 29 alarm configuration report for Fabric Watch 104 alarm notification configuration 46 portThConfigcommand fwFruCfg 15 snmpConfig 20 switchStatusPolicySet 92 sysM
E E_Port default settings 72 E_Port setting guidelines 71 e-mail, testing a message 44 e-mail alert how to disable 44 how to enable 44 setting recipient e-mail address 45 e-mail notification configuration 43 email notifications 105 enabling Fabric Watch threshold alarms 102 environment class area 87 default settings 87 monitoring 87 recommended settings 92 environment class areas 87 environment monitoring guidelines 87 event settings 26 event triggers above 26 below 27 F fabric class areas 47 default
I port class interface types 35 invalid CRC area, configuring 70 IP address, setting for notification 45 L areas 65 default settings 67 guidelines and default settings 66 port configuration invalid CRC area 70 recommended settings table 83 port fencing licenseAdd key command 35 locked port log notification type 21 M management information base (MIB) 19 memory configuration limits 91 configuring the usage threshold 92 MIBs, using remotely 19 monitoring customizing settings 15 fabric events 16 fabric
recommended settings 62 settings, customizing 14 setting time base 27 SFP support for 10 Gbps and 16 Gbps 17 SFP class monitoring guidelines 52 SFP monitoring recommended settings 62 V VE_Port class default settings 78 W Web Tools, using to activate Fabric Watch 38 SNMP components of 19 using to activate Fabric Watch 36 switch monitoring components 16 switch policies 18 switch status policy implementing 94 viewing 94 switch status policy configuration 41 switch status policy planning 92 switch tempe