HP XC System Software Release Notes Version 3.
© Copyright 2007, 2008 Hewlett-Packard Development Company, L.P. Confidential computer software. Valid license from HP required for possession, use or copying. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor's standard commercial license. The information contained herein is subject to change without notice.
Table of Contents About This Document.........................................................................................................7 Intended Audience.................................................................................................................................7 Typographic Conventions......................................................................................................................7 HP XC and Related HP Products Information....................................
5.6.1 HP ProLiant DL140 G3 and DL145 G3 Node Imaging Fails When Graphics Cards Are Present.............................................................................................................................................27 6 Software Upgrades......................................................................................................29 6.1 Do Not Upgrade If You Want Or Require The Voltaire InfiniBand Software Stack.......................
15.2.3 New preupgradesys-lxc.8......................................................................................................47 15.2.4 New upgradesys-lxc.8...........................................................................................................48 Index.................................................................................................................................
About This Document This document contains release notes for HP XC System Software Version 3.2. This document contains important information about firmware, software, or hardware that might affect the system. An HP XC system is integrated with several open source software components. Some open source software components are being used for underlying technology, and their deployment is transparent.
Variable [] {} ... | WARNING CAUTION IMPORTANT NOTE The name of a placeholder in a command, function, or other syntax display that you replace with an actual value. The contents are optional in syntax. If the contents are a list separated by |, you can choose one of the items. The contents are required in syntax. If the contents are a list separated by |, you must choose one of the items. The preceding element can be repeated an arbitrary number of times. Separates items in a list of choices.
HP XC Program Development Environment The Program Development Environment home page provide pointers to tools that have been tested in the HP XC program development environment (for example, TotalView® and other debuggers, compilers, and so on). http://h20311.www2.hp.com/HPC/cache/276321-0-0-0-121.html HP Message Passing Interface HP Message Passing Interface (HP-MPI) is an implementation of the MPI standard that has been integrated in HP XC systems.
Standard LSF is also available as an alternative resource management system (instead of LSF-HPC with SLURM) for HP XC. This is the version of LSF that is widely discussed on the Platform Web site.
• http://linuxvirtualserver.org Home page for the Linux Virtual Server (LVS), the load balancer running on the Linux operating system that distributes login requests on the HP XC system. • http://www.macrovision.com Home page for Macrovision®, developer of the FLEXlm™ license management utility, which is used for HP XC license management. • http://sourceforge.
Compiler Web Sites • http://www.intel.com/software/products/compilers/index.htm Web site for Intel® compilers. • http://support.intel.com/support/performancetools/ Web site for general Intel software development information. • http://www.pgroup.com/ Home page for The Portland Group™, supplier of the PGI® compiler. Debugger Web Site http://www.etnus.com Home page for Etnus, Inc., maker of the TotalView® parallel debugger. Software RAID Web Sites • http://www.tldp.org/HOWTO/Software-RAID-HOWTO.
HP Encourages Your Comments HP encourages comments concerning this document. We are committed to providing documentation that meets your needs. Send any errors found, suggestions for improvement, or compliments to: feedback@fc.hp.com Include the document title, manufacturing part number, and any comment, error found, or suggestion for improvement you have concerning this document.
1 New and Changed Features This chapter describes the new and changed features delivered in HP XC System Software Version 3.2. 1.1 Base Distribution and Kernel The following table lists information about the base distribution and kernel for this release as compared to the last HP XC release. HP XC Version 3.2 HP XC Version 3.1 Enterprise Linux 4 Update 4 Enterprise Linux 4 Update 3 HP XC kernel version 2.6.9-42.9hp.XC HP XC kernel version 2.6.9-34.7hp.XC Based on Red Hat kernel version 2.6.9-42.0.8.
The following are the key features of SVA: • • • • • Capturing and managing visualization-specific cluster information Managing visualization resources and providing facilities for requesting and allocating resources for a job in a multi-user, multi-session environment Providing display surface configuration tools to allow easy configuration of multi-panel displays Providing launch tools, both generic and tailored to a specific application, that launch applications with appropriate environments and display
the login role on the head node yet keep the head node load to a minimum because login sessions are not being spawned. This configuration choice is documented in the HP XC System Software Installation Guide. 1.9 System Management and Monitoring Enhancements System management and monitoring utilities have been enhanced as follows: • A new resource monitoring tool, resmon, has been added. resmon is a job-centric resource monitoring Web page initially inspired by the open-source clumon product.
1.
2 Important Release Information This chapter contains information that is important to know for this release. 2.1 Firmware Versions The HP XC System Software is tested against specific minimum firmware versions. Follow the instructions in the accompanying hardware documentation to ensure that all hardware components are installed with the latest firmware version. The master firmware tables for this release are available at the following Web site: http://www.docs.hp.com/en/linuxhpc.
3 Hardware Preparation Hardware preparation tasks are documented in the HP XC Hardware Preparation Guide. This chapter contains information that was not included in that document at the time of publication. 3.1 Upgrading BMC Firmware On HP ProLiant DL140 G2 and DL145 G2 Nodes This note applies only if the hardware configuration contains HP ProLiant DL140 G2 or DL145 G2 nodes and you are upgrading an existing HP XC system from Version 2.1 or Version 3.0 to Version 3.2.
4 Software Installation On The Head Node This chapter contains notes that apply to the HP XC System Software Kickstart installation session. 4.1 Manual Installation Required For NC510F Driver The unm_nic driver is provided with the HP XC software distribution, however, it does not load correctly. If your system has a NC510F 10 GB Ethernet card, run the following commands to load the driver: # depmod -a # modprobe -v unm_nic Then, edit the /etc/modprobe.
5 System Discovery, Configuration, and Imaging This chapter contains information about configuring the system. Notes that describe additional configuration tasks are mandatory and have been organized chronologically. Perform these tasks in the sequence presented in this chapter. The HP XC system configuration procedure is documented in the HP XC System Software Installation Guide.
# modprobe tg3 # modprobe e1000 7. Follow the instructions in the HP XC System Software Installation Guide to complete the cluster configuration process (beginning with the cluster_prep command). 5.2 Notes That Apply To The Discover Process The notes in this section apply to the discover utility. 5.2.1 Discovery of HP ProLiant DL140 G3 and DL145 G3 Nodes Fails When Graphics Cards Are Present When an HP ProLiant DL140 G3 or DL145 G3 node contains a graphics card, the nodes often fail to PXE boot.
This message is displayed because the C52xcgraph configuration script is probing the InfiniBand switch to determine how many HCAs with an IP address are present. Because the HCAs have not yet been assigned an IP address, C52xcgraph does not find any HCAs with an IP address and prints the message. This message does not prevent the cluster_config utility from completing. To work around this issue, after the cluster is installed and configured, run /opt/hptc/hpcgraph/sbin/hpcgraph-setup with no options. 5.
6 Software Upgrades This chapter contains notes about upgrading the HP XC System Software from a previous release to this release. Installation release notes described in Chapter 4 (page 23) and system configuration release notes described in Chapter 5 (page 25) also apply when you upgrade the HP XC System Software from a previous release to this release. Therefore, when performing an upgrade, make sure you also read and follow the instructions in those chapters. 6.
7 System Administration, Management, and Monitoring This chapter contains notes about system administration, management, and monitoring. 7.1 Perform A Dry Run Before Using The si_updateclient Utility To Update Nodes The si_updateclient utility can leave nodes in an unbootable state in certain situations. You can still use si_updateclient to deploy image changes to nodes.
8 HP XC System Software On Red Hat Enterprise Linux The notes in this chapter apply when the HP XC System Software is installed on Red Hat Enterprise Linux. 8.1 Enabling 32–bit Applications To Compile and Run To compile and run 32-bit applications on a system running HP XC System Software on Red Hat Enterprise Linux 4 on HP Integrity platforms, use the following commands to install the glibc-2.3.4-2.25.i686.
9 Programming and User Environment This chapter contains information that applies to the programming and user environment. 9.1 MPI and OFED InfiniBand Stack Fork Restrictions With the introduction of the OFED InfiniBand stack in this release, MPI applications cannot call fork(), popen(), and system() between MPI_Init and MPI_Finalize. This is known to affect some applications like NWChem. 9.
10 Cluster Platform 3000 At the time of publication, no release notes are specific to Cluster Platform 3000 systems.
11 Cluster Platform 4000 At the time of publication, no release notes are specific to Cluster Platform 4000 systems.
12 Cluster Platform 6000 This chapter contains information that applies only to Cluster Platform 6000 systems. 12.1 Network Boot Operation and Imaging Failures on HP Integrity rx2600 Systems An underlying issue in the kernel is causing MAC addresses on HP Integrity rx2600 systems to be set to all zeros (for example, 00.00.00.00.00), which results in network boot and imaging failures. To work around this issue, enter the following commands on the head node to network boot and image an rx2600 system: 1.
13 Integrated Lights Out Console Management Devices This chapter contains information that applies to the integrated lights out (iLO and iLO2) console management device. 13.1 iLO2 Devices In Server Blades Can Hang There is a known problem with the iLO2 console management devices that causes the iLO2 devices to hang. This particular problem has very specific characteristics: • • • This problem is typically seen within one or two days of the initial cluster installation.
14 Interconnects This chapter contains information that applies to the supported interconnect types: • InfiniBand Interconnect (page 45) • Myrinet Interconnect (page 45) • QsNetII Interconnect (page 45) 14.1 InfiniBand Interconnect The notes in this section apply to the InfiniBand interconnect. 14.1.1 enable Password Problem With Voltaire Switch Version 4.
14.3.1 Possible Conflict With Use of SIGUSR2 The Quadrics QsNetII software internally uses SIGUSR2 to manage the interconnect. This can conflict with any user applications that use SIGUSR2, including for debugger use. To work around this conflict, set the environment variable LIBELAN4_TRAPSIG for the application to a different signal number other than the default value 12 that corresponds to SIGUSR2.
15 Documentation This chapter describes known issues with the HP XC documentation. 15.1 Documentation CD Search Option If you are viewing the main page of the HP XC Documentation CD, you cannot perform a literature search from the Search: option box at the top of the page. To search http://www.docs.hp.com or to search all of HP's global Web service, click on the link for More options. The Advanced search options page is displayed, and you can perform the search from the advanced page. 15.
preupgradesys-lxc(8) NAME preupgradesys-lxc - Prepares a system for an XC software upgrade SYNOPSIS Path: /opt/hptc/lxc-upgrade/sbin/preupgradesys-lxc DESCRIPTION Running the preupgradesys-lxc command is one of several commands that are part of the process to upgrade HP XC System Software on Red Hat Enterprise Linux to the next release of HP XC System Software on Red Hat Enterprise Linux The software upgrade process is documented in the HP XC System Software Installation Guide.
The upgradesys-lxc utility is run immediately after the head node is upgraded with the new XC release software and any other required third-party software products. The upgradesys-lxc utility performs the following tasks to upgrade your system: o Makes a backup copy of the database from the previous release. o Modifies attributes in the database to signify that the system has been upgraded. o Removes RPMs from the previous release that are no longer supported in the new release.
Index B H base operating system, 15 hardware preparation tasks, 21 hardware support, 15 HowTo, 18 Web site, 8 HP documentation providing feedback for, 13 HP Scalable Visualization Array (see SVA) HP-MPI fork restrictions with kernel version, 35 fork restrictions with OFED, 35 init failed, 35 multiple rail support, 35 C C52xcgraph error, 26 clear_counters command, 45 client node disk partition, 16 cluster_config utility, 26 C52xcgraph error message, 26 new features, 16 CP3000 system, 37 CP4000 system, 39
OVP enhancements, 17 P partition size limit, 16 patches, 19 Q qsnet diagnostics database, 46 QsNet interconnect, 45 R reporting documentation errors feedback e-mail address for, 13 resmon utility, 17 S si_updateclient utility, 31 signal Quadrics QsNet, 46 software RAID documentation, 12 mdadm utility, 12 SVA, 15 system administration notes, 31 system configuration, 25 system management enhancements, 17 notes, 31 system monitoring, 17 T temperature graph, 17 U unified parallel C, 17 UPC, 17 upgrade, 29