HP Tru64 UNIX and TruCluster Server Version 5.1.B-4 Patch Summary and Release Notes (13156)

Addresses a problem on LAN clusters related to improper keep-alive timeouts that can be
identified when the following console message is displayed during normal operations (that
is, no known failures and no nodes are rebooting):
— WARNING: ics_socket_event: error 60 on channel 0, assume node # is down
Fixes a problem that occurs when the interconnect is configured using NetRAIN,
cluster_rebuild_delay is set significantly below the default value, and members are rebooting
or failures are occurring on the active links. The console message seen when this occurs is
“CNX QDISK: Yielding to foreign owner with provisional quorum.”
Fixes a problem in which I/O barriers may be stalled when a drive becomes hung.
Prevents write failures from a cluster NFS client that may occur when a second user without
write access is concurrently reading the file.
Fixes a problem that occurs during reboots on a heavily loaded cluster using the LAN
interconnect and generates the following messages:
— WARNING: ics_socket_event: error 54 on channel 0
— WARNING: ics_socket_event: error 60 on channel 0
Fixes a kernel memory fault (KMF) in drd_kgs_bid_stop_server_io_drained when a node leaves
during a drd kgs transaction.
Corrects a problem in which drd continually tries to perform a munsa unreject on the drive
when a device is deleted while it is in the munsa reject state.
Corrects a problem in which multiple path failures cause drd to return ENODEV even when
a server is available in the cluster.
Fixes several error-handling problems in drd for device error conditions.
Fixes a problem in which a device cannot be opened due to heavy load on the device.
Fixes a problem in which a CD-ROM is not mountable in a cluster.
Fixes a problem that causes loss of the quorum disk.
Makes quorum disk parameters configurable.
Eliminates a window for kernel memory fault panics on AdvFS system calls that are
performed via function shipping using the clu_msfs_syscall_fship routine.
Improves drd tracing.
Fixes a Sierra Cluster KCH set free race condition.
Fixes two errors in clu_upgrade that prevent the setup stage from completing.
Prevents a get_cs_toks() KMF/assert crash.
Fixes a rm_audit_sync_block panic that occurs when using a long fiber as the Memory
Channel interconnect.
Fixes a timing window in the Internode Communications Subsystem ddr device error
handling.
Fixes the rm_audit_sync_block panic when using a long fiber with VHUB as the Memory
Channel interconnect.
Fixes clu_bdmgr to facilitate CLSM sliced disks for cluster_root domain.
Modifies the manner of checking for user file limits for CFS remote DIO writes.
Ensures that signals for EFBIG writes are properly generated on a client.
Ensures the correct processing of CFS in future releases.
Fixes a multiple-free problem in the 32-byte memory bucket caused by multiple callbacks from
KCH to CLUA.
Fixes an incorrect if statement that, although a low-risk problem, could block access to a
disk device.
Corrects a confusing error message.
Fixes a problem seen in a LAN cluster when the CPUs on a member system are not installed
contiguously in the lower order slots.
Allows the quorum disk to be used in spite of transient errors with the quorum disk hardware.
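Several of the items above are identified by ics_socket_event warnings on the console. As a rough aid for checking whether a saved console log contains them, the following minimal Python sketch counts those warnings by error number and channel. The function name, the sample lines, and the node number 2 in the sample are hypothetical and only illustrate the message format quoted above; the script is not part of the patch kit.

```python
import re
from collections import Counter

# Matches the ICS socket warnings quoted in these notes, for example:
#   WARNING: ics_socket_event: error 60 on channel 0
ICS_WARNING = re.compile(
    r"WARNING: ics_socket_event: error (?P<code>\d+) on channel (?P<channel>\d+)"
)


def count_ics_warnings(lines):
    """Return a Counter keyed by (error code, channel) for matching lines."""
    counts = Counter()
    for line in lines:
        match = ICS_WARNING.search(line)
        if match:
            counts[(int(match["code"]), int(match["channel"]))] += 1
    return counts


# Hypothetical console excerpt; the node number is made up for illustration.
sample = [
    "WARNING: ics_socket_event: error 54 on channel 0",
    "WARNING: ics_socket_event: error 60 on channel 0, assume node 2 is down",
    "some unrelated console line",
    "WARNING: ics_socket_event: error 60 on channel 0",
]

summary = count_ics_warnings(sample)
for (code, channel), n in sorted(summary.items()):
    print(f"error {code} on channel {channel}: {n} occurrence(s)")
```

To scan a real log, pass an open file object instead of the sample list, since the function accepts any iterable of lines.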
4.2 Summary of TruCluster Server Software Patches