Update nhc.conf to include correct gpfs5 and gpfs4 node ranges post tic2dcb move
The nhc.conf needs to be updated to use the new ranges for gpfs4 and gpfs5 clients so that the mount tests succeed.
The following gpfs5 are in drain because the range is incorrect. This triggering a drain state and is impacting node availability in particular A100 nodes.
[root@cheaha-master01 user]# scontrol show nodes=c[0205,0220,0252-253,0255] | egrep 'NodeName| State=|Reason'
NodeName=c0205 Arch=x86_64 CoresPerSocket=64
State=IDLE ThreadsPerCore=1 TmpDisk=1077994 Weight=1 Owner=N/A MCS_label=N/A
NodeName=c0220 Arch=x86_64 CoresPerSocket=64
State=IDLE+DRAIN ThreadsPerCore=1 TmpDisk=1077994 Weight=1 Owner=N/A MCS_label=N/A
Reason=GPFS5ADD: c0220 - Add to GPFS5 post TIC to DCB move [root@2025-06-26T11:42:46]
NodeName=c0252 Arch=x86_64 CoresPerSocket=64
State=RESERVED+DRAIN ThreadsPerCore=1 TmpDisk=6102416 Weight=1 Owner=N/A MCS_label=N/A
Reason=NHC: check_hw_ib: No IB port is ACTIVE (LinkUp 100 Gb/sec). [root@2025-07-10T18:22:52]
NodeName=c0253 Arch=x86_64 CoresPerSocket=64
State=RESERVED+DRAIN ThreadsPerCore=1 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
Reason=NHC: check_hw_ib: No IB port is ACTIVE (LinkUp 100 Gb/sec). [root@2025-07-10T18:22:52]
NodeName=c0255 Arch=x86_64 CoresPerSocket=64
State=RESERVED+DRAIN ThreadsPerCore=1 TmpDisk=6102416 Weight=1 Owner=N/A MCS_label=N/A
Reason=NHC: check_fs_mount: /data not mounted [root@2025-07-10T18:22:53]
The updated gpfs5 range is c0[194-223,252-253,255].