Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
Name: warewulf-nhc | Distribution: openSUSE Tumbleweed |
Version: 1.4.3 | Vendor: openSUSE |
Release: 1.4 | Build date: Thu Nov 16 20:22:58 2023 |
Group: Productivity/Clustering/Computing | Build host: reproducible |
Size: 227123 | Source RPM: warewulf-nhc-1.4.3-1.4.src.rpm |
Packager: http://bugs.opensuse.org | |
Url: http://warewulf.lbl.gov/trac | |
Summary: Warewulf Node Health Check (NHC) |
Warewulf Node Health Check (NHC) is a periodic "node health check" script to be executed on each compute node to verify that the node is working properly. Nodes which are determined to be "unhealthy" can be marked as down or offline so as to prevent jobs from being scheduled or run on them. This helps increase the reliability and throughput of a cluster by reducing preventable job failures due to misconfiguration, hardware failures, etc.
BSD-3-Clause
* Thu Nov 16 2023 Christian Goll <cgoll@suse.com> - updated to 1.4.3 with following new features: * toggle BASH tracing or NHC debugging via SIGUSR1/SIGUSR2, respectively * check_nvsmi_healthmon(): New check from CSC for GPU health monitoring via nvidia-smi * Provide added detail to tracing info (-x mode) * Based on feedback from Moe Jette of SchedMD, pull node job data directly from Slurm via squeue instead of the previous method that only worked for single-node jobs. * Support for recent additions to the Slurm node states (e.g., "planned") * Pathname expansion has been disabled on startup, and re-enabled only when being actively used, to avoid "unintended" expansions of wildcards at random points throughout the code. * Correct clobbering of BASH built-in variables and add tests to prevent future recurrence * Switch "system UID" boundary handling to a more accurate source of truth, and ensure that the code matches the math, naming, and intent. * Reorder resource manager detection to improve accurate detection, especially with respect to Slurm vs. PBS (all variants) - removed test-test_lbnl_file.nhc-Put-all-process-substitution.patch * Fri Mar 20 2020 Christian Goll <cgoll@suse.com> - updated to 1.4.2 with following new features: * Support for negating *any* match string anywhere * check_net_ping(): New check for monitoring of network connectivity * check_ps_*(): Process owner parameters now accept match strings * check_cmd_dmesg(): New check to validate/verify or catch/flag * check_fs_mount(): Create missing mount points as necessary * New command-line flag: "-e <check>" will override config file, - added patch to fix error during test phase * test-test_lbnl_file.nhc-Put-all-process-substitution.patch * Tue Feb 13 2018 cgoll@suse.com - version 1.4.1 * Too many changes. See ChangeLog file for details * Fri Nov 16 2012 scorot@free.fr - version 1.2.1 * Too many changes. See ChangeLog file for details * Sat Oct 20 2012 scorot@free.fr - disable noarch for SLE 11 * Sat Oct 20 2012 scorot@free.fr - fix typo in spec file * Wed Aug 15 2012 scorot@free.fr - package is noarch - fix Url * Sun Jun 10 2012 scorot@free.fr - first package - version 1.1.4
/etc/logrotate.d/warewulf-nhc /etc/nhc /etc/nhc/nhc.conf /etc/nhc/scripts /etc/nhc/scripts/common.nhc /etc/nhc/scripts/lbnl_cmd.nhc /etc/nhc/scripts/lbnl_dmi.nhc /etc/nhc/scripts/lbnl_file.nhc /etc/nhc/scripts/lbnl_fs.nhc /etc/nhc/scripts/lbnl_hw.nhc /etc/nhc/scripts/lbnl_job.nhc /etc/nhc/scripts/lbnl_moab.nhc /etc/nhc/scripts/lbnl_net.nhc /etc/nhc/scripts/lbnl_nv.nhc /etc/nhc/scripts/lbnl_ps.nhc /usr/lib/tmpfiles.d/warewulf-nhc.conf /usr/libexec/nhc /usr/libexec/nhc/node-mark-offline /usr/libexec/nhc/node-mark-online /usr/sbin/nhc /usr/sbin/nhc-genconf /usr/sbin/nhc-wrapper /usr/share/doc/packages/warewulf-nhc /usr/share/doc/packages/warewulf-nhc/nhc.conf /usr/share/doc/packages/warewulf-nhc/nhc.cron /usr/share/licenses/warewulf-nhc /usr/share/licenses/warewulf-nhc/COPYING /usr/share/licenses/warewulf-nhc/LICENSE /var/lib/nhc
Generated by rpm2html 1.8.1
Fabrice Bellet, Sun Feb 9 01:37:00 2025