This morning at about 06:00 our Site BDII stopped working. This was
traced to the slapd BDII process which had died.
[root@srvslngrd002 ~]# /etc/init.d/bdii status
BDII slapd PID file exists but the process died [FAILED]
This has since been restarted and our site is advertising again. This
has been an ongoing issue (failing once every 3 or so months) for about
a year now.
The Site BDII advertises services and availability of a Grid Site's cluster and software. When the process is not running no jobs will be sent to the site, even if the site is still functional.