This patch enables the scheduler to pick up jobs that were left running after it crashes, and see them to completion.

-autoserv writes .autoserv_execute file with pid + exit status
-PidfileRunMonitor class in monitor_db uses this file to track running autoserv processes, double-checks with /proc filesystem
-monitor_db default recovery changed to
  -pick up running QueueEntry processes
  -kill other autoserv processes
  -requeue verifying queue entries
  -reverify hosts that were left in verify/repair
-monitor_db optional host recovery flag verifies Repair Failed hosts, and looks for running hosts with no corresponding active queue entries

The --no-recover scheduler option has been removed.  The scheduler always recovers running jobs when starting up.  There is a new --recover-hosts options, which forces the scheduler to reverify all dead hosts.  Shutdown mode has also been removed from the scheduler, since the scheduler may now be upgraded by simply killing and restarting it.

Signed-off-by: Steve Howard <showard@google.com>



git-svn-id: http://test.kernel.org/svn/autotest/trunk@1332 592f7852-d20e-0410-864c-8624ca9c26a4
3 files changed