1. 20f4706 -check ACLs directly in the scheduler (bypassing ineligible_host_queues) by showard · 16 years ago
  2. b751751 Add __init__.py file. by showard · 16 years ago
  3. 04c82c5 Rewrite scheduling algorithm to use two queries + some data processing, rather than a separate query for each "idle" host. This should be considerably faster. It also gives us the opportunity to eliminate the whole ACL checking with ineligible_host_queues thing, which has been a nightmare. But one step at a time... by showard · 16 years ago
  4. ce38e0c The beginning of a unit test for the scheduler. Right now it only tests the job scheduling algorithm (i.e. Dispatcher._find_more_work() and the methods it uses). by showard · 16 years ago
  5. 30eed1f A bit of refactoring to monitor_db.py to clean up some code and make it more testable. by showard · 16 years ago
  6. 93ff7ea Rename monitor_db to monitor_db.py. This makes it import-able, which is necessary for unit testing. by showard · 16 years ago
  7. 5492748 Add distinct to query to cut time spent in half by showard · 16 years ago
  8. 57881ee It occurred to me that because of the change to batch up emails, if an exception occurs that kills the scheduler, it wouldn't send out the email. Fixed that. by showard · 16 years ago
  9. c2ac77f Risk: Medium by jadmanski · 16 years ago
  10. 7cf9a9b Batch up notification emails within a single tick, and send em out all together. by showard · 16 years ago
  11. ec11316 -make scheduler monitor number of running tasks and keep it limited to some maximum, set in global config by showard · 16 years ago
  12. a093972 Every time we modify ACLs we have to recompute ineligible host queues. We can't do that by deleting the old ones and then writing the new ones, since there would be a moment when the hosts are unprotected. So instead we write the new ones and then delete the old ones, which leaves a moment when there might be duplicate ineligible_host_queues. This is harmless, but the scheduler was asserting that there were never duplicates (just for safety I guess, since that used to be true), so I removed the assertion and made the code handle duplicates. by showard · 16 years ago
  13. e44a46d notify_email is a global config parameter which monitor_db reads out. by mbligh · 16 years ago
  14. 62ba2ed -include acl-inaccessible hosts in ineligible_host_queues blocks. by mbligh · 16 years ago
  15. 5244cbb Never delete hosts or labels. Instead, mark them as invalid. by mbligh · 16 years ago
  16. 6437ff5 Use the new parser library directly inside of autoserv, instead of by mbligh · 16 years ago
  17. cadb353 Fix bug in logging to host logs. by mbligh · 16 years ago
  18. d64e570 Tested by scheduling jobs against machines that had Repair Failed status and aborting jobs to cause RebootTask to be called and observing that tmp directories get created and deleted, and no directories are created in root autotest directory. by mbligh · 16 years ago
  19. 1b87bc5 Modify all the common.py to set up an autotest_lib.* namespace as well by mbligh · 16 years ago
  20. 90a549d -when no pidfile is found by PidfileRunMonitor, just wait, and after a timeout, send email and act as if process failed by mbligh · 16 years ago
  21. b03ba64 Patch to reduce the rate of reparse, and to make the parser locking by mbligh · 16 years ago
  22. 4eb2df2 Specify a boolean string converter for MySQLdb. Some older versions of MySQLdb do not include this, and it breaks monitor_db. by mbligh · 16 years ago
  23. bb42185 This patch enables the scheduler to pick up jobs that were left running after it crashes, and see them to completion. by mbligh · 16 years ago
  24. 104e9ce This fixes some issues with global_config relating to the fact by mbligh · 16 years ago
  25. 38c2d03 Changed sighandler in autoserv to call SIGKILL on its children instead of SIGTERM. by mbligh · 16 years ago
  26. dbdac6c Continuously reparse the status logs whenever new logs are written out by mbligh · 16 years ago
  27. 16c722d Remove ReverifyTask altogether, and trust the return code of autoserv repair (which does a reverify itself). The flowchart on the wiki is updated. by mbligh · 16 years ago
  28. d5c9580 Implemented abort functionality in scheduler. by mbligh · 16 years ago
  29. dffd637 scheduler release hosts by mbligh · 16 years ago
  30. 48c10a5 We don't want to pass -n to autoserv for host-specific tasks (verify, repair, re by mbligh · 16 years ago
  31. e258668 Verify repair fixes for scheduler by mbligh · 16 years ago
  32. 6f8bab4 Catch any errors due to mysql losing its connection. If it does lose by mbligh · 16 years ago
  33. 8ce2c4a On verify failure for a synch job, stop all other queue_entries, not just active ones. by mbligh · 16 years ago
  34. 4314a71 Testing mode support for the scheduler. In testing mode, the scheduler runs a dummy autoserv script and doesn't try to parse results. This is part of an ongoing project to create an automated scheduler test. by mbligh · 16 years ago
  35. b090f14 more on global config by mbligh · 16 years ago
  36. 36768f0 add missing monitor_db code by mbligh · 16 years ago
  37. c40fa92 Change ConmuxSSHHost to SSHHost by mbligh · 17 years ago
  38. 6203ace Split stdout and stderr for monitor_queue into separate files, by mbligh · 17 years ago
  39. b7ef301 Comment fix to monitor queue plus ignore non-dirs in queue dir by mbligh · 17 years ago
  40. af0b811 Make scheduler stuff executable by mbligh · 17 years ago
  41. 8dcb745 Various fixes and updates for monitor_queue by mbligh · 17 years ago
  42. a4649af kill existing monitor queues before restarting by mbligh · 17 years ago
  43. 88d3256 Queues now have a .machines file by mbligh · 17 years ago
  44. dcc0499 The new monitor_queue script works much like the old one but now by mbligh · 17 years ago