9de4c2c4d2f08e5e30b6d1c913ad64cb8da56b0b - platform/external/libchrome

commit	9de4c2c4d2f08e5e30b6d1c913ad64cb8da56b0b	[log] [tgz]
author	skyostil@chromium.org <skyostil@chromium.org@0039d316-1c4b-4281-b951-d872f2087c98>	Fri May 30 04:02:20 2014 +0900
committer	Qijiang Fan <fqj@google.com>	Thu Jun 04 16:47:38 2020 +0900
tree	3ab63a1d35e6a007bba24483b24d9df821859d1b
parent	5dca6d0e40dc47e36bd146ae9e3e5f7d5e09de2d [diff]

Fix silent failures in test runner that lead to long timeouts Previously the test runner was using pexpect with a timeout to run the test script. The problem with this is that it can cause the test runner to hang if the child test script exits while still having live child processes of its own. The exact sequence of operations leading to the failure is: 1. TestRunner calls pexpect.run('test_script', timeout, ...). 2. The test script spawns at least one sub process (e.g., the host forwarder), which is attached to the same stdout as the test script itself. 3. Pexpect calls select on the test script's stdout file descriptor to receive output from the script. This is a blocking call. 4. The test script dies for some reason (e.g., disconnected device), failing to clean up all the sub processes it created in step 2. 5. Because the sub processes still have a reference to the (now-dead) test script's stdout, the select call in step #3 is not unblocked, so it hangs until the timeout fires (1.5h by default). The root cause of the problem is that the test script's sub processes outlive their parent (which is standard Unix behavior). In order to fix this we would have to change all the known daemons spawned by every test script to be proper daemons that release the controlling terminal and make the test script kill all of its subprocesses when exiting (i.e., prctl(PR_SET_PDEATHSIG, TERM)). Because the above fix involves a large number of changes and is a little fragile in the sense that we may miss updating some scripts by mistake, this patch takes a defensive approach. Namely, we fix the problem by replacing the use of pexpect with a custom loop which combines the call to select() as well as polling for the exit status of the test script. The call to select uses a smaller timeout (1s), so we can detect a dying test script almost immediately. This patch also simplifies some existing code in cmd_helper. BUG=355952 TEST=src/build/android/test_runner.py perf -v --release --single-step \ --src/tools/perf/run_benchmark -v --output-format=buildbot \ --report-root-metrics --browser=android-chromium-testshell \ rasterize_and_record_micro.key_mobile_sites Review URL: https://codereview.chromium.org/300063017 git-svn-id: svn://svn.chromium.org/chrome/trunk/src@273530 0039d316-1c4b-4281-b951-d872f2087c98 CrOS-Libchrome-Original-Commit: ea32361631c5fcb246f2df0b16855552706790aa