Fix implementation of vload with [u]long3/4 and double3/4.

Bug: 18380209

The hand-written bitcode for these functions accidentally returned <3 x i64> or
<4 x i64> (or the double equivalents) directly by value. The ABI requires such
a value to be returned on the stack instead, so we transform the bitcode to do
exactly that.
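
For illustration only, a minimal C sketch of what a stack return looks like from
the callee's side. The type and function names here (long4_t, load_long4_sret)
are hypothetical and are not the names used in the bitcode; the sketch assumes
the usual lowering, where the caller allocates the result and passes its
address as an extra leading argument.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a 4 x i64 vector (32 bytes). */
typedef struct { int64_t s[4]; } long4_t;

/*
 * Stack-return form: rather than handing the vector back by value, the
 * callee writes it into caller-owned memory through `ret`. Both the name
 * and the signature are assumptions made for this sketch.
 */
static void load_long4_sret(long4_t *ret, const int64_t *src) {
    assert(ret != NULL && src != NULL);
    /* Copy four consecutive 64-bit elements into the caller's slot. */
    memcpy(ret->s, src, sizeof ret->s);
}
```

A caller would declare a long4_t on its own stack and pass its address, so no
oversized vector ever crosses the call boundary as a return value.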

Change-Id: I2be489b23bf639b16d8762a11a8430f40ea5b16c
(cherry picked from commit b9675775b030b187b8528cba2d8e0e5c0a7bf8f7)
1 file changed