Optimize x86_64 TLAB allocation speed

Added assembly fast paths for region TLAB object allocation in the
resolved and initialized entrypoints. Removed 3 instructions from the
TLAB fast path.
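
For readers unfamiliar with TLAB allocation, the fast path is a
bump-pointer allocation out of a thread-local buffer. The following is
only a minimal C++ sketch of that general technique, not the assembly
added by this change. The Tlab struct, its field names, kAlignment and
TlabAllocFast are hypothetical and do not reflect ART's actual layout or
entrypoint names.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical, simplified thread-local allocation buffer.
// Field names are illustrative only.
struct Tlab {
  uint8_t* pos;    // Next free byte in the buffer.
  uint8_t* end;    // One past the last usable byte.
  size_t objects;  // Number of objects allocated from this buffer.
};

// Assumed object alignment for this sketch.
constexpr size_t kAlignment = 8;

inline size_t RoundUp(size_t n) {
  return (n + kAlignment - 1) & ~(kAlignment - 1);
}

// Bump-pointer fast path. Returns nullptr when the request does not fit,
// so the caller can fall back to a slow path (not shown here).
inline void* TlabAllocFast(Tlab* tlab, size_t byte_count) {
  assert(tlab->pos <= tlab->end);
  const size_t size = RoundUp(byte_count);
  const size_t remaining = static_cast<size_t>(tlab->end - tlab->pos);
  if (size > remaining) {
    return nullptr;  // Not enough room left in the buffer.
  }
  uint8_t* result = tlab->pos;
  tlab->pos += size;  // Bump the pointer past the new object.
  ++tlab->objects;
  return result;
}
```

The common case is one compare, one add, and a couple of stores, which
is why trimming even a few instructions from this path can matter.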

Added assembly fast paths for the region TLAB array allocators.

Further speedups are expected for the read barrier fast paths in the
resolved and initialized alloc entrypoints.

Bug: 30162165

Test: test-art-host CC baker

Change-Id: I64dd06be5f18c8d6a5de0f15f0e2e7d488e99f18
1 file changed