19963231a3245358c0e8fdd74c4654761e62b6c8 - platform/external/mesa3d

commit	19963231a3245358c0e8fdd74c4654761e62b6c8	[log] [tgz]
author	Ilia Mirkin <imirkin@alum.mit.edu>	Fri Jan 13 00:41:11 2017 -0500
committer	Ilia Mirkin <imirkin@alum.mit.edu>	Mon Jan 16 21:13:09 2017 -0500
tree	34a60bbd334aff116e3062d11d54e83404fff109
parent	5ba380c226b127cbfad00dd647471e1518ba2cb2 [diff]

nv50/ir: optimize shl + and

Address loading can often end up as shl + shr + shl combinations. The
latter two are equal shifts, which get converted into an and mask.
However if the previous shl is more than the mask is trying to remove
(in terms of low bits), we can just remove the and entirely. This
reduces some large shaders by as many as 3% of instructions (out of 2K).

total instructions in shared programs : 6495509 -> 6491076 (-0.07%)
total gprs used in shared programs    : 954621 -> 954623 (0.00%)

                local        gpr       inst      bytes
    helped           0           0        1014        1014
      hurt           0           2           0           0

Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>

src/gallium/drivers/nouveau/codegen/nv50_ir_peephole.cpp[diff]

1 file changed

tree: 34a60bbd334aff116e3062d11d54e83404fff109