| commit | ef67d768691f7dd2c69f455f5d4479608eba792f | [log] [tgz] |
|---|---|---|
| author | Matt Arsenault <Matthew.Arsenault@amd.com> | Wed Sep 09 17:03:29 2015 +0000 |
| committer | Matt Arsenault <Matthew.Arsenault@amd.com> | Wed Sep 09 17:03:29 2015 +0000 |
| tree | 5e9f44b0964fc5a398ab7bf72fdb16ec2c9b2e9c | |
| parent | b5541fb09840ca8d5b3a22d46ff163695c95223e [diff] |
AMDGPU: Extract full 64-bit subregister and use subregs Instead of extracting both 32-bit components from the 128-bit register. This produces fewer copies and is easier for the copy peephole optimizer to understand and see the actual uses as extracts from a reg_sequence. This avoids needing to handle subregister composing in the PeepholeOptimizer's ValueTracker for this case. llvm-svn: 247162