[AMDGPU] Scalarization of global uniform loads. Summary: LC can currently select scalar load for uniform memory access basing on readonly memory address space only. This restriction originated from the fact that in HW prior to VI vector and scalar caches are not coherent. With MemoryDependenceAnalysis we can check that the memory location corresponding to the memory operand of the LOAD is not clobbered along the all paths from the function entry. Reviewers: rampitec, tstellarAMD, arsenm Subscribers: wdng, arsenm, nhaehnle Differential Revision: https://reviews.llvm.org/D26917 llvm-svn: 289076

commit: 18009560c59deceb2be577e0182f7016d6ee1121 [log] [tgz]
author: Alexander Timofeev <Alexander.Timofeev@amd.com> Thu Dec 08 17:28:47 2016 +0000
committer: Alexander Timofeev <Alexander.Timofeev@amd.com> Thu Dec 08 17:28:47 2016 +0000
tree: 05c51c8ad3b3de4f579ea884037cded642bec1b2
parent: eebed6229ad000a34959f4a9794b9e009eb0227d [diff] [blame]
diff --git a/llvm/lib/Target/AMDGPU/AMDGPUSubtarget.h b/llvm/lib/Target/AMDGPU/AMDGPUSubtarget.h
index 842711b0..939d137 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUSubtarget.h
+++ b/llvm/lib/Target/AMDGPU/AMDGPUSubtarget.h

@@ -114,6 +114,7 @@
   bool CFALUBug;
   bool HasVertexCache;
   short TexVTXClauseSize;
+  bool ScalarizeGlobal;
 
   // Dummy feature to use for assembler in tablegen.
   bool FeatureDisable;
@@ -401,6 +402,9 @@
     return alignTo(FlatWorkGroupSize, getWavefrontSize()) / getWavefrontSize();
   }
 
+  void setScalarizeGlobalBehavior(bool b) { ScalarizeGlobal = b;}
+  bool getScalarizeGlobalBehavior() const { return ScalarizeGlobal;}
+
   /// \returns Subtarget's default pair of minimum/maximum flat work group sizes
   /// for function \p F, or minimum/maximum flat work group sizes explicitly
   /// requested using "amdgpu-flat-work-group-size" attribute attached to
commit	18009560c59deceb2be577e0182f7016d6ee1121	[log] [tgz]
author	Alexander Timofeev <Alexander.Timofeev@amd.com>	Thu Dec 08 17:28:47 2016 +0000
committer	Alexander Timofeev <Alexander.Timofeev@amd.com>	Thu Dec 08 17:28:47 2016 +0000
tree	05c51c8ad3b3de4f579ea884037cded642bec1b2
parent	eebed6229ad000a34959f4a9794b9e009eb0227d [diff] [blame]