[CUDA] Added --[no-]cuda-include-ptx=sm_XX|all option.

Currently we always include PTX in the fatbin along with the GPU code. Including it roughly doubles the size of the GPU binary we need to carry in the executable. These options allow control over the inclusion of PTX in the GPU binary.

This patch does not change the defaults, though we may consider making no-PTX the default in the future.

Differential Revision: https://reviews.llvm.org/D45495

llvm-svn: 329737
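
To illustrate how repeated options might combine, here is a minimal Python model of the per-architecture decision. It is a sketch, not the driver's code. The default (PTX included for all architectures) comes from the documentation added in this patch. The rule that options are applied in command-line order, with the last one matching an architecture winning, is an assumption. The helper names `parse_ptx_options` and `includes_ptx` are hypothetical.

```python
INCLUDE_PREFIX = "--cuda-include-ptx="
EXCLUDE_PREFIX = "--no-cuda-include-ptx="


def parse_ptx_options(argv):
    """Extract (include, arch) pairs from argv, preserving command-line order."""
    options = []
    for arg in argv:
        if arg.startswith(INCLUDE_PREFIX):
            options.append((True, arg[len(INCLUDE_PREFIX):]))
        elif arg.startswith(EXCLUDE_PREFIX):
            options.append((False, arg[len(EXCLUDE_PREFIX):]))
    return options


def includes_ptx(gpu_arch, options):
    """Return True if PTX for gpu_arch would be embedded in the GPU binary."""
    include = True  # documented default: --cuda-include-ptx=all
    for flag, arch in options:
        # Assumption: a later matching option overrides an earlier one.
        if arch == "all" or arch == gpu_arch:
            include = flag
    return include


# Drop PTX everywhere, then re-enable it for sm_60 only.
argv = ["--no-cuda-include-ptx=all", "--cuda-include-ptx=sm_60"]
opts = parse_ptx_options(argv)
print(includes_ptx("sm_35", opts))
print(includes_ptx("sm_60", opts))
```

Under these assumptions, the example keeps PTX only for sm_60, which trims the binary for the other architectures while leaving one PTX copy available.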

commit: dde3dc27ee71f12eb145ce54158779ab4ddc38ed
author: Artem Belevich <tra@google.com> Tue Apr 10 18:38:22 2018 +0000
committer: Artem Belevich <tra@google.com> Tue Apr 10 18:38:22 2018 +0000
tree: eaccb2951afc4f620ca671f5b3cc153ef8909325
parent: 5da361a0b0f5e361c8285acdf9c9cd5d417416f9
diff --git a/clang/docs/ClangCommandLineReference.rst b/clang/docs/ClangCommandLineReference.rst
index 8e9ef84..67eb56a 100644
--- a/clang/docs/ClangCommandLineReference.rst
+++ b/clang/docs/ClangCommandLineReference.rst

@@ -144,6 +144,10 @@
 
 CUDA GPU architecture (e.g. sm\_35).  May be specified more than once.
 
+.. option:: --cuda-include-ptx=<arg>, --no-cuda-include-ptx=<arg>
+
+Include (or not) PTX along with CUDA GPU binary for the given architecture (e.g. sm\_35). Argument may be 'all'. The option may be specified more than once. Default: --cuda-include-ptx=all
+
 .. option:: --cuda-host-only
 
 Compile CUDA code for host only.  Has no effect on non-CUDA compilations.