Separately track input and output denormal mode

AMDGPU and x86, at least, both have separate controls for whether denormal results are flushed on output and for whether denormals are implicitly treated as 0 on input. The current DAGCombiner use only really cares about the input treatment of denormals.
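As an illustration of why the two settings are tracked separately, here is a minimal sketch. The names `DenormalModeSketch`, `flush`, `modelMul`, and `example` are hypothetical and are not LLVM's API. It only models the idea that operands are handled according to an input mode and results according to an output mode.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical stand-in for illustration only; not LLVM's llvm::DenormalMode.
struct DenormalModeSketch {
  enum Kind { IEEE, PreserveSign, PositiveZero };
  Kind Output = IEEE; // how denormal results are produced
  Kind Input = IEEE;  // how denormal operands are read

  // True when denormal operands are implicitly treated as zero.
  bool inputsAreZero() const { return Input != IEEE; }
};

// Apply one mode to a value: keep it under IEEE, otherwise flush denormals
// to zero (sign-preserving or positive, depending on the mode).
inline float flush(float X, DenormalModeSketch::Kind K) {
  if (K == DenormalModeSketch::IEEE || std::fpclassify(X) != FP_SUBNORMAL)
    return X;
  return K == DenormalModeSketch::PreserveSign ? std::copysign(0.0f, X) : 0.0f;
}

// Model a multiply: operands follow Input, the result follows Output.
inline float modelMul(float A, float B, DenormalModeSketch M) {
  return flush(flush(A, M.Input) * flush(B, M.Input), M.Output);
}

// Usage: denormal inputs are flushed while the output mode stays IEEE.
inline void example() {
  DenormalModeSketch M;
  M.Input = DenormalModeSketch::PreserveSign;
  float D = std::numeric_limits<float>::denorm_min();
  assert(modelMul(D, 1.0f, M) == 0.0f);
}
```

In this sketch, a combine that asks "may this operand be a denormal?" would consult only `Input`, which matches the commit message's point that the DAGCombiner use cares about input treatment.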

commit: a3c814d23497bc71b8ed53c35f773366aff02922
author: Matt Arsenault <Matthew.Arsenault@amd.com> Wed Nov 06 17:10:52 2019 -0800
committer: Matt Arsenault <arsenm2@gmail.com> Tue Feb 04 12:59:21 2020 -0500
tree: d6e42ab8a9b6747d7eeaf674484184c22e265702
parent: fce1eefb467e2bc3cd737ce78386e4970beefb7a
diff --git a/clang/lib/Basic/Targets/AMDGPU.cpp b/clang/lib/Basic/Targets/AMDGPU.cpp
index 0aaf681..a34d3d8 100644
--- a/clang/lib/Basic/Targets/AMDGPU.cpp
+++ b/clang/lib/Basic/Targets/AMDGPU.cpp
@@ -247,7 +247,7 @@
   if (!hasFP32Denormals)
     TargetOpts.Features.push_back(
       (Twine(hasFastFMAF() && hasFullRateDenormalsF32() &&
-             CGOpts.FP32DenormalMode == llvm::DenormalMode::IEEE
+             CGOpts.FP32DenormalMode.Output == llvm::DenormalMode::IEEE
              ? '+' : '-') + Twine("fp32-denormals"))
             .str());
   // Always do not flush fp64 or fp16 denorms.
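
The condition in the hunk above can be read in isolation as follows. This is a hedged sketch with hypothetical names (`DenormKind`, `fp32DenormalsFeature`) and plain `bool` parameters standing in for the target queries. It is not the code in AMDGPU.cpp, only the same decision expressed standalone: the feature is enabled only when both hardware conditions hold and the output mode is IEEE.

```cpp
#include <string>

// Hypothetical enum for illustration; stands in for the output-mode field.
enum class DenormKind { IEEE, PreserveSign, PositiveZero };

// Hypothetical helper: builds "+fp32-denormals" or "-fp32-denormals"
// from the two capability flags and the requested output denormal mode.
inline std::string fp32DenormalsFeature(bool FastFMAF,
                                        bool FullRateDenormalsF32,
                                        DenormKind Output) {
  bool Enable =
      FastFMAF && FullRateDenormalsF32 && Output == DenormKind::IEEE;
  return std::string(1, Enable ? '+' : '-') + "fp32-denormals";
}
```

Only the output mode appears in this decision, which is why the changed line reads the `.Output` field rather than comparing the whole mode.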