[CostModel][X86] Include the cost of 256-bit upper subvector extract/insertion in AVX1 v4i64 MUL Matches other MUL/ADD/SUB 256-bit case on AVX1 llvm-svn: 291149

commit: b01e84424138cdad032a1f7f0cc097c04af22028 [log] [tgz]
author: Simon Pilgrim <llvm-dev@redking.me.uk> Thu Jan 05 18:20:25 2017 +0000
committer: Simon Pilgrim <llvm-dev@redking.me.uk> Thu Jan 05 18:20:25 2017 +0000
tree: 7b6aa48ded34d5492a27f276b2a49fec44863954
parent: e9987a1d2fad218f2da3c78a8bee60ecc87262d9 [diff] [blame]
diff --git a/llvm/lib/Target/X86/X86TargetTransformInfo.cpp b/llvm/lib/Target/X86/X86TargetTransformInfo.cpp
index a5958f5..719f7e7 100644
--- a/llvm/lib/Target/X86/X86TargetTransformInfo.cpp
+++ b/llvm/lib/Target/X86/X86TargetTransformInfo.cpp

@@ -556,9 +556,9 @@
     // A v4i64 multiply is custom lowered as two split v2i64 vectors that then
     // are lowered as a series of long multiplies(3), shifts(3) and adds(2)
     // Because we believe v4i64 to be a legal type, we must also include the
-    // split factor of two in the cost table. Therefore, the cost here is 16
+    // extract+insert in the cost table. Therefore, the cost here is 18
     // instead of 8.
-    { ISD::MUL,     MVT::v4i64,    16 },
+    { ISD::MUL,     MVT::v4i64,    18 },
   };
 
   // Look for AVX1 lowering tricks.
commit	b01e84424138cdad032a1f7f0cc097c04af22028	[log] [tgz]
author	Simon Pilgrim <llvm-dev@redking.me.uk>	Thu Jan 05 18:20:25 2017 +0000
committer	Simon Pilgrim <llvm-dev@redking.me.uk>	Thu Jan 05 18:20:25 2017 +0000
tree	7b6aa48ded34d5492a27f276b2a49fec44863954
parent	e9987a1d2fad218f2da3c78a8bee60ecc87262d9 [diff] [blame]