AVX512BW: Enable packed word shift for 512bit vector. Enable lowering scalar immidiate shift v64i8 .Fix predicate for AVX1/2 shifts.

Differential Revision: http://reviews.llvm.org/D15713

llvm-svn: 256324
13 files changed