radeonsi: properly compute an LS-HS thread group size limit

"64 / max * 4" is less than "64 * 4 / max".

Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>
1 file changed