Apply deemphasis to both channels in the same loop for the simple case

This makes the decoder ~2.5% faster on x86 because the stereo loop
takes the same processing time as one mono loop due to the dependency chain
1 file changed