Merge "Support caption second UI structure (5/n)" into rvc-dev