Unfortunately, there is no simple switch/way to do this. This old quote from Multicore Faq still is true:
If you have 10 VSTs running and nine of them use almost no CPU power, but one uses most of it, Renoise can’t magically make the heavy VST faster. Only the VST itself could do that.
Also, to create “independent streams” you can’t split tracks that feed into each other (-> groups/sends), so some tracks have to be computed on a CPU to keep the signal path connected.
I’m sure there will be further optimisations here and there, but we need concrete examples and ideas (songs/setups where the current CPU handling seems to be bad). Then we could see what’s going on here and how we can improve things.