The only limit I've hit so far is really a raw fp performance. Given that
my demo is pretty simple, and collisions are mostly 50/50 sphere-sphere
and sphere-box, I still spend more than 20% of CPU time in the code
that was flying in C++. Same is for visibility calculations, terrain runtime updates, particles, etc. For example, I ended up ...