I'm talking about the scalability of actually delivering new code to servers (or "serverless" runtimes). Feature flags don't help with that.
Admittedly, it isn't a scalability problem you will run into right away.
But when you need to roll out an emergency fix, there is a big difference between deploying to thousands of servers that all have everything, and ten servers running a single service.