I've been using Claude and it's a game changer in my day to day. The caveat bein...

I've been using Claude and it's a game changer in my day to day. The caveat being of course that my tasks at a small "feature" level and all interactions are supervised. I see no evidence that this is going to change soon...

My other thought, that I can't articulate that well is....what about testing? Sure LLMs can generate tons of code but so what? If your two sentence prompt is for a tiny feature that's one thing. If you ask Claude to "build me a todo system" the results will likely rapidly diverge from what you're expecting. The specification for the system is the code, right? I just don't see how this can scale.