One more AI experiment this weekend.
Had it build a puzzle game that is a bit reminiscent of Tetris.
Fed several pages of detailed spec into the AI (Google Gemini 2.5 Pro) and got a working Phaser/JS game after the second "now build that part that you marked as TODO" prompt. Fixed some misunderstandings fast. Nearly despaired over the seemingly trivial rewrite of a high-score scene into a highest-level-reached scene. Claude 3.7 did a bit better, but not much.
My conclusions:
- Generating small apps works nearly perfectly by now (and the size limit increases with every LLM version).
- Refactoring is still an issue, particularly when it comes to UI design / layout / styling.
- Without dev skills, it is very hard to create an app that does what you want.