I used Kiro.dev for 5 days to complete my hackathon project (analyzing GitHub repositories).
My quick eval: Kiro feels like a below mid-level dev.
- Coding skills: mid-level
- Engineering skills: below mid
- Discipline: below mid
How Kiro stacks up vs a senior dev
Here’s how Kiro stacks up vs. a senior dev (based on my 25 years in software):
| What Kiro does | What a senior does | Ideas to fix Kiro |
|—-|—-|—-|
| Chooses a random repo with 100+ forks | Chooses a few smaller repos | Ask Kiro to research repos suitable for testing |
| Implements one big, long-running command (multiple requests per fork) | Plans smaller steps: show-info, list-forks. Tests step by step → sees most forks are empty and skips them | Ask to force task decomposition (Kiro resists) and break all processes into smaller steps |
| Plans redundant, unnecessary, undisclosed features | Plans only what’s needed | Ask to stay minimal when planning features |
| Rewrites raw data into vague, emotional, emoji-heavy text | Reports raw data as is | Ask to transmit entity names/data directly, without rephrasing |
| Loses insights during planning/implementation | Keeps track of all key details | Summaries + separate notes sessions |
| Starts coding immediately during “specs” discussion | Plans first | Use separate sessions, ask for summaries, and store notes in a separate file |
| Ignores instructions (agent steering rules) | Gets fired | Must follow rules or refund |
| Creates new specs for tiny features instead of extending existing ones | Creates a new package only if reusable | Must respect current session scope |
| Crashes but still marks task as “completed” | Gets fired | Must either finish properly or refund |
| Outputs “successful all done complete” placeholders as results | Raises NotImplementedError | Should always raise for unimplemented features |
| Does a sloppy job | Hunts for a new job | Hopefully more careful with smaller tasks |
| Never runs proper tests | Runs thorough tests | Ask for full test coverage – but beware, your budget may vanish fast |
| Not ready to ship autonomously | Can work autonomously | Add more rules – will it help? |
| Burns through your budget for only uncertain results | Delivers within budget | Pricing should reflect useful results, not wasted usage |
Final
My opinion: Kiro isn’t ready to work fully autonomously. It burns through budget fast, delivers only so-so results, and needs tighter rules plus better pricing to be truly useful.
Will I hire Kiro? Definitely yes. We need agents with different angles – just like people – to handle different tasks.