AI can generate code. Most of it's terrible, and here's how to "vibe code" properly.
Oh, and I had never touched Android development before this, barring one app in 2017 that was really just a website iframe masquerading as an app. The current app is still in progress, but this article lays out my thoughts on how best to use AI, and how it has helped me given the domain-specific knowledge and context I have.
Before we begin: Some Context and Thoughts
See, the thing about programming is that no matter what end product you're aiming for, the fundamentals largely remain the same. At its core, programming is breaking a complex task into small, doable chunks, completing each of them, and then bringing it all together. This isn't just about spitting out lines of code; it's about architectural thinking, foresight, and a deep understanding of the "why" behind the "what". It requires a game plan, solid knowledge of algorithms to optimize your app, and clarity on every small step needed to reach your desired end product.
Herein lies a crucial distinction: AI, in its current form, doesn't understand in the human sense. It excels at pattern recognition and generation based on the vast datasets it's trained on, but it lacks genuine comprehension, intuition, and the ability to navigate ambiguity or novel situations that weren't in its training data. It can generate a snippet, but it can't truly innovate or strategize the overarching solution to a problem it has never encountered.
Human developers are needed to translate vague ideas into concrete, actionable plans, to make judgment calls when there are multiple viable paths, to debug issues that require a deep understanding of the system's behavior, and to ensure the final product is not just functional but also ethical, secure, and user-friendly. AI might generate the bricks, but the human developer is the architect, the engineer, and the quality assurance, all rolled into one. Developers who can effectively prompt, guide, and critically evaluate AI-generated code will be even more valuable.
In fact, when I gave AI full freedom, the way a novice might, to build one aspect of the app (a chat interface that uses Gemini as a backend to talk to the user), and then spent an hour feeding it every error its own code produced and even hinting at how it might solve them, this is what I ended up with:
Do you see how elementary some of these errors are?
This is, in fact, the later version, after I hinted at most of the bugs, which the AI cleared inefficiently. The errors were rudimentary: wrong variable names, ignorance of Kotlin conventions, made-up variables, hallucinations, bad algorithms (an O(n^2) text detection routine that could have been a simple regex, followed by every bad regex solution imaginable after countless hints), and so on. The first build had over 95 errors for just a chat UI and a send button that sends a POST request to the Gemini API and renders its response in a separate text bubble, all using system components and UI elements (this is barely 100 lines of code). Even the algorithm part matters: the overhead of an unoptimized search algorithm won't be felt in your 100-item test case the way it would be across billions of real-life queries.
Which is all the more reason why developers will always be needed, and why you should learn to code. Anyone can whip out an unoptimized, API-blasted-out-for-free, heats-up-your-phone, crashes-if-over-200-people-use-it app. You, on the other hand, create art.
The App
I am trying to create a proof of concept app that helps people with accessibility needs use their phones through LLMs. The concept is simple:
- The user interfaces with their phone through an LLM (called the "Automator"), which can be voice-based.
- The "Automator" breaks down what the user wants to do into simple, custom-defined YAML steps and passes them to the "Actor".
- The "Actor" takes the YAML, gets system-level accessibility privileges, and executes those actions one by one.
- The "Automator" checks whether the action is complete, and tries again if it is not.
(And yes, the concept is incredibly similar to TestDriver, which I checked out and LOVED. However, that product is meant for QA testing of UI-based apps, while this is an accessibility tool for people who do not know computers and have a language barrier. I have reached out to the people behind TestDriver and await their response; we both just came up with the concept independently. Their product is amazing for its use case, check it out [Footnote 1])
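To make the loop concrete, here's a minimal Kotlin sketch of how the four steps above could hang together. Everything in it (the Automator and Actor interfaces, ExecutionResult, MAX_RETRIES) is a hypothetical placeholder for illustration, not the app's actual code:

```kotlin
// Hypothetical sketch of the Automator/Actor loop described above; all names are placeholders.
interface Automator {
    suspend fun planFor(utterance: String): String                           // LLM turns the request into YAML steps
    suspend fun verify(utterance: String, result: ExecutionResult): Boolean  // LLM checks whether the goal was met
}

interface Actor {
    suspend fun execute(yamlPlan: String): ExecutionResult                   // replays the steps via accessibility APIs
}

data class ExecutionResult(val success: Boolean, val failedStepIndex: Int? = null)

const val MAX_RETRIES = 3

suspend fun runRequest(automator: Automator, actor: Actor, utterance: String): Boolean {
    repeat(MAX_RETRIES) {
        val plan = automator.planFor(utterance)
        val result = actor.execute(plan)
        if (automator.verify(utterance, result)) return true                 // done; otherwise re-plan and retry
    }
    return false
}
```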
This project, for me, is fundamentally an exploration in applied AI for human-computer interaction, specifically targeting accessibility for my elderly, Hindi-speaking family members. The aim is to create a more intuitive bridge to smartphone functionality by allowing them to use natural voice commands in their native language, which an LLM (the "Automator") then translates into actionable operations.
My approach to building this proof-of-concept accessibility app wasn't to ask AI to write the entire application. Instead, I focused on decomposing the overarching challenge into its constituent computer science problems—finite state machines, domain-specific languages, NLP pipeline stages, inter-process communication, and UI event handling.
For each sub-problem, I drilled down to the smallest logical "brick" or function I needed. Then, armed with the specific CS concept I was trying to implement, I effectively prompted AI to generate these individual bricks. This methodology allowed AI to handle much of the boilerplate Kotlin syntax and Android API calls, significantly boosting my productivity and helping me avoid elementary coding errors.
While complex, system-level logic errors are an inherent part of any development process, AI helped ensure the foundational components were syntactically sound and aligned with common patterns, allowing me to focus on the architectural integration and the core CS challenges.
1. The Core Interaction Loop: Automator & Actor as a Coordinated Pair
From my developer's vantage point, the Automator and Actor components operate as a tightly coupled system, best modeled as a finite state machine (FSM). This isn't just a conceptual FSM; it will have formally defined states (e.g., AWAITING_INPUT, AUTOMATOR_LLM_QUERY, ACTOR_YAML_PARSE, ACTOR_STEP_EXECUTE, AUTOMATOR_VERIFY_RESULT), explicit transition conditions triggered by events or data, and defined outputs for each state. The interaction between them is inherently asynchronous; the Automator might dispatch a YAML payload and then enter a state awaiting completion or update from the Actor, potentially involving polling or a callback mechanism. The integrity of this loop relies on robust state transition logic to prevent issues like deadlocks or race conditions, especially if the Automator needs to handle new user input while the Actor is still processing a previous command. The "contract" or protocol for data exchange between them (the YAML structure) is critical for this distributed system behavior.
How it works for me: I'm essentially designing a distributed control flow. The FSM ensures predictable behavior. My task is to ensure that all state transitions are handled, including error states (e.g., LLM_TIMEOUT, ACTOR_PERMISSION_DENIED, YAML_PARSE_ERROR), and that the system can gracefully recover or guide the user. This orchestration is key to the app's reliability. For instance, when designing the state transitions, I'd identify a specific need, like "create a Kotlin enum for FSM states" or "implement a function to transition from AWAITING_INPUT to AUTOMATOR_LLM_QUERY based on a user input event." AI was instrumental in generating the Kotlin syntax for these enums, data classes for state payloads, or even basic function skeletons for state handlers, allowing me to rapidly prototype the FSM structure based on automata theory principles I already understood.
Example prompts I gave to AI:
- "Generate a Kotlin sealed class named
AppState
representing the possible states of my application FSM. Include the following states as objects or data classes:Idle
(initial state),AwaitingUserInput
,ProcessingAutomatorQuery(val query: String)
which holds the user's query,ExecutingActorPlan(val planId: String)
which holds an identifier for the current YAML plan,AwaitingActorConfirmation(val planId: String)
, andErrorOccurred(val errorMessage: String, val previousState: AppState?)
. EnsureErrorOccurred
can optionally hold the state before the error." - "Write a Kotlin function
determineNextState(currentState: AppState, event: AppEvent): AppState
.AppEvent
is a sealed class with subtypes likeUserInputReceived(val text: String)
,AutomatorProcessingComplete(val success: Boolean, val planId: String?)
,ActorExecutionComplete(val success: Boolean, val planId: String)
. Implement basic logic: ifcurrentState
isAwaitingUserInput
and event isUserInputReceived
with non-empty text, returnProcessingAutomatorQuery
. IfcurrentState
isProcessingAutomatorQuery
and event isAutomatorProcessingComplete
with success, returnExecutingActorPlan
. Handle other sensible transitions and a default case." - "Show me a robust Kotlin implementation of a generic state machine class using a
Map
to define transitions (e.g.,Map<Pair<S, E>, S>
where S is state type and E is event type). The class should have methods to register states, register transitions, process an event, and get the current state. Include error handling for invalid transitions."
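For illustration, here's a trimmed-down version of the kind of scaffolding those prompts yield. The states and events mirror the prompts above, but treat this as a sketch of the approach rather than the app's real FSM; the transition function only covers the happy path plus a catch-all:

```kotlin
// Sketch of the FSM scaffolding described above; states and events follow the prompts, not the real app.
sealed class AppState {
    object Idle : AppState()
    object AwaitingUserInput : AppState()
    data class ProcessingAutomatorQuery(val query: String) : AppState()
    data class ExecutingActorPlan(val planId: String) : AppState()
    data class AwaitingActorConfirmation(val planId: String) : AppState()
    data class ErrorOccurred(val errorMessage: String, val previousState: AppState? = null) : AppState()
}

sealed class AppEvent {
    data class UserInputReceived(val text: String) : AppEvent()
    data class AutomatorProcessingComplete(val success: Boolean, val planId: String?) : AppEvent()
    data class ActorExecutionComplete(val success: Boolean, val planId: String) : AppEvent()
}

fun determineNextState(currentState: AppState, event: AppEvent): AppState = when {
    currentState is AppState.AwaitingUserInput && event is AppEvent.UserInputReceived && event.text.isNotBlank() ->
        AppState.ProcessingAutomatorQuery(event.text)

    currentState is AppState.ProcessingAutomatorQuery && event is AppEvent.AutomatorProcessingComplete ->
        if (event.success && event.planId != null) AppState.ExecutingActorPlan(event.planId)
        else AppState.ErrorOccurred("Automator failed to produce a plan", currentState)

    currentState is AppState.ExecutingActorPlan && event is AppEvent.ActorExecutionComplete ->
        if (event.success) AppState.AwaitingUserInput
        else AppState.ErrorOccurred("Actor failed on plan ${event.planId}", currentState)

    else -> currentState // ignore events that don't apply to the current state
}
```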
2. The "Automator": My Intelligent Front-End
The Automator serves as the primary interface, managing the dialogue with the user and orchestrating the LLM interaction. This involves more than just a chat UI; it's an NLP pipeline. User input (voice or text) undergoes initial processing, potentially including speech-to-text conversion which itself involves acoustic modeling and language models. This natural language input is then passed to my custom-hosted LLM. The LLM's task is a sophisticated form of semantic parsing: it must perform intent recognition (what does the user want to do?) and entity extraction (what are the key parameters, like "daughter" in "Call my daughter"?), and then map this understanding to a sequence of operations defined in my custom YAML format. This YAML generation is akin to template-based code generation, where the LLM fills slots in predefined YAML structures based on the parsed intent and entities. The API calls to this custom LLM will need careful design (likely RESTful, with structured JSON request/response bodies) to handle the conversational context and manage the LLM's state, if any.
How it works for me: My core challenge here is the "intelligence" part – training the LLM. This involves not just providing examples (few-shot or fine-tuning on a base model) but also designing the output constraints for the LLM to adhere strictly to my YAML schema. I need to ensure the LLM handles linguistic ambiguity, synonyms, and the nuances of Hindi. The Automator must also manage conversation history to resolve anaphora ("call her back") and maintain context for multi-turn interactions. In building the Automator's scaffolding, I prompted AI for things like "Kotlin function to make an HTTP POST request with a JSON body and handle the response asynchronously" for LLM communication, or "Android code to capture microphone input and send it to a speech-to-text API." These well-defined, smaller problems were perfect for AI to generate functional Kotlin code, which I then integrated into the larger NLP pipeline logic I was architecting.
Example prompts I gave to AI:
- "Provide a complete Kotlin function using Ktor client for Android to make an asynchronous POST request to a specified URL. The function should take the URL, a
Map<String, String>
for headers, and a serializable Kotlin data class instance as the JSON body. It should handle potential exceptions (e.g., network errors, timeouts) and return a Result<ResponseType> whereResponseType
is another serializable data class. Include setting a connection timeout of 15 seconds and a request timeout of 30 seconds. Show how to use Kotlinx Serialization for the request and response bodies." - "Android Kotlin example: Implement a class that uses
android.speech.SpeechRecognizer
. It should have methods tostartListening(callback: (String) -> Unit)
andstopListening()
. The callback should be invoked with the recognized text. Ensure necessary permissions (RECORD_AUDIO, INTERNET) are mentioned and provide basic error handling for recognizer errors (e.g.,onError
listener)." - "In Jetpack Compose, create a reusable Composable function for a chat message input bar. It should include a
TextField
for typing the message, and anIconButton
with a 'Send' icon. The 'Send' button should be enabled only when theTextField
is not empty. The function should take a lambda(String) -> Unit
as a parameter, which is invoked when the send button is clicked, passing the current text. The TextField should clear after sending."
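As an example of the kind of "brick" the first prompt produces, here's a hedged sketch of an asynchronous POST helper using the Ktor 2.x client with Kotlinx Serialization. The request/response data classes, the endpoint, and the choice of the CIO engine are placeholders I've picked for illustration, not the app's actual API contract:

```kotlin
import io.ktor.client.*
import io.ktor.client.call.*
import io.ktor.client.engine.cio.*
import io.ktor.client.plugins.*
import io.ktor.client.plugins.contentnegotiation.*
import io.ktor.client.request.*
import io.ktor.http.*
import io.ktor.serialization.kotlinx.json.*
import kotlinx.serialization.Serializable

// Placeholder request/response shapes for the custom LLM endpoint.
@Serializable
data class AutomatorRequest(val utterance: String, val history: List<String> = emptyList())

@Serializable
data class AutomatorResponse(val yamlPlan: String)

// Shared client with the timeouts from the prompt (15 s connect, 30 s request).
val httpClient = HttpClient(CIO) {
    install(ContentNegotiation) { json() }
    install(HttpTimeout) {
        connectTimeoutMillis = 15_000
        requestTimeoutMillis = 30_000
    }
}

// Posts the JSON body and wraps network/serialization failures in a Result.
suspend fun queryAutomatorLlm(url: String, request: AutomatorRequest): Result<AutomatorResponse> =
    runCatching {
        httpClient.post(url) {
            contentType(ContentType.Application.Json)
            setBody(request)
        }.body<AutomatorResponse>()
    }
```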
3. The "Actor": My Behind-the-Scenes Worker
The Actor component is the system's effector arm. Upon receiving a YAML payload from the Automator, its first task is YAML parsing. This requires a robust parser capable of validating the input against my defined YAML schema (lexical analysis to tokenize the input, followed by syntax analysis to build an internal representation, like an Abstract Syntax Tree or a simpler object model). It then iterates through the parsed actions, executing them sequentially. This execution involves translating each abstract YAML action into concrete calls to the Android Accessibility Service APIs. This is an intricate process, as accessibility services provide powerful but low-level hooks into the OS UI tree, requiring traversal, element identification (often based on text, content-description, or resource IDs), and programmatic event dispatching (clicks, swipes, text input). Requesting and verifying accessibility privileges involves interacting with the Android permission model, an OS-level security feature.
How it works for me: I'm essentially building an interpreter for my YAML-based DSL. Each YAML action is an opcode that the Actor decodes and executes. Error handling is paramount here: an action might fail because an expected UI element isn't found, or the screen has changed unexpectedly. The Actor needs to detect such failures, potentially log diagnostic information, and report the status (success, failure, specific error code) back to the Automator for higher-level decision-making (e.g., retry, abort, ask user for help). Efficiently querying the UI tree via accessibility APIs without causing performance lag is also a consideration. For the Actor, I leveraged AI for tasks like "Kotlin code to parse a simple YAML string into a list of action objects" (perhaps using a lightweight library suggested by AI if I didn't want to write a full parser), or more specifically, "Android Accessibility Service code to find a UI element by its text content and perform a click action." These prompts, based on my understanding of parsing algorithms and OS interaction, yielded specific, usable Kotlin code snippets for Android, forming the building blocks of my action interpreter.
Example prompts I gave to AI:
- "Write a Kotlin function that takes a YAML string as input. The YAML represents a list of actions, where each action is a map with an 'action_type' key and a 'parameters' key (which is another map of key-value string pairs). Parse this YAML string into a
List<ActionStep>
whereActionStep
is a data class withactionType: String
andparameters: Map<String, String>
. Implement this using only standard Kotlin string manipulation and collection functions, without relying on external YAML parsing libraries. Handle basic malformed entries gracefully by skipping them and perhaps logging a warning." - "Within an Android
AccessibilityService
class, provide a detailed Kotlin functionperformTapOnText(targetText: String): Boolean
. This function should get the root node in the active window, recursively search for anAccessibilityNodeInfo
whose text exactly matchestargetText
and is clickable. If found, it should performAccessibilityNodeInfo.ACTION_CLICK
and return true. If not found or not clickable, return false. Include necessary null checks and error logging." - "Demonstrate how to perform a programmatic swipe gesture in an Android AccessibilityService using
dispatchGesture
. The function should take start coordinates (x1, y1), end coordinates (x2, y2), and a duration in milliseconds. Show the creation ofPath
andGestureDescription
. Ensure the gesture is dispatched correctly and handle potential exceptions." - "Provide a Kotlin utility function for an Android app that checks if a specific Accessibility Service (identified by its class name string, e.g., 'com.example.MyAccessibilityService') is currently enabled. If not enabled, it should create and return an Intent that navigates the user directly to the Accessibility settings screen where they can enable it. Handle different Android API levels if the intent action has changed."
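Here's a condensed, hedged version of what the performTapOnText prompt tends to yield. The service class name is a placeholder, and one caveat is worth knowing: findAccessibilityNodeInfosByText does substring matching, so this sketch filters for an exact match afterwards and walks up to the nearest clickable ancestor, because the text often sits on a non-clickable child view:

```kotlin
import android.accessibilityservice.AccessibilityService
import android.util.Log
import android.view.accessibility.AccessibilityEvent
import android.view.accessibility.AccessibilityNodeInfo

// Sketch of an Actor-side accessibility service; only the tap helper is shown.
class ActorAccessibilityService : AccessibilityService() {

    override fun onAccessibilityEvent(event: AccessibilityEvent?) { /* not needed for dispatching actions */ }
    override fun onInterrupt() { /* no-op */ }

    // Finds a node whose text exactly matches targetText and clicks it (or its nearest clickable ancestor).
    fun performTapOnText(targetText: String): Boolean {
        val root = rootInActiveWindow ?: return false
        val candidates = root.findAccessibilityNodeInfosByText(targetText) ?: return false
        for (node in candidates) {
            if (node.text?.toString() != targetText) continue   // the API matches substrings; enforce exact match
            var clickable: AccessibilityNodeInfo? = node
            while (clickable != null && !clickable.isClickable) clickable = clickable.parent
            if (clickable?.performAction(AccessibilityNodeInfo.ACTION_CLICK) == true) return true
        }
        Log.w("ActorService", "No clickable node found for text: $targetText")
        return false
    }
}
```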
4. The YAML Definition File: My Custom "Language"
This file is more than a simple dictionary; it formally defines a Domain-Specific Language (DSL) tailored for phone automation via accessibility services. It specifies the grammar of this language – the valid action types, their required and optional parameters, and the data types for those parameters. For example, action.pull_down_notification_bar is a zero-argument command, while action.type_text would require parameters like text_content: string and perhaps an optional target_element_query: string. The design of this DSL involves balancing expressiveness (can it represent all the actions I need?) with simplicity (is it easy for the LLM to generate and for the Actor to parse reliably and unambiguously?).
How it works for me: This DSL design is a foundational architectural decision. It dictates the capabilities of the entire system. Changes here ripple through LLM training (to teach it new "vocabulary" or "syntax") and the Actor's implementation (to add new action handlers). I need to think about versioning this DSL if I plan to expand functionality significantly over time, ensuring backward compatibility or clear migration paths. The clarity and precision of this DSL are crucial for minimizing misinterpretations by either the LLM or the Actor. While AI didn't design the DSL's semantics (that was my architectural task based on app requirements), once I defined the structure, I could ask AI to, for instance, "generate Kotlin data classes to represent these YAML action types: open_app(appName: String), type_text(text: String, elementId: String)." This helped create the object model for my DSL quickly and accurately within the Kotlin environment, forming the data structures the Actor would work with.
Example prompts I gave to AI:
- "Generate a Kotlin sealed class hierarchy to represent various UI automation actions for my DSL. The base sealed class should be
UiAction
. Include the following concrete action classes:OpenApplication(val applicationName: String)
,TypeTextAction(val textToType: String, val targetElementDescription: String?)
where targetElementDescription is optional,TapElementAction(val targetElementDescription: String, val tapType: TapType = TapType.SINGLE)
where TapType is an enum (SINGLE, LONG_PRESS), andSwipeGesture(val startXPercent: Float, val startYPercent: Float, val endXPercent: Float, val endYPercent: Float, val durationMs: Int)
using screen percentage for coordinates. Ensure all properties are vals and classes are data classes where appropriate." - "Given a Kotlin data class
RawActionStep(val actionType: String, val parameters: Map<String, Any>)
, write a factory functioncreateUiAction(rawStep: RawActionStep): UiAction?
that attempts to convertrawStep
into one of the concreteUiAction
types defined previously. Perform type checking and casting for parameters (e.g., ensure 'durationMs' is an Int for SwipeGesture). Return null if conversion fails or parameters are invalid/missing. For example, ifactionType
is 'open_app', it expects 'applicationName' in parameters." - "How would I structure a YAML file that represents a sequence of these
UiAction
types? Provide an example YAML snippet for opening 'Settings', then tapping an element described as 'Network & internet', then typing 'WiFi password' into an element described as 'Search settings'."
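And here's roughly what the resulting object model looks like in Kotlin, i.e., the data structures the Actor iterates over. This is a sketch following the prompts above, not my final schema, and only two action types are wired through the factory for brevity:

```kotlin
// Sketch of the DSL's object model; the action vocabulary follows the prompts, not the final schema.
enum class TapType { SINGLE, LONG_PRESS }

sealed class UiAction {
    data class OpenApplication(val applicationName: String) : UiAction()
    data class TypeTextAction(val textToType: String, val targetElementDescription: String? = null) : UiAction()
    data class TapElementAction(val targetElementDescription: String, val tapType: TapType = TapType.SINGLE) : UiAction()
    data class SwipeGesture(
        val startXPercent: Float,
        val startYPercent: Float,
        val endXPercent: Float,
        val endYPercent: Float,
        val durationMs: Int
    ) : UiAction()
}

// Loosely typed step as it comes out of the YAML parser.
data class RawActionStep(val actionType: String, val parameters: Map<String, Any>)

// Factory: validate and convert a raw step into a typed action, or return null if it doesn't fit the schema.
fun createUiAction(rawStep: RawActionStep): UiAction? = when (rawStep.actionType) {
    "open_app" -> (rawStep.parameters["applicationName"] as? String)
        ?.let { UiAction.OpenApplication(it) }
    "tap_element" -> (rawStep.parameters["targetElementDescription"] as? String)
        ?.let { UiAction.TapElementAction(it) }
    else -> null // unknown action type: let the Actor report it back to the Automator
}
```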
5. The Main View: My Control Center and User Interface
The main UI layer doesn't just present information; it's an event-driven system reacting to state changes within the underlying FSM. When the Automator transitions to LISTENING, the UI might update an icon or text prompt. When the Actor begins executing a complex sequence, the UI could display a progress indicator or the current step. This requires careful binding of UI elements to the FSM's state variables. The logging mechanisms I'll build for debugging (e.g., displaying the generated YAML, Actor execution traces) involve structured data presentation and potentially temporary data persistence. The manual override capability in the menu bar is a critical aspect of testability and fault isolation, allowing me to inject test YAML directly into the Actor or observe the Automator's raw LLM output. Architecturally, even for this PoC, I'll likely use a pattern like MVVM (Model-View-ViewModel) or a simplified version to keep UI logic separate from the business logic of the Automator/Actor FSM, promoting modularity and maintainability.
How it works for me: This view is my window into the app's complex internal operations. I need to ensure the UI provides clear feedback not just to the end-user, but also to myself during development for diagnostics. Implementing the UI updates in response to asynchronous events from the Automator/Actor FSM without creating UI freezes or race conditions requires careful use of threading or asynchronous programming constructs appropriate for Android (e.g., Coroutines, LiveData). For the UI, my CS understanding of event handling and UI patterns guided my requests to AI. For example: "Android Jetpack Compose code for a simple chat bubble UI," "Kotlin function to update a Text composable based on a LiveData stream," or "XML layout for an Android options menu with three items." AI excelled at generating these UI "bricks" according to standard Android practices, which I then wired into the FSM's state management logic to reflect the application's current operations.
Example prompts I gave to AI:
- "Create a Jetpack Compose screen for a chat interface. It should have a
TopAppBar
with the title 'Accessibility Automator'. Below that, aLazyColumn
should display a list ofMessage
data class instances (Message(text: String, isUserMessage: Boolean)
), styled differently for user vs. AI messages (e.g., alignment, background color). At the bottom, include the message input bar Composable (from a previous prompt). TheLazyColumn
should automatically scroll to the bottom when a new message is added. Show how to manage the list of messages in aViewModel
withMutableStateList
." - "How to implement an Android
Activity
options menu (usingonCreateOptionsMenu
andonOptionsItemSelected
) with three items: 'Manual YAML Input' (opens a dialog to paste YAML), 'View Actor Execution Log' (navigates to a new screen/fragment showing a list of log strings), and 'App Settings' (navigates to a settings screen). Provide the XML for the menu resource and the Kotlin code for handling selections." - "Demonstrate using Kotlin Coroutines and
StateFlow
in an AndroidViewModel
to expose the currentAppState
(from a previous FSM prompt). Then, show how a Jetpack Compose Composable function can collect thisStateFlow
and conditionally display different UI elements (e.g., a 'Listening...' Text, a 'Processing...' spinner, or an error message Text) based on the currentAppState
value." - "Provide a Jetpack Compose Composable function
StatusIndicator(appState: AppState)
that displays different icons and/or text messages based on the type ofAppState
. For example, a microphone icon forAwaitingUserInput
, a spinning progress indicator forProcessingAutomatorQuery
orExecutingActorPlan
, and a red error icon with the message forErrorOccurred
."
And guess what? We have a largely working Android client for this whole thing, though I am still working through some edge cases: a custom-defined YAML syntax, an Automator that talks to an LLM instance through POST requests, an Actor that takes my YAML syntax, gets accessibility permissions, and executes those actions, and an FSM architecture tying it all together. The core of this project, before it gets released, now lies solely in how well I train the LLM and in introducing guardrails so that unintended actions are never executed.
Know your stuff, and use AI as the best search engine + brick forge in existence. It excels at syntax; you need to excel at the semantics.
Coming Soon
----
Cover image: Olympia Drive, Amherst, MA | 11PM on a random fall evening in 2023
Footnotes:
1: For full transparency, here's the message I sent a week ago when I stumbled across TestDriver in a Reddit ad and tried it out: