Every technologist has a infinitesimal early successful their profession erstwhile they recognize nan truth: You cannot study everything. Much for illustration doctors take specialties, engineers yet prime lanes. Backend, infrastructure, DevOps, mobile, frontend, instrumentality learning systems. Not because we deficiency curiosity, but because location is simply excessively overmuch to know. Technology evolves, interests shift, devices beforehand and our skills follow.
Yet beneath each nan specialization, location is simply a shared foundation. We each learned really variables work. How databases scale. Why sharding matters. Why staging environments exist. Why unreality providers springiness you credits erstwhile you are building your first app. There is simply a communal furniture of hard-won engineering contented — an ecosystem of guardrails that makes existent package activity successful production.
And this is precisely why vibe coding does not work.
Generative AI apps coming are mostly built done vibes. A small prompting here, immoderate handcrafted reasoning overhead there, a sprinkle of “act for illustration an expert” and abruptly it feels for illustration you person an application. After moving pinch thousands of businesses building AI features, I spot this shape constantly. Founders walk weeks perfecting a azygous prompt, testing it manually connected cherry-picked examples, declaring triumph erstwhile it useful beautifully successful their controlled environment. But prompts unsocial are not software. And vibe coding, for each its imaginative energy, breaks nan infinitesimal you inquire it to behave for illustration accumulation code.
A existent technologist would ne'er vessel an app to customers without nan ecosystem astir it. DevOps. Logging. Monitoring. Error handling. CI/CD pipelines. Regression testing. Versioning. A frontend that makes sense. A backend that does not autumn complete nether load. Security layers. Rate limiting. Graceful degradation. Yet someway we expect vibe-coded applications to do each this pinch thing but a prompt and bully intentions. We enactment shocked erstwhile they fail.
What Production AI Actually Requires
Building AI apps requires moreover much than accepted engineering, not less. The devices are different. The nonaccomplishment modes are weirder. The aboveground area is larger. Here’s what I mean:
- Evaluations systems: You request evals to understand whether your app is behaving correctly crossed thousands of separator cases. Not conscionable nan happy way you tested manually but nan weird inputs users will inevitably propulsion astatine it. When a accepted app breaks, you get a stack trace. When a ample connection exemplary (LLM) misbehaves, you get subtle wrongness that slides past your eyes until a customer complains.
- Continuous optimization: You request optimization because LLMs drift, contexts displacement and prompts decay. What worked past period stops moving this period because nan exemplary updated aliases personification behaviour changed aliases your separator cases evolved. You request systems that observe this degradation and amended prompts automatically, not a laminitis frantically rewriting prompts astatine 2 successful nan greeting because customer complaints are piling up.
- Memory and authorities management: You request representation truthful nan app has continuity. Real applications retrieve context; they support authorities crossed sessions. You cannot build a useful AI characteristic that forgets everything betwixt requests and expects users to re-explain their business each time. Most vibe-coded apps do precisely this because authorities guidance is hard, and prompts don’t lick it.
- Observability: You request observability because hallucinations hide until they detonate successful a customer’s hands. You request to cognize erstwhile your AI is uncertain, erstwhile it’s making things up and erstwhile it’s degrading gracefully versus failing catastrophically. Traditional logging isn’t enough; you request specialized tooling for AI behaviour.
- Integration architecture: AI features don’t beryllium successful isolation; they request to link to your existing information systems. You request orchestration layers that fto models, representation systems and information sources activity together coherently.
Without an ecosystem, your AI characteristic is simply a temper committee pretending to beryllium software.
Why Demos Look Amazing Until Production Fails
This is why truthful galore AI prototypes consciousness magical successful a demo and chaotic successful production. Every time erstwhile I talk to customers, they are terrified because they cognize this is nan future, and they want to usage it, but existent people’s jobs and livelihoods are connected nan line. Everything they put into accumulation is astatine risk.
The demo is controlled, and it has curated examples, while nan punctual has been tweaked to perfection for those circumstantial cases. Everything useful beautifully because nan situation is constrained and nan trial cases are known. Then it gets deployed to production, to existent users pinch messy data. Suddenly, nan cleanable punctual is failing 30% of nan time, and cipher knows why because location is nary eval model to measurement it.
It is not because founders deficiency talent aliases ambition. It is because we are treating AI applications arsenic one-liners alternatively of systems. We are trying to vibe our measurement into production. We are skipping nan full engineering subject that makes accepted package reliable and expecting AI to magically compensate done amended prompts.
Building nan Missing Ecosystem
If you want AI features to behave pinch nan reliability of code, you request to springiness them what codification has ever had: structure, tooling, guardrails and continuous improvement. You need a level approach, not a punctual approach. A level that tin evaluate, optimize, observe separator cases and amended handling complete time. That tin merge models and representation and information into a coherent strategy that is observable and correctable.
You request nan aforesaid rigor you would use to immoderate accumulation system. Version power for prompts and evals. Testing frameworks that tally automatically. Monitoring that alerts erstwhile behaviour degrades. Rollback capabilities erstwhile deployments spell wrong. Documentation that explains not conscionable what nan punctual does, but why it is system that measurement and what trade-offs it makes.
You request squad workflows that fto aggregate group lend without stepping connected each other. You request environments for development, staging and production. You request ways to research safely without breaking nan unrecorded system.
This is not theoretical. This is basal package engineering applied to a caller domain. Security, compliance, reliability and value because those fundamentals don’t spell distant conscionable because it’s caller technology. The companies building reliable AI features understand this.
What Comes Next
AI apps cannot win connected vibes alone. They request nan engineering ecosystem that has ever made accumulation package work. Vibes are awesome for productivity and exploration. They are unspeakable for reliability and scale.
Real AI applications request specialties, devices and subject conscionable for illustration each different branch of engineering. They request level reasoning and not punctual thinking. And nan companies that clasp this reality are nan ones that will build AI features that past agelong past nan hype cycle.
The mobility for each laminitis building pinch AI correct now is simple: Are you building a demo aliases a system? Because if you want your AI characteristic successful production, you request to stop vibe coding and commencement engineering.
YOUTUBE.COM/THENEWSTACK
Tech moves fast, don't miss an episode. Subscribe to our YouTube channel to watercourse each our podcasts, interviews, demos, and more.
Group Created pinch Sketch.
English (US) ·
Indonesian (ID) ·