Larry helps AI coding tools generate production-ready code through specialized generators instead of open-ended prompting alone.
It combines traditional generators such as schematics or Nx Devkit-based generators with encoded architecture standards and AST-based validation so cheaper models can reliably produce code that fits your codebase. The result is better consistency, lower token spend, and less review churn.
Larry API keys are issued through the Larry platform, and the token limits are intentionally generous for real day-to-day development work.
To create a key:
- Open Larry Dev Platform.
- Sign in with your GitHub account.
- Open the
API Keyssection. - Click
Generate key.
Use Larry through MCP in your AI coding tool of choice, with tool-aware instructions and repeatable workflows.
Use the interactive schema-filling wizard to pick a generator, fill the required input, and write generated code to your project.
Build your own Claude-based agent around Larry generators and orchestrate the generation flow yourself.
Import generator functions directly from the package and invoke them inside custom scripts, tools, or internal platforms.
Most AI coding workflows still ask a model to "figure out" your architecture from a mix of training data, partial codebase context, and whatever patterns it happens to notice in the current repo.
That works for simple tasks. It breaks down when you need code that is:
- consistent with your engineering standards
- structured the same way across teams and features
- cheap enough to use every day
- reliable enough to trust in larger codebases
Larry solves that by putting generators in the middle of the workflow.
Instead of asking a model to invent structure from scratch, Larry gives it a constrained path:
- understand the generation intent through a schema
- use templates and patterns encoded by senior engineers
- generate code for a specific architectural target
- validate and refine output with AST-aware checks
This is the important shift: AI is still involved, but it is no longer guessing the shape of the solution from scratch.
- Reduces variance in generated code across teams and sessions
- Makes code generation cheaper by enabling smaller, lower-cost models
- Encodes architecture standards once instead of repeating them in prompts
- Produces code that matches your file structure, naming, and patterns
- Lowers review overhead because generated code starts from approved templates
- Makes generators useful inside agent workflows, not just local scaffolding
The generator stack behind Larry currently includes:
react(created by the Codestrap team)- React components
- React hooks
- React modules
- TanStack Query hooks
- TanStack Form modules
- TanStack Table modules
palantir(created by the Codestrap team)- Compute modules
- DAO generators
- DAO delegates
- Factory generators
- Ontology generators
- more
- We welcome senior engineers who want to contribute additional generators for stacks such as Next.js, Express.js, or other platform-specific workflows.
- Reach out at
larry@codestrap.me.
These generators are especially useful in opinionated frontend codebases where consistency matters as much as speed.
Traditional generators are deterministic, but rigid. Pure AI generation is flexible, but inconsistent.
Larry combines both:
- generators provide the structure
- templates encode the standards
- AI fills in the intent
- AST analysis checks whether the output actually matches expectations
That combination is what makes the system both practical and economical. Instead of paying a premium model to rediscover your architecture every time, you encode the architecture once and reuse it across every generation workflow.
In one Claude Code workflow using Sonnet 4.6, Larry reduced generation cost by about 50% and improved speed by about 20% compared with doing the same task through open-ended prompting alone.
| Larry generators | Plain Claude Code |
|---|---|
| ✅ Generated from encoded templates and standards created by senior engineers | ❌ Generated from a mix of training data, partial repo context, and inferred local patterns |
| ✅ Produces predictable structure across files, modules, and naming | ❌ Can drift into inconsistent structure between similar features |
| ✅ Uses explicit architectural targets instead of guessing the desired shape | ❌ Often needs repeated prompt correction to align with project conventions |
| ✅ Lower cost because the model solves a narrower problem | ❌ More expensive because the model must infer more from scratch |
| ✅ Cost: $0.0775 | ❌ Cost: $0.1443 |
| ✅ Time to finish: 2min | ❌ Time to finish: 2min23sec |
Costs can be reduced even further with a custom Larry agent harness, because in the benchmark above we still used Claude Code, which means Claude was deciding what files to read and how to navigate the codebase even when that extra exploration was not needed. In Brainly's guided coding study, a custom Larry workflow achieved about 97% cost reduction per task.
AI works best when the target architecture is explicit. Without clear standards for structure, layering, naming, and code organization, generated code becomes inconsistent, harder to review, and more expensive to produce because the model has to infer too much on its own.
That is why Larry works best in codebases with an opinionated architectural model. When the architecture is defined, generators can encode it, and AI can follow it reliably instead of guessing.
If you are interested in a React example of this broader idea, read React for Enterprise: Timeless Architecture for Enterprise Apps by Przemyslaw Nowak, published with Nx:
Larry was created by experienced engineers who have built and scaled developer platforms for large organizations, including Fortune 500 environments with hundreds of engineers.
|
Dorian Smiley doriansmiley
|
Przemek Nowak fasosnql
|
Igor Kopach kopach
|
Andrzej Fricze fricze
|
Ben Rogojan bcodestrap
|
