Your potential hires are given role-realistic tasks and guide AI agents to complete them; we analyze their session for AI-native thinking and best agentic practices.
Now in private trial for AI-native backend roles.
Includes 5 free evals.
Target pricing: $20+ per candidate, all-inclusive (agent costs included).
This is not AI-assist. This is an AI-first benchmark.
Candidates receive role-realistic tasks in a controlled sandbox. They work exclusively through an AI agent interface: no direct code editing or terminal access.
Identify candidates who can take vague requirements and systematically break them down into actionable agent prompts. The assessment reveals their ability to think in terms of delegation, not keystrokes.
We plan to support multiple AI providers, chosen to mirror existing tools and workflows. Candidates interact with agents as they would in real work.
We measure autonomy time, context management, thrash, and swarming. These aren't vanity metrics; they're indicators of how effectively someone leverages AI to ship.
Prototype sample. Scales and labels may change. Candidates do not see pass or fail.
Measures how effectively the candidate collaborates with AI agents: provides clear direction, allows autonomous execution, maintains clean workflows, and verifies outputs.
For illustration only.
Get early access, new assessments, insights on agentic proficiency, and invites to the future of engineering hiring.
Early access includes 5 free assessments
Daniel Toye
Product engineer, architect, and tech lead for 10+ years. I've been using AI-assisted development since day one (2+ years), building AI-first since it became possible. I watched teams struggle to identify who truly gets this paradigm shift. Now I'm building the assessment I desperately needed when hiring.
LinkedIn Profile
No. The only interface is an AI agent. All interactions are recorded; there is no editor, shell, or manual access.
No. Candidates only orchestrate agents via prompts. We measure planning, delegation, and verification, not typing.
Target pricing per candidate. First 5 free. Agent-only access in a controlled environment.
We're in prototype. Apply for private beta.