AI Skills Assessment for Developers
Assessing AI-native developers beyond copy-paste
Developers who score above 8 on AISA typically use multiple AI tools daily and can articulate when each one fails. The gap between someone who pastes ChatGPT output into their IDE and someone who has built AI into their development workflow is enormous — but invisible on a resume. AISA makes that gap visible through a 25-minute conversation.
What We Assess
Specific AI competencies we probe through natural conversation, tailored for developers.
Prompt Engineering for Code
Can they write prompts that produce usable code on the first pass? We look at how they specify function signatures, edge cases, language-specific idioms, and expected behavior. The difference between "write a sort function" and a well-constrained prompt with type annotations and test cases is the difference between a 4 and an 8.
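To make that contrast concrete, here is a minimal sketch of the two prompt styles. The function name, signature, and test cases are illustrative assumptions, not part of the AISA rubric:

```python
# Illustrative only: two ways to ask an AI for the same function.
# The function name, types, and tests below are hypothetical examples.

vague_prompt = "write a sort function"

constrained_prompt = """\
Write a Python function with this exact signature:

    def sort_events(events: list[dict], key: str = "timestamp") -> list[dict]:

Requirements:
- Stable sort; do not mutate the input list.
- Raise KeyError if any event lacks `key`.
- Events with equal keys keep their original order.

It must pass these tests:
    assert sort_events([]) == []
    assert sort_events([{"timestamp": 2}, {"timestamp": 1}]) == \\
        [{"timestamp": 1}, {"timestamp": 2}]
"""

# The constrained prompt pins down signature, edge cases, and
# verifiable behavior -- the properties a reviewer can actually check.
```

The constrained version gives the model a contract to satisfy rather than a topic to riff on, which is exactly the pattern the assessment probes for.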
AI-Assisted Debugging
When a bug appears, how do they use AI? We test whether they can provide enough context — stack traces, relevant code, environment details — for an AI to be genuinely useful, versus dumping an error message and hoping for the best.
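A sketch of what "enough context" might look like in practice. The helper function and its fields are hypothetical, purely to show the structure of a useful debugging prompt:

```python
# Hypothetical sketch: bundling the context an AI needs to debug
# usefully, instead of pasting a bare error message.

def build_debug_prompt(error: str, traceback: str, code: str, env: str) -> str:
    """Assemble error, traceback, code, and environment into one prompt."""
    return (
        f"I'm hitting this error:\n{error}\n\n"
        f"Full traceback:\n{traceback}\n\n"
        f"Relevant code:\n{code}\n\n"
        f"Environment: {env}\n\n"
        "Explain the likely cause and suggest a fix."
    )

prompt = build_debug_prompt(
    error="TypeError: 'NoneType' object is not subscriptable",
    traceback="File app.py, line 42, in get_user ...",
    code="def get_user(uid):\n    return cache.get(uid)[0]",
    env="Python 3.12, redis-py 5.0",
)
```

Candidates who structure context this way, even informally, tend to score well on this dimension; those who dump the error line alone tend not to.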
Model Selection & Tool Awareness
Not every task needs the same tool. We assess whether developers understand the trade-offs between Copilot inline completion, chat-based models, local models, and specialized coding assistants. Knowing when not to use AI is as important as knowing how.
Workflow Integration Depth
Is AI embedded in their daily process or something they use occasionally? We probe for evidence of AI in commit messages, PR descriptions, documentation generation, test writing, and code review — the mundane work where AI actually saves the most time.
Dimension Focus
AISA scores across five dimensions. Here is how the most heavily weighted ones are tuned for developers.
Workflow & Application (25%)
How AI integrates into their actual development process. We look for evidence of AI in code review, architecture decisions, debugging workflows, and documentation — not isolated demo-style prompts. The best developers treat AI as a persistent pair-programming partner, not an on-demand answer machine.
Prompting & Communication (23%)
How they structure prompts for code generation, debugging, and architecture. Strong developers provide context about their codebase, specify constraints upfront, and iterate on outputs rather than accepting first results. We probe whether they can decompose a complex task into a prompt sequence that a model can actually handle.
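As an illustration of that decomposition, here is a hypothetical prompt sequence for one feature. The task, prompts, and class names are invented examples, not AISA material:

```python
# Hypothetical sketch: decomposing "add rate limiting to our API"
# into smaller prompts a model can handle one at a time.

prompt_sequence = [
    "Summarize the trade-offs between token-bucket and sliding-window "
    "rate limiting for a REST API behind a load balancer.",
    "Given we chose token-bucket, write a Python class TokenBucket with "
    "a method allow(key: str) -> bool, backed by an injected clock.",
    "Write pytest tests for TokenBucket covering burst, refill, and "
    "multiple independent keys.",
    "Draft middleware that applies TokenBucket per API key and returns "
    "HTTP 429 with a Retry-After header when a request is rejected.",
]

# Each step's output feeds the next prompt as context, keeping every
# individual request within what a model can reliably handle.
```

A candidate who naturally works in steps like these, rather than asking for the whole feature at once, demonstrates exactly the skill this dimension measures.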
Technical Understanding (20%)
Do they understand context windows, model differences, token limits, and when to use which tool? A developer who picks GPT-4 for a simple regex explanation is wasting tokens and time. We assess whether they have a mental model of what these systems can and cannot do.
What Good vs. Poor Looks Like
Concrete signals the AISA rubric is designed to detect. These patterns distinguish proficiency levels.
Strong signal (score 7-10)
- Describes specific prompt patterns for their stack — e.g., providing TypeScript interfaces as context before asking for implementation, or including test fixtures in debugging prompts.
- Can explain why they switched from one AI tool to another for a specific task, citing concrete limitations (context window size, latency, accuracy on their language).
- Uses AI for non-obvious tasks: generating test data, writing migration scripts, drafting ADRs, explaining legacy code to new team members.
- Articulates failure modes they have encountered — hallucinated APIs, incorrect dependency versions, subtly wrong logic — and how they verify outputs.
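The "interface as context" pattern from the first bullet can be sketched as follows. The TypeScript interface and the requested function are made-up examples:

```python
# Illustrative: supplying a TypeScript interface as context before
# asking for an implementation. The interface here is hypothetical.

interface_context = """\
interface Invoice {
  id: string;
  lineItems: { sku: string; qty: number; unitCents: number }[];
  issuedAt: string; // ISO 8601
}
"""

prompt = (
    "Using this TypeScript interface as the source of truth:\n\n"
    + interface_context
    + "\nImplement totalCents(invoice: Invoice): number, summing "
    "qty * unitCents across lineItems. No external dependencies."
)
```

By anchoring the request to a concrete type, the developer constrains the model to their actual data shapes instead of letting it invent its own.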
Weak signal (score 1-4)
- Describes AI use in generic terms: "I use ChatGPT to help with coding." Cannot cite a specific instance where they structured a prompt deliberately.
- Has no opinion on model differences. "I just use whatever is available." Does not understand why different tasks might benefit from different models.
- Cannot describe a time AI gave them a wrong answer, or how they validate AI-generated code before committing it.
- Uses AI only for the most basic tasks (autocomplete, simple questions) and has not explored how it could transform their broader workflow.
The Conversation Approach
AISA does not ask developers to solve coding problems in a chat window. Instead, it has a technical conversation about how they work. The assessment AI adapts its questions based on the candidate's experience level and tech stack.
A typical developer assessment might start with their daily workflow, explore a specific project where they used AI heavily, probe into the decisions they made (why that model, why that prompt structure, what went wrong), and finish with scenario-based questions about unfamiliar situations. The scoring AI tracks evidence across all five dimensions throughout — it is not checking boxes, it is building a profile.
This matters because the best developers often cannot articulate their AI skills on a resume. They have integrated these tools so deeply into their workflow that using AI feels as natural as using an IDE. The conversational format surfaces this tacit knowledge in a way that multiple-choice tests never can. For more on why this approach works, see our piece on why conversation beats quizzes for AI assessment.
The Hiring Context
Engineering teams are split on AI adoption. Some organizations actively encourage AI-assisted development; others still treat it as a gray area. The result is a workforce where AI fluency is unevenly distributed and nearly impossible to gauge from interviews or take-home assignments.
Traditional technical interviews test algorithmic thinking and system design — both important, but neither captures how a developer actually works day-to-day in an AI-augmented environment. A candidate might ace a whiteboard session and then spend three hours manually writing boilerplate that Copilot could have generated in seconds.
AISA fills that gap. It complements your existing technical interviews by adding a dimension you currently cannot measure: how effectively someone works with AI tools in practice. For teams building their AI-native hiring pipeline, our AI-Native Hiring Guide covers the full process from job description to offer stage.
Why It Matters
Hiring managers consistently overweight whiteboard-style coding ability and underweight how effectively a developer works with AI tooling. In practice, a developer who writes mediocre algorithms but uses AI to ship twice as fast creates more value than a LeetCode specialist who treats AI as cheating. AISA gives you evidence of the actual workflow — not self-reported tool lists on a resume, but demonstrated fluency in a conversation that requires them to explain how, when, and why they reach for AI.
Further Reading
Understand the methodology and context behind our AI skills assessment.