Blog

The 9-box grid talent review, rebuilt for the calibration problem it actually has

Learn how to run a 9 box grid talent review that goes beyond last year’s politics. See how structured pre-reads, clear potential criteria, AI-supported calibration, and evidence-based development plans improve succession planning and retention.

Top 10 AI Software to Identify and Nurture High-Potential Talent

High Potential Employees — 2026

High Potential Employees 2026

Download the white paper for free

Why your 9 box grid talent review keeps giving last year’s answer

The 9 box grid talent review is not failing because the grid is simplistic. It is failing because the calibration around each box, each employee performance discussion, and each decision on high potential employees is still driven by narrative rather than objective data. When managers walk into a talent review with only gut feel about potential employees and vague memories of performance high or performance low moments, the box assessment becomes a mirror of last year’s organization politics.

Most employees fall into the middle box talent category because managers are unsure how to separate solid performers from true high performance or high potential profiles. When the performance potential axes are not clearly defined, the same employee can be placed in different employees box positions across business units, which destroys trust in the grid and in talent management as a whole. A 9 box grid talent review only creates value when the leadership team agrees on what low performance, strong performance, and performance high really mean in the current strategy cycle, and when they can explain to each employee why they sit in a specific box grid position.

Think about the last cycle where your organization used a 9 box grid template in a regional talent review. You probably saw managers argue over whether one employee belonged in a high performance box or in a solid performance box, while no one challenged the vague definition of potential that sat behind the vertical axis of the grid. Until you treat the grid template as a decision support tool that forces clarity on performance low, high potential, and succession planning readiness, you will keep running talent reviews that feel busy but do not help leadership make different choices.

Designing a pre-read that forces real performance potential conversations

The single biggest lever to improve any 9 box grid talent review is the pre-read pack that every manager receives and completes before calibration. A strong pre-read turns the grid from a subjective box talent exercise into a structured review of employee performance, potential employees indicators, and concrete development evidence. Without it, managers arrive with anecdotes, and high potential employees are decided by who tells the best story in the room.

At minimum, each employee should have a one page template that includes the last three performance ratings, goal completion against agreed KPIs, a short 360 feedback summary, outcomes from any recent stretch assignment, and tenure in role and in the organization. This pre-read template should also capture early signals of leadership capacity such as learning agility, speed of skill acquisition, and appetite for complex problems, because these are the real drivers of the potential axis in a modern box grid. When managers see this level of objective data side by side, they can separate low performance from constrained opportunity, and they can argue less about personalities and more about evidence.

For creative driven employees whose impact is not always visible in classic metrics, the pre-read should highlight innovation outcomes, cross functional collaboration, and influence on the wider équipe, which aligns with the way creative driven employees shape high potential workplaces. This richer data set helps managers avoid pushing such employees into a low performance box simply because their contribution does not fit a narrow sales or operations template. Over time, a disciplined pre-read process will help your leadership team see patterns in where employees fall on the performance potential grid and where your development investments are actually paying off.

One-page pre-read template (example)
Below is a practical outline you can adapt into a single-page form for each employee:

Section 1 – Role snapshot: current role and level, time in role, time in organization, criticality of role to strategy.
Section 2 – Performance history: last three performance ratings, key KPIs with target vs actual, major achievements and any sustained performance low trends.
Section 3 – Development evidence: recent stretch assignments, project outcomes, cross functional work, formal learning completed, feedback highlights from 360 or manager reviews.
Section 4 – Potential indicators: examples of learning agility, speed of skill acquisition, comfort with ambiguity, problem solving in complex situations, influence on the wider équipe.
Section 5 – Manager summary: proposed box grid placement, two strengths, two risks, and one to two concrete development or succession planning recommendations.

Anchoring the potential axis in learning agility, not charisma

Most 9 box grid talent reviews collapse because the potential axis is code for “I like this person” or “this employee reminds me of me”. When potential is defined as charisma or presentation style, high performance individual contributors with quieter styles are trapped in the middle employees box, while louder low performance managers are floated into the upper right box. That is how organizations end up with succession planning pipelines full of confident talkers and thin on real leadership depth.

A defensible definition of high potential should anchor in learning agility, speed of skill acquisition, and the ability to handle increased scope without a collapse in employee performance. In practice, this means your box assessment criteria for potential employees should include how quickly they mastered a new domain, how they responded to a stretch assignment, and whether they can operate in ambiguity without constant escalation. When you use this kind of objective data, the performance potential conversation shifts from “she seems sharp” to “she moved from an individual contributor role to leading a 10 person équipe in under 18 months with sustained performance high ratings”.

Modern talent management teams are also experimenting with tools that track skill growth and internal mobility to inform the potential axis, similar to how platforms that benchmark recognition and engagement, such as those discussed in analyses of platforms similar to Awardco for high potential employees, use data to surface hidden performers. These signals will never replace judgment, but they can help managers see where employees fall on the potential continuum beyond their current job title. Over time, this approach will help your leadership cadre distinguish between high performance in a narrow box and genuine high potential for broader leadership roles.

Running the calibration room: roles, bias checks, and the mckinsey box legacy

The original McKinsey box concept that inspired the modern 9 box grid was designed to force trade offs, not to rubber stamp existing hierarchies. In a well run 9 box grid talent review, the calibration room has three explicit roles : a facilitator who manages time and flow, a devil’s advocate who challenges bias and halo effects, and a scribe who captures decisions, risks, and development commitments for each employee. Without these roles, the grid template becomes a passive slide, and the loudest voice in the room decides where employees fall.

As each employee is placed on the box grid, the facilitator should ask the same three questions : what objective data supports this performance rating, what evidence supports this potential rating, and what would change our mind in the next cycle. This discipline exposes where low performance is actually a result of a broken system, where performance low is a pattern across roles, and where performance high is being over rewarded without checking for leadership derailers. The devil’s advocate role is particularly important for high potential nominations, because it forces managers to articulate why a specific employees box placement reflects more than recent wins or personal affinity.

Some organizations are now using AI powered calibration tools to flag inconsistent ratings, detect potential gender or ethnicity bias, and compare employee performance patterns across units, which aligns with the rapid growth of such tools reported by Peoplebox and Confirm. These tools can help highlight where potential employees are systematically underrated or where certain managers never use the low performance box at all. Judgment still belongs to leadership, but the combination of structured roles, a disciplined review process, and technology support will help your talent reviews move beyond the legacy of the McKinsey box and toward sharper, fairer decisions.

Calibration checklist for the room
To make the session more rigorous, facilitators and devil’s advocates can use a short checklist:

Facilitator prompts: (1) “Which specific data from the pre-read supports this performance rating?” (2) “What concrete examples justify this potential rating on the grid?” (3) “What development or succession planning action follows from this box placement?” (4) “Have we applied the same standard to similar employees in other teams?”
Devil’s advocate questions: (1) “If this employee worked for a different manager, would we place them in the same box?” (2) “Are we rewarding recent wins over sustained performance high?” (3) “Could bias about style, background, or tenure be influencing this decision?” (4) “What evidence would move this employee one box up or down on the performance potential grid?”

From grid to action: development plans, succession slates, and retention bets

A 9 box grid talent review that ends with colored boxes and no action is a governance ritual, not a talent management engine. The value of each box lies in the specific development plan, succession planning move, or retention action that follows for the employee and for the organization. If nothing in your leadership pipeline, learning portfolio, or mobility patterns changes after the review, the grid is just an expensive template.

For employees in the high performance and high potential box, the next step should be a concrete development plan that includes at least one significant stretch assignment, exposure to senior leadership, and a clear target role on the succession planning slate. Managers should articulate what skills must be built in the next 12 to 24 months and what objective data will show that the employee is ready to move. For those in a high performance but moderate potential box, the focus may be on deepening expertise, stabilizing critical processes, and designing recognition mechanisms that respect their impact without forcing them into unwanted leadership tracks.

Employees in a low performance but potentially high potential box require a different strategy that separates capability from context. Here, the development conversation should probe whether performance low is driven by a misaligned role, unclear goals, or a lack of support, and the plan should test the employee in a different environment before writing off their potential. Across all boxes, the scribe from the calibration session should document who will do what by when, so that the next round of talent reviews can track whether employees fall into the same box talent category or whether the organization actually moved the needle.

Case example: before and after a disciplined 9 box review
Consider a regional sales organization that initially ran informal talent reviews. Most employees clustered in the middle box, high potential nominations were based on charisma, and few development plans were tracked. After introducing a structured pre-read, explicit calibration roles, and a simple action log, the next cycle looked different. High performance but quieter account managers were surfaced as succession planning candidates based on learning agility and stretch assignment results, while a few previously favored managers moved down a box when their performance low trends became visible. Within a year, internal promotions into sales leadership roles increased, and retention among clearly identified high potential employees improved because they could see a concrete development path.

Where AI, language, and leadership judgment meet in modern talent reviews

AI powered tools are reshaping how organizations run the 9 box grid talent review, but they are not a substitute for leadership judgment. These systems can aggregate performance data, flag rating anomalies, and surface patterns in employee performance that humans might miss across thousands of employees. They can also generate a draft grid template or box assessment view that shows where employees fall today based on current ratings and potential indicators.

The real opportunity lies in using AI to provide objective data that challenges comfortable narratives, not to automate the final decision on who is high potential. For example, if an AI system shows that a manager never rates anyone in the low performance box, or that women are consistently rated lower on potential despite similar performance high records, the facilitator can use this insight to steer the calibration conversation. AI can also help summarize qualitative feedback into themes that highlight leadership strengths and risks, which supports sharper development planning for both high performance and struggling employees.

Language also matters in how you talk about talent, which is why many talent management leaders now pay close attention to the words used to describe leadership skills in high potential employees, as explored in depth in this analysis of powerful words to describe leadership skills. When managers use precise, behavior based language in the review, they are more likely to align on what each box in the grid really represents. The 9 box grid talent review will never be perfect, but with better data, clearer language, and disciplined roles, it can move from a static picture of current performance to a dynamic engine of future leadership capacity.

Key statistics on 9 box grids, high potential programs, and calibration quality

Industry surveys from HR analytics providers such as Gartner’s “Modern Performance Management Benchmarking Survey 2023” and Betterworks’ “State of Performance Enablement 2023” suggest that organizations using a structured 9 box grid with rigorous calibration often report reductions in manager self assessment bias in performance ratings of around 20 %, compared with those using unstructured talent reviews. These figures are directional and will vary by company and methodology.
Talent market observers such as SignalFire, in their “Future of Work: Talent Intelligence 2022” brief, have noted that modernized 9 box practices that anchor potential in learning agility and speed of skill acquisition are associated with faster internal fill rates for critical leadership roles, sometimes approaching 30 % improvements in specific case studies, although results are not guaranteed.
Analyses from vendors including Peoplebox (“Peoplebox Talent Review Trends 2023”) and Confirm (“Organizational Network Intelligence Report 2023”) indicate that AI powered calibration and talent review tools are among the faster growing categories in HR technology, with reported adoption growth rates above 25 % year on year in some large enterprise segments.
Public case examples from companies such as Microsoft and Unilever, shared in their published talent management case studies and investor ESG reports, describe how high potential employees who receive a clear development plan and at least one major stretch assignment after a 9 box grid talent review are more likely to stay with the organization over a multi year period, with reported retention uplifts in the 15 to 25 % range in those specific programs.
Workforce platforms such as Deel, in their “Global HR Benchmarks 2023” summary, have shared that standardized pre read packs for talent reviews can reduce calibration session time by roughly 30 to 40 % in their client base, while also increasing perceived fairness among managers and HR business partners according to internal surveys.

FAQ about 9 box grid talent reviews and high potential employees

How should we define potential in a 9 box grid talent review ?

Potential should be defined in terms of learning agility, speed of skill acquisition, and the capacity to handle greater scope and complexity without a drop in performance. This means looking at how quickly an employee masters new domains, how they respond to stretch assignments, and whether they can operate in ambiguity. Charisma or presentation style should never be used as proxies for potential.

What data should be in the pre read for a 9 box session ?

A strong pre read should include at least three years of performance ratings, goal completion against KPIs, a short 360 feedback summary, outcomes from recent stretch assignments, and tenure in role and in the organization. It should also capture early leadership signals such as learning agility and cross functional influence. This combination of quantitative and qualitative data supports more objective placement on the grid.

How often should we run a 9 box grid talent review ?

Most organizations run a full 9 box grid talent review annually, with lighter touch check ins mid year for critical roles and high potential employees. The annual cycle allows enough time for development plans and role moves to take effect. More frequent reviews can create noise without adding insight, unless there is a major reorganization or strategic shift.

Where can AI tools genuinely help in talent reviews ?

AI tools can help by aggregating performance data, highlighting rating inconsistencies, and detecting potential bias patterns across managers, teams, or demographic groups. They can also summarize qualitative feedback into themes that inform potential and development discussions. However, final decisions about box placement and succession planning should remain with experienced leaders who understand context.

How do we turn 9 box outcomes into real development and succession moves ?

Every box on the grid should have a clear playbook that links placement to specific development actions, succession planning decisions, and retention strategies. For example, high performance and high potential employees should receive stretch assignments and be placed on successor slates, while solid performers might receive role depth opportunities and recognition. Documenting who owns each action and by when ensures that the next review can track real progress rather than repeating the same conversations.

Top 10 AI Software to Identify and Nurture High-Potential Talent

High Potential Employees — 2026