Skip to main content
โ† Back to Blog
CV ScreeningAI RecruitmentRecruitment AgencyHiring ProcessEvaluation

The One-Role Test: How to Trial AI CV Screening Without Disrupting Your Process

A low-risk way to evaluate AI CV screening: run one real, live role in parallel with your manual review and see exactly where the two disagree.

12 June 2026ยทUpdated 12 June 2026ยท6 min readยทDan Vernon, Founder at Marxel
Share:๐•inf

The one-role test is the simplest honest way to evaluate AI CV screening: take a single live role, run your real candidates through the tool, and compare the shortlist it produces against the one your team would have built by hand. You change nothing else about how you work.

Most tool evaluations fail because they ask the wrong question. They set up a demo with sample CVs, watch the software do something impressive, and end without learning whether it would have helped on the roles that actually keep you busy. A one-role test fixes that by using your work, your standard, and your candidates.

Why One Role Beats a Big Pilot

It is tempting to "properly" trial a new tool across a batch of roles. That usually backfires. A large pilot needs sign-off, coordination, and a block of time nobody has during a live desk week. So it gets scheduled, slips, and never happens.

One role is different. It fits inside work you are already doing, the stakes are contained, and you get a clear read in an afternoon rather than a quarter.

ApproachTime to first signalRiskLikelihood it actually happens
Sales demoMinutesNoneHigh, but tells you little
Multi-role pilotWeeksCoordinationLow โ€” it stalls
One-role testAn afternoonContainedHigh

Pick the Right Role

Do not pick your hardest, most political role. You are testing the tool, not stress-testing it. Pick a role that is:

  • Live and real โ€” an open vacancy with candidates already in your inbox.
  • Representative โ€” the kind of role you fill often, not an edge case.
  • Already understood โ€” you know roughly who your strong candidates are, so you can sense-check the output.

Avoid roles where you have only two or three applicants. You need enough of a stack for ranking to mean something.

Run It in Parallel, Not Instead

The point of the test is comparison, so keep your normal process running. You are not replacing your judgement for this role โ€” you are creating a second opinion you can hold up against your own.

Because this uses real candidate data, keep the same safeguards you would use in any AI-assisted screening process: human review, clear criteria, and documented reasoning. The GDPR-compliant CV screening guide covers the UK compliance checklist in more detail.

Here is the full loop:

  1. Upload the job description. This gives the tool the written requirements.
  2. Add your briefing notes. This is the part most teams skip, and it is the part that matters most. The real criteria usually live in the hiring manager call, not the JD.
  3. Review the rubric it generates. Spend five minutes here. Adjust weights, remove anything wrong, add anything missing. If the rubric is off, the shortlist will be too.
  4. Run your CV stack through it.
  5. Compare the two shortlists โ€” the one the tool produced and the one your team would have built manually.

What You Are Actually Measuring

The interesting question is not "is the AI better than me?" It is "where do we disagree, and who is right?"

Sort the candidates into three groups:

GroupWhat it meansWhat to do
AgreeBoth you and the tool rated them the same wayConfirms the rubric is reading the role correctly
AI surfacedThe tool rated them highly; you had skipped themRead the reasoning โ€” did fatigue cause a miss?
AI droppedYou liked them; the tool held or rejected themCheck whether it missed context or caught a real gap

The disagreements are where the value is. A candidate the tool surfaced that you had passed over might be a genuine miss from reviewing CV number 60 at 4pm. A candidate it dropped that you rated highly might expose a soft criterion you never wrote into the rubric.

Either way, you learn something โ€” about the tool, and often about your own process.

Read the Reasoning, Not Just the Ranking

A ranked list on its own is not enough to trust. For every candidate that lands in a disagreement group, open the explanation. A good screening tool should tell you which criteria a candidate matched, which they missed, and what evidence it used.

If the reasoning is sound and points at real lines in the CV, the disagreement is informative. If the reasoning is vague or invented, that is your answer about the tool โ€” and it is better to learn it on one role than after you have rebuilt your workflow around it.

Decide What Happens Next

After one role you will not have a final verdict, but you will have a clear next step:

ResultNext step
High agreement, clear reasoningTry it on a second, harder role
Useful output, but the rubric was offTune the rubric and rerun
It surfaced good candidates you had missedWorth continuing โ€” and review why they were missed
Vague reasoning or unexplained disagreementPause; do not expand yet

The goal of the test is not to prove the software right. It is to find out whether it makes your screening more consistent than doing it by hand under volume โ€” and to do that without betting your week on the answer.

Related Reading


Want to run the one-role test? Upload a live role, review the rubric, and compare the shortlist against your own. Start free screening

Related Articles

Ready to screen CVs faster?

Try Marxel free and see results in minutes.

Get Started Free