Anatomy of a rubric
Fractional weight in the overall score. Weights across all dimensions
should sum to
1.0. The platform normalizes if they don’t, but explicit
is better.Minimum score on this dimension for a pass. A candidate that scores
below the threshold on any single dimension fails the rubric — even
if their overall is above
overallPassThreshold.Minimum weighted-average score for a pass. Defaults to
7.0 if unset.How scoring runs
For each dimension, the AI:Reads the transcript
Identifies the segments relevant to this dimension — code-quality
evidence in the coding round, discovery-question evidence in a sales
round.
Grades against the dimension's description
Produces a 0–10 score with a short reasoning paragraph citing the
specific transcript lines that drove the score.
Joins into the breakdown
All per-dimension scores are bundled into
evaluation_breakdown with
transcript citations and reasoning per dimension.Why dimensions matter more than overall
A single overall score throws away the signal you need to actually decide. Two candidates can both score 7.5 overall and be wildly different hires:| Candidate | Communication | Technical | Trade-offs | Authenticity |
|---|---|---|---|---|
| Alex | 9.0 | 6.5 | 7.5 | 9.0 |
| Brin | 6.0 | 9.0 | 7.5 | 9.0 |
Templates ship with calibrated rubrics
You don’t have to design rubrics from scratch. intervyo.ai ships calibrated starters for the most-hired roles:Software engineers
Problem decomposition · Code quality · Trade-off thinking · Communication
Sales reps
Discovery quality · Pitch adaptation · Objection handling · Closing
Customer support
Diagnosis · Written communication · Technical fluency · Escalation judgement
Product managers
Prioritization · Customer empathy · Cross-functional language · Metric literacy
Calibration analytics
Rubrics are useful only if humans use them consistently. intervyo.ai surfaces:- Recruiter-to-recruiter variance per dimension
- Hiring-manager parity across panels reviewing the same candidates
- Drift alerts when a recruiter’s scoring trends away from the team baseline by more than a threshold
- Calibration sessions — built-in quarterly workflow where multiple scorers grade the same candidate and the platform surfaces divergence
Versioning
Rubrics on a template are versioned. Edits create a new version; existing sessions keep the rubric they were scored against. That way:- Hiring panels reviewing 60-day-old scorecards see the rubric that produced those scores
- You can tighten a rubric mid-quarter without breaking historical comparability
- Compliance teams can audit which version of which rubric scored which candidate