1. IELTS speaking is a script; OPI is a probe
IELTS speaking has a known shape. Part 1 is four minutes of personal questions ("Where are you from? Do you work or study?"). Part 2 is a two-minute monologue on a topic given in a card. Part 3 is a four-minute discussion on related abstract topics. Examiners follow a standardised question bank and a band-score rubric. Twelve minutes, predictable, scoreable.
The Oral Proficiency Interview — codified at FSI in 1958, formalised by ACTFL and the Interagency Language Roundtable in the 1980s, and still the gold standard for US federal hiring and visas — is fundamentally different. It is a 20–40 minute conversation designed to find the ceiling at which you stop being able to function. The interviewer doesn't follow a script; they adapt in real time, pushing you up until you break, then pulling you back to verify the level at which you were stable.
The difference matters because IELTS prep makes you very good at IELTS speaking and only moderately good at speaking. OPI is built to be unprep-able — that's the entire design intent. If you can hit ILR 2 on OPI, you can probably check into a hotel, complain about a faulty appliance, and explain your project to a colleague. If you got 7.0 on IELTS, that suggests B2/C1 territory, but doesn't certify any of those specific abilities.
2. The OPI's five phases
A real OPI interview follows a five-phase arc, taught in the same form to every ACTFL-certified tester worldwide:
Phase 1 — Warm-up (3–5 min). Easy social topics: introductions, today's weather, your job. The point is to put you at ease and establish a baseline of comfortable production.
Phase 2 — Level check (5–10 min). Questions at your hypothesised ILR level (the interviewer has a guess from your background). Can you sustain that level — not just hit it, but stay at it across multiple exchanges? Single fluent sentences don't count; the level check looks for stable function.
Phase 3 — Probes (5–10 min). Questions intentionally one ILR level above where you've been stable. The interviewer is looking for your ceiling. You will struggle, you are supposed to struggle, and how you struggle is part of the rating. Do you fall apart? Do you stay in target language? Do you use higher-level structures even when they're shaky?
Phase 4 — Role-play (3–5 min). A realistic scenario at your level: phone a hotel to extend a stay; explain to your boss why a project is delayed; talk to a doctor about chronic symptoms. The role-play tests function in pragmatic context, not just structure.
Phase 5 — Wind-down (2–3 min). Easy questions again, back to the warm-up register. Lets you exit on a confident note. Crucially, this also lets the rater confirm the floor: where you stabilise after being pushed.
Two ACTFL-certified raters independently score the recording. They must agree within one sub-level (e.g. both rate Intermediate-Mid or Intermediate-High). Disagreement triggers a third rater. The final rating maps to an ILR base level + plus marker (1, 1+, 2, 2+, 3, etc.).
3. Why test-prep candidates ace IELTS but flunk OPI
Here is the failure mode you don't read about in IELTS prep books:
A learner sits IELTS, gets 7.0–7.5, walks into a US government job interview (or moves to the UK and tries to function), and discovers their actual conversational ability is roughly ILR 1+. Why?
Three reasons:
Reason 1: IELTS Part 2 is a memorisation game. You get one minute to prepare and two minutes to speak on a known-category topic. After a few weeks of prep, most candidates have a half-dozen template monologues they can deploy with light editing. The two-minute monologue is, in practice, a recitation of a polished routine. OPI's probes are designed to land on topics you haven't prepared.
Reason 2: IELTS rewards fluency, OPI rewards function. A high IELTS score correlates with smooth delivery — pace, intonation, lack of filler. OPI's question: can you accomplish a communicative task? "I think coffee is great, I drink it every morning" is fluent enough for an IELTS band 7. It doesn't help if the OPI prompt is "tell me about a time you had to explain a workplace problem to a supervisor."
Reason 3: IELTS examiners are scored on adherence to a question bank; OPI raters are scored on accurately finding your ceiling. The incentive gradient is opposite. IELTS examiners are not trying to break you; OPI raters are. This is by design — OPI's whole purpose is to predict on-the-job performance, where breakage matters.
4. ILR vs CEFR — same skill, different evaluation philosophy
The technical mapping is well-known:
- ILR 0+ ≈ CEFR A1
- ILR 1 ≈ A2
- ILR 2 ≈ B1
- ILR 3 ≈ B2 (most FSI Foreign Service jobs end here)
- ILR 4 ≈ C1
- ILR 5 ≈ C2 / native
But the evaluation philosophy differs:
ILR/OPI is functional and integrative. It asks: at this level, can you sustainably do the tasks that matter? Production breakdowns count.
CEFR/IELTS is descriptive and analytic. It asks: at this level, do you demonstrate the linguistic features (grammar range, lexis range, fluency, coherence)? Production breakdowns count less if compensated by complexity elsewhere.
Neither philosophy is wrong. They serve different purposes — CEFR is great for academic and visa purposes (well-defined band descriptors, stable comparison across millions of test-takers); ILR is great for hiring (predicts whether you can do the job).
The practical takeaway: if your goal is a UK university place, IELTS 6.5 is what you need and OPI doesn't matter. If your goal is actually working in English, OPI ILR 2 is closer to "can you function" than IELTS 7.0 is.
5. How to add OPI-style probes to your practice
You can't hire an ACTFL rater for self-study, but you can train against the OPI shape:
Build a question bank. Write 30 prompts across ILR levels 1, 2 and 3. For each:
- One easy version (level you're targeting)
- One probe version (one level above)
- One role-play scenario (level appropriate)
Examples at ILR 2:
- Easy: "Tell me about your daily routine."
- Probe (push to 2+): "What's been the biggest change in your daily routine in the last year, and how have you adapted?"
- Role-play: "Phone the hotel to extend your stay by two nights. Negotiate the rate."
Self-record without prep. Pick a prompt at random. Record yourself answering for 90 seconds. No script, no planning. Play it back. Where did you break? Where did you stay stable?
Pattern of breaks > number of breaks. If you consistently break on past-tense narration, that's a structural gap. If you break on workplace vocabulary, it's lexical. The OPI rater looks at patterns, not totals.
PrepLearnio's Phase 2 roadmap includes an OPI simulator at /tools/opi-simulator/ that runs you through all five phases against TTS prompts, with ASR-based scoring on the function/content/accuracy/text-type axes that real OPI raters use. Until that ships, the manual procedure above is the closest substitute.
6. So which test should you take?
For most people, the answer is simple and disappointing: take whatever your end goal requires.
- UK university or visa: IELTS or LanguageCert IESOL.
- US federal job / security clearance: OPI through ACTFL or the DLPT (military equivalent).
- US university: TOEFL iBT or IELTS.
- General competitive certification: Cambridge English (B2 First, C1 Advanced, C2 Proficiency).
If your goal is "I want to know how good my English actually is, separated from test-prep advantages," the OPI is genuinely the most honest measure. ACTFL offers OPI tests for individuals (paid,
The thing IELTS doesn't tell you that OPI does: whether your English breaks under conditions you didn't rehearse for. That is the part that matters when you start using English at work.
See also: /method/ for the methodology that builds OPI-rateable proficiency; from 0 to ILR 2 in 90 days for a working plan; LanguageCert vs IELTS for UK visa for the standardised-test side.
Sources: govtilr.org ILR Skill Level Descriptions (public domain); ACTFL OPI Familiarization Manual (2020); Higgs & Clifford (1982) — the seminal paper on OPI's "Functional Trisection of Oral Proficiency Levels."