Exam Weight 15% 31 Articles · 6 Tasks

Context Management & Reliability

Task 5.1 Progressive Summarization (7)

K5.1.1

"$127.50 Refund" Became "Customer Requested a Refund" — The Agent Processed $50

Progressive summarization & Case Facts

arrow_forward

K5.1.2

Sources 1-2: 96%. Sources 5-6: 52%. Sources 9-10: 94%. The U-Shaped Attention Curve.

Lost-in-the-middle effect

arrow_forward

K5.1.3

40 Fields Returned, 5 Needed — Context Full After 4 Calls Instead of 11

Tool output bloat and filtering

arrow_forward

K5.1.4

The API Does Not Remember Your Conversation — You Must Send It Every Time

Stateless API & history management

arrow_forward

S5.1.1

Turns 1-10: 96% Accuracy. Turns 31+: 58%. No Persistent Facts Block.

Case Facts & issue tracker pattern

arrow_forward

S5.1.2

28% of Issues Dropped Without a Tracker. 3% With One.

Multi-issue tracking

arrow_forward

S5.1.3

2,800 Tokens of Reasoning Chain → 280 Tokens of Structured Facts. Same Findings, 10x Less Context.

Sub-agent structured output efficiency

arrow_forward

Task 5.2 Escalation Triggers (5)

K5.2.1

Sentiment-Based Escalation: 40% Volume, 30% Needed Human. Replace It.

Escalation trigger design

arrow_forward

K5.2.2

"I Want a Person" → Attempt Resolution First → CSAT 2.1. Immediate Escalation → 3.8.

Immediate escalation on human request

arrow_forward

K5.2.3

High Confidence (0.9+): 12% Errors. Low Confidence (<0.5): 68% Correct. The Signal Is Broken.

Confidence scores unreliable

arrow_forward

K5.2.4

Auto-Select "Most Recent": 27% Wrong Customer. Ask for Email: 2%.

Multi-match disambiguation

arrow_forward

S5.2.1

Policy Silent on Competitor Matching → Agent Decides → 52% Approve, 48% Deny. Same Request.

Policy gap detection & escalation

arrow_forward

Task 5.3 Structured Error Context (5)

K5.3.1

Generic "Failed" → 18% Recovery. Structured Error → 71%.

Structured error context for recovery

arrow_forward

K5.3.2

"Database Error" — Orchestrator Retried 5 Times. The Database Was Permanently Decommissioned.

Generic error status anti-pattern

arrow_forward

K5.3.3

Two Anti-Patterns That Compound: 35% of Reports Have Hidden Gaps, 25% of Queries Are Killed

Silent swallow + termination anti-patterns

arrow_forward

K5.3.4

"No Peer-Reviewed Studies Exist" — Actually, 47 Papers Were Found After the Outage Ended

Access failure vs valid empty

arrow_forward

S5.3.1

"AI Has Minimal Impact on Performing Arts" — Actually, the Search Just Timed Out

Coverage annotation in synthesis

arrow_forward

Task 5.4 Context Degradation (5)

K5.4.1

Minute 0: "src/auth/jwt.ts:12 → verifyJWT()". Minute 45: "Typical JWT Validation Pattern."

Context degradation over time

arrow_forward

K5.4.2

The Scratchpad: Writing Findings Down Before the Context Forgets Them

Scratchpad for persistent findings

arrow_forward

K5.4.3

Sub-Agent Delegation: Let Someone Else Hold the Data

Sub-agent delegation for context isolation

arrow_forward

K5.4.4

Crash Recovery: A Manifest File So You Don't Start Over

Crash recovery with manifest

arrow_forward

S5.4.1

Save, Compact, Restore: The Three-Step Workflow for /compact

/compact save → compact → restore

arrow_forward

Task 5.5 Monitoring & Accuracy (4)

K5.5.1

96% Accuracy, 40% More Escalations: Why Aggregate Metrics Lie

Aggregate accuracy masks failures

arrow_forward

K5.5.2

Stratified Sampling: Why Random Monitoring Misses the Problems That Matter

Stratified sampling for monitoring

arrow_forward

K5.5.3

Field-Level Confidence: Review the Uncertain Fields, Not the Entire Document

Field-level confidence routing

arrow_forward

K5.5.4

Confidence Calibration: The Model Says 0.9 — What Does That Actually Mean?

Confidence threshold calibration

arrow_forward

Task 5.6 Provenance & Source Mapping (5)

K5.6.1

Claim-Source Mapping: Every Fact Needs a Return Address

Claim-source mapping provenance

arrow_forward

K5.6.2

Conflicting Data: Present Both, Fabricate Neither

Conflicting data: preserve both

arrow_forward

K5.6.3

Temporal Metadata: Not Every Difference Is a Contradiction

Temporal metadata prevents false contradictions

arrow_forward

K5.6.4

Established vs Contested: Structure Reports by Evidence Strength

Well-established vs contested structure

arrow_forward

K5.6.5

Content-Type-to-Format Matching: Tables for Numbers, Prose for Analysis

Content-type-to-format matching

arrow_forward