Verdict System
How AletheionGuard decides between ACCEPT, MAYBE, and REFUSED based on Q1, Q2, and Height
ACCEPT
Low uncertainty. Model is confident and likely correct.
Q1 < 0.35 AND
Q2 < 0.35 AND
Total Uncertainty < 0.30
MAYBE
Moderate uncertainty. Question may be ambiguous.
Q1 ≥ 0.35 OR
0.30 ≤ Total Uncertainty < 0.60
REFUSED
High uncertainty. Model lacks knowledge or hallucination risk.
Q2 ≥ 0.35 OR
Total Uncertainty ≥ 0.60
Decision Logic
Key Insight
The order matters: REFUSED takes priority over MAYBE. If epistemic uncertainty (Q2) is high, the verdict is always REFUSED regardless of Q1, because the model fundamentally lacks knowledge.
Verdict Breakdown
✅ ACCEPT - High Confidence
What it Means
- • Question is clear and unambiguous
- • Model has strong knowledge about the topic
- • Low risk of hallucination
- • Response is likely accurate
Recommended Action
- • ✅ Safe to use response automatically
- • ✅ Can be shown to end users
- • ✅ Minimal human review needed
- • ✅ High trust in model output
⚠️ MAYBE - Moderate Uncertainty
What it Means
- • Question may be ambiguous or subjective
- • Multiple valid interpretations possible
- • Model has knowledge but context is unclear
- • Inherent data noise (high Q1)
Recommended Action
- • 🔍 Ask user for clarification
- • 🔍 Request more specific question
- • 🔍 Show response with disclaimer
- • 🔍 Include confidence score in UI
🚫 REFUSED - High Epistemic Uncertainty
What it Means
- • Model lacks sufficient knowledge
- • High risk of hallucination
- • Out-of-distribution query
- • Topic outside training data
Recommended Action
- • 🛑 Do NOT use response automatically
- • 🛑 Escalate to human expert
- • 🛑 Retrieve additional context (RAG)
- • 🛑 Flag for manual review
Decision Tree Visualization
Threshold Reference
| Metric | Threshold | If Exceeded | Reason |
|---|---|---|---|
| Q2 (Epistemic) | ≥ 0.35 | REFUSED | Model lacks knowledge |
| Total Uncertainty | ≥ 0.60 | REFUSED | Combined uncertainty too high |
| Q1 (Aleatoric) | ≥ 0.35 | MAYBE | Question is ambiguous |
| Total Uncertainty | 0.30 - 0.59 | MAYBE | Moderate uncertainty |
| All Metrics | Below thresholds | ACCEPT | Low uncertainty, high confidence |
Real-World Examples
Healthcare Q&A
Q: "What is normal heart rate?"
✅ ACCEPT - Safe to answer
Q: "Is my headache serious?"
⚠️ MAYBE - Need more context
Q: "Should I take drug X?"
🚫 REFUSED - See doctor
Customer Support Automation
Q: "How do I reset password?"
✅ ACCEPT - Auto-respond
Q: "Why was I charged?"
⚠️ MAYBE - Ask for details
Q: "Legal issue with contract"
🚫 REFUSED - Escalate to agent
RAG System Optimization
Model has sufficient knowledge. Return answer directly without additional retrieval.
Question is ambiguous. Ask user to clarify before retrieving documents.
High epistemic uncertainty. Trigger additional retrieval (k=10 → k=20) to provide more context.
Implementation Example
Next Steps
Questions about Verdicts?
Learn how to tune thresholds for your specific use case