Grade Any AI Output — Know If It's Actually Good or Just Sounds Good
AI can sound confident while being wrong. This prompt turns AI into its own quality checker.
A systematic prompt that forces AI to flag its own uncertain claims. Trust but verify — automatically.
You are a fact-checking specialist and AI reliability analyst. I'm about to use AI-generated content for something important. Before I do, audit it for hallucinations, fabrications, and overconfident claims. The content to audit: [PASTE AI-GENERATED CONTENT] The topic/domain: [WHAT IS THIS ABOUT?] Perform this audit: 1. CONFIDENCE MAP — Go through every factual claim and rate your confidence: 🟢 HIGH (>90%) — Well-established fact, easily verifiable 🟡 MEDIUM (50-90%) — Likely true but should be verified 🔴 LOW (<50%) — Possibly hallucinated, unverifiable, or suspicious ⚫ FABRICATED — This is definitely made up (fake statistics, non-existent studies, invented quotes) 2. RED FLAG PATTERNS — Identify common hallucination tells: - Suspiciously specific statistics without sources - Named studies or papers (often fabricated) - Quotes attributed to specific people - Historical dates and events - Technical specifications or benchmarks 3. VERIFICATION CHECKLIST — For every 🟡 and 🔴 claim, tell me exactly what to Google to verify it 4. CLEANED VERSION — Rewrite the content with hallucinations removed and uncertain claims properly hedged 5. TRUST SCORE — Overall reliability rating (0-100%) with justification
CONFIDENCE MAP: 🟢 'Python is the most popular language for data science' — TRUE, verified by Stack Overflow surveys 🔴 'A 2023 Stanford study found that 67% of developers...' — SUSPICIOUS. Cannot verify this specific study exists. ⚫ 'As Jeff Bezos said in his 2019 keynote...' — FABRICATED. No such keynote quote exists. TRUST SCORE: 62% — Content has 3 likely hallucinations that need verification before publishing.
This prompt exploits AI's ability to self-audit by forcing explicit uncertainty quantification. When instructed to flag claims by confidence level and identify unsupported assertions, AI becomes dramatically more honest about what it actually knows versus what it's pattern-matching.
Use before publishing any AI-generated factual content — research summaries, technical explanations, historical claims, or statistical assertions. Critical for journalists, researchers, students, and anyone whose credibility depends on accuracy.
You get a confidence-rated breakdown of every factual claim in the output, with high-confidence statements separated from uncertain ones. Specific claims are flagged for manual verification with suggested fact-checking approaches.
AI can sound confident while being wrong. This prompt turns AI into its own quality checker.
Identify cognitive biases affecting your decisions with debiasing techniques.
Determine whether a relationship in your data is real, spurious, or hiding a confounding variable
Stop guessing which prompt is better. This systematic framework tests variations and picks the winner.
Paste raw data, get patterns, surprises, segments, and actionable recommendations — not just numbers