In large language models, hallucination refers to the generation of content that is plausible-sounding but factually incorrect or fabricated, presented without uncertainty signals1. The model generates statistically likely output — it does not look facts up. When its training data or retrieval context is thin, incomplete, or outdated, it fills the gap with plausible-sounding approximations that can be demonstrably false.
Rhinegold's reframe: hallucination is not primarily an AI quality problem — it is a brand risk management problem. When an AI assistant states an incorrect product feature, a wrong price, or a fabricated claim about an organisation, a buyer may act on that information before ever reaching the brand's website. The exposure is real and entirely invisible to standard analytics. Rhinegold tracks this via the Hallucination Risk Index (HRI): the share of AI responses about a brand that contain a verifiable factual error, measured across a structured brand-fact prompt set. HRI is the GEO risk dimension that Citation Rate and Mention Rate do not capture — a high citation rate combined with a high HRI is an actively damaging combination.
Use when making the case for ongoing AI-answer monitoring beyond visibility tracking. A brand with high citation rate but high HRI is being cited with incorrect information — which can be worse than not being cited at all. HRI is also an input into prioritising which content assets and structured data need urgent attention.
Hallucination rate varies by provider, model version, query type, and how well a brand is represented in training data and grounding sources. No platform publishes per-brand or per-vertical hallucination rates. Measurement requires running a structured brand-fact prompt set against a verified reference, then scoring each response — it does not scale without tooling, and results are point-in-time snapshots that shift with model updates.
A directional HRI per platform can be approximated by running a curated set of brand-fact prompts — covering known products, prices, team, and key claims — against each AI platform and scoring responses against a verified fact base. This gives a relative risk ranking across platforms that is actionable even without perfect precision.
From Grounding: grounding is the mechanism that anchors AI responses to verifiable sources — better grounding reduces hallucination risk but does not eliminate it. A grounded response can still misrepresent the source it cites. From Citation Rate: Citation Rate measures whether the brand is named as a source; HRI measures whether what the AI says about the brand is factually correct. The two metrics are independent — and need to be tracked together.
There is no standardised per-brand or per-vertical hallucination benchmark. Platform providers do not publish hallucination rates by topic or brand. Measurement is bespoke and depends critically on what facts are chosen as the test set and how scoring is defined — making cross-company comparisons unreliable.