Hallucination Dataset — AI Factual Drift, Fabrication Detection & Answer Integrity Layer

Hallucination Dataset is a diagnostic intelligence layer that captures, classifies, and quantifies factual hallucination patterns in AI-generated outputs. It focuses on detecting when models produce information that is unsupported, partially inferred, or fully fabricated relative to available retrieval sources.

Core purpose: transform hallucination from an anecdotal AI failure into a measurable system signal that can be tracked across models, prompts, and retrieval conditions.

Internal system links: Datasets Root | Retrieval Observation Dataset | AI Citation Dataset | Entity Visibility Dataset | Framework Layer

DATASET OBJECTIVE

The Hallucination Dataset is designed to isolate and structure failure modes in AI reasoning where output diverges from grounded sources or retrieval evidence.

Detect factual inconsistency in generated answers
Classify hallucination severity levels
Identify trigger patterns in prompts and contexts
Measure hallucination frequency across AI models
Track correction and self-repair behavior in responses

CORE DATA FIELDS

Each record captures one hallucination event at response level.

query_id
input_prompt
ai_model (GPT, Gemini, Claude, etc)
response_output
claimed_facts
verification_status (verified / unverified / false)
hallucination_type (factual / entity / numerical / citation / fabricated source)
severity_score (low / medium / high / critical)
ground_truth_source
timestamp

HALLUCINATION CLASSIFICATION SYSTEM

Hallucinations are not treated as binary errors. They are structured into failure taxonomies.

Factual hallucination (incorrect real-world claims)
Entity hallucination (false or mixed entity identity)
Citation hallucination (fabricated or invalid sources)
Numerical hallucination (wrong calculations or stats)
Contextual drift hallucination (misinterpreted prompt intent)

Link: Hallucination Classification Module

HALLUCINATION TRIGGER CONDITIONS

This layer identifies systemic conditions that increase hallucination probability.

Low retrieval grounding density
High ambiguity query structures
Conflicting entity contexts
Missing citation reinforcement signals
Over-compression of multi-topic prompts

Link: Hallucination Trigger Analysis

CROSS-MODEL HALLUCINATION RATE COMPARISON

Different AI systems exhibit different hallucination profiles under identical query conditions.

Model-specific hallucination frequency
Severity distribution per model
Entity drift comparison across models
Citation fabrication variance

Link: AI Retrieval Behavior Dataset

ENTITY DRIFT DETECTION LAYER

A key hallucination subtype occurs when entity identity becomes unstable or incorrectly merged.

entity_id
incorrect_entity_mapping
entity_confusion_pairs
reference_mismatch_score
cross-context identity stability

Link: Entity Visibility Dataset

CITATION FABRICATION TRACKING

This module isolates hallucinated references that appear structurally valid but are not grounded.

fabricated_url_detection
invalid_source_pattern
citation_confidence_score
reference_verification_status

Link: AI Citation Dataset

HALLUCINATION DECAY & CORRECTION BEHAVIOR

Some models self-correct hallucinations within extended dialogue or updated context windows.

self_correction_rate
error_persistence_duration
follow-up correction triggers
context_recovery effectiveness

Link: Retrieval Observation Dataset

USE CASES

AI reliability engineering and evaluation
GEO system trust calibration
Entity grounding improvement strategies
Retrieval augmentation optimization
Cross-model factual consistency benchmarking

SYSTEM POSITIONING

Hallucination Dataset defines the boundary between generated knowledge and factual truth. It acts as a control system for measuring when AI shifts from retrieval-based reasoning to synthetic fabrication.

In GEO architecture, hallucination is not noise. It is a signal of broken retrieval grounding.