#

hallucination-detection

Here are 41 public repositories matching this topic...

uptrain-ai / uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

machine-learning monitoring evaluation experimentation jailbreak-detection autoevaluation root-cause-analysis prompt-engineering llmops openai-evals llm-prompting llm-eval llm-test hallucination-detection

Updated Aug 18, 2024
Python

cvs-health / uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection

uncertainty-quantification uncertainty-estimation ai-safety confidence-score hallucination confidence-estimation ai-evaluation llm llm-evaluation llm-safety hallucination-evaluation hallucination-detection hallucination-mitigation llm-hallucination

Updated Aug 22, 2025
Python

LettuceDetect

KRLabsOrg / LettuceDetect

LettuceDetect is a hallucination detection framework for RAG applications.

python nlp pytorch information-extraction bert token-classification hallucination-evaluation hallucination-detection

Updated May 18, 2025
Python

IAAR-Shanghai / UHGEval

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

benchmark evaluation dataset openai hallucination huggingface huggingface-transformers ceval gpt-3 openai-api hallucinations gpt-4 large-language-models llm chatgpt qwen hallucination-evaluation hallucination-detection

Updated Jun 7, 2025
Python

voidism / Lookback-Lens

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

text-generation factuality hallucinations large-language-models hallucination-detection

Updated Aug 13, 2024
Python

Alsace08 / Chain-of-Embedding

[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"

interpretability trustworthy-ai large-language-models self-evaluation hallucination-detection iclr2025

Updated Dec 19, 2024
Python

OpenKG-ORG / EasyDetect

An Easy-to-use Hallucination Detection Framework for LLMs.

natural-language-processing knowledge-graph generation hallucinations aigc large-language-models multimodal-large-language-models genrative-ai easydetect hallucination-detection

Updated Apr 21, 2024
Python

open-compass / ANAH

[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO

acl alignment gpt iclr neurips llms hallucination-detection hallucination-mitigation

Updated Apr 30, 2025
Python

Ruiyang-061X / VL-Uncertainty

🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".

uncertainty uncertainty-quantification multi-modal uncertainty-estimation uncertainty-analysis hallucination vision-language vision-language-model large-vision-language-model hallucination-evaluation hallucination-detection multi-modal-large-language-model

Updated Mar 18, 2025
Python

patrick-tssn / VideoHallucer

VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)

multimodal-large-language-models hallucination-detection video-language-model video-hallucination

Updated Apr 1, 2025
Python

zjunlp / EasyDetect

[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.

natural-language-processing artificial-intelligence knowledge-graph generation multimodal hallucination aigc large-language-models generative-ai model-editing knowledge-editing multimodal-large-language-models knowlm easydetect hallucination-detection

Updated Feb 25, 2025
Python

AlexanderVNikitin / kernel-language-entropy

Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)

reliability uncertainty-quantification large-language-models hallucination-detection

Updated Dec 17, 2024
Python

Ruiyang-061X / Uncertainty-o

✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".

uncertainty reasoning multimodal model-agnostic o1 large-language-models chain-of-thought large-multimodal-models hallucination-detection hallucination-mitigation

Updated Mar 13, 2025
Python

mbzuai-nlp / fire

A lightweight, agent-style framework for fact-checking atomic claims using iterative retrieval and verification. Reduces LLM and search cost while maintaining strong factuality performance.

framework retrieval verification factchecking factuality llm llm-agent hallucination-detection

Updated Jun 4, 2025
Python

aimonlabs / aimon-python-sdk

This repo hosts the Python SDK and related examples for AIMon, which is a proprietary, state-of-the-art system for detecting LLM quality issues such as Hallucinations. It can be used during offline evals, continuous monitoring or inline detection. We offer various model quality metrics that are fast, reliable and cost-effective.

continuous-monitoring guardrails instruction-following llm generative-ai hallucination-detection

Updated Aug 4, 2025
Python

Kernel-Dirichlet / CoTARAG

Agentic-AI framework w/o the headaches

sql multi-modal indexing-engine rag llm chain-of-thought prompt-caching prompt-engineering-for-programmers hallucination-detection agentic-framework agentic-workflow hallucination-mitigation agentic-rag agentic-ai

Updated Jun 8, 2025
Python

AikyamLab / hallucinogen

A benchmark for evaluating hallucinations in large visual language models

ai aisafety visual-language-models hallucination-evaluation hallucination-detection medical-safety medical-visual-language-model

Updated Mar 18, 2025
Python

F4biian / HalluRAG

Source code of "The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM's Internal States" (arXiv: https://arxiv.org/abs/2412.17056)

dataset rag closed-domain hallucinations large-language-models llm llms retrieval-augmented-generation hallucination-detection

Updated Mar 20, 2025
Python

fannie1208 / FactTest

[ICML2025] "FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees"

hypothesis-testing conformal-prediction large-language-models hallucination-detection neyman-pearson-classification

Updated May 29, 2025
Python

aimonlabs / hallucination-detection-model

HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification

nlp machine-learning hallucination llm hallucination-detection

Updated May 2, 2025
Python

Improve this page

Add a description, image, and links to the hallucination-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hallucination-detection topic, visit your repo's landing page and select "manage topics."