Loki: an open-source solution designed to automate the process of verifying factuality
UQLM (Uncertainty Quantification for Language Models): a Python package for UQ-based LLM hallucination detection
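The core idea behind black-box, UQ-based hallucination detection is to sample the same prompt several times and treat disagreement between samples as a risk signal. Below is a minimal, self-contained sketch of that idea; the `generate` callable and the agreement heuristic are illustrative assumptions and do not reflect UQLM's actual API.

```python
# Minimal sketch of consistency-based (black-box) uncertainty scoring for
# hallucination detection. `generate` is a hypothetical stand-in for any
# LLM call; this is NOT UQLM's actual interface.
from collections import Counter
from typing import Callable, List

def consistency_score(generate: Callable[[str], str], prompt: str, n: int = 5) -> float:
    """Sample n responses and return the share that agree with the most
    common answer; low agreement suggests higher hallucination risk."""
    answers: List[str] = [generate(prompt).strip().lower() for _ in range(n)]
    _, count = Counter(answers).most_common(1)[0]
    return count / n

if __name__ == "__main__":
    # Toy deterministic "LLM", so the score is 1.0.
    fake_llm = lambda _prompt: "Paris"
    print(consistency_score(fake_llm, "What is the capital of France?"))
```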
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models
RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
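Fine-grained checkers of this kind typically decompose a response into atomic claims and verify each claim against reference text. The sketch below only illustrates that pipeline shape with naive placeholders (sentence splitting and lexical overlap); it is not RefChecker's implementation or API.

```python
# Minimal sketch of a claim-level checking pipeline: split a response into
# claims, then label each claim against a reference. The extraction and
# checking steps are deliberately naive placeholders.
from dataclasses import dataclass
from typing import List

@dataclass
class ClaimVerdict:
    claim: str
    supported: bool

def extract_claims(response: str) -> List[str]:
    # Placeholder: treat each sentence as one claim.
    return [s.strip() for s in response.split(".") if s.strip()]

def check_claim(claim: str, reference: str) -> bool:
    # Placeholder: lexical overlap instead of an NLI model or LLM checker.
    claim_terms = set(claim.lower().split())
    overlap = len(claim_terms & set(reference.lower().split()))
    return overlap / max(len(claim_terms), 1) > 0.5

def check_response(response: str, reference: str) -> List[ClaimVerdict]:
    return [ClaimVerdict(c, check_claim(c, reference)) for c in extract_claims(response)]

if __name__ == "__main__":
    ref = "Marie Curie won Nobel Prizes in Physics and Chemistry."
    resp = "Marie Curie won two Nobel Prizes. She was born in Berlin."
    for v in check_response(resp, ref):
        print(f"{'SUPPORTED  ' if v.supported else 'UNSUPPORTED'} | {v.claim}")
```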
WFGY (WanFaGuiYi / 萬法歸一): an MIT-licensed semantic reasoning engine for LLMs that addresses RAG/OCR drift, interpretation collapse, and "ghost matches" with symbolic overlays and logic patches, aimed at semantic RAG and hallucination mitigation.
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
[ACL 2024] A user-friendly evaluation framework: Eval Suite and benchmarks including UHGEval, HaluEval, HalluQA, etc.
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.
An up-to-date curated list of state-of-the-art research, papers, and resources on hallucinations in large vision-language models
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
😎 A curated list of awesome papers, methods, and resources on LMM hallucinations.
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
[ACM Multimedia 2025] The official repo for Debiasing Large Visual Language Models, including a post-hoc debiasing method and a Visual Debias Decoding strategy.
A code scanner that checks for issues in prompts and LLM calls
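One common way such a scanner works is static analysis of the source tree, flagging LLM calls whose prompt is built from uninspected input. The sketch below uses Python's `ast` module with assumed call-name heuristics; it is illustrative only and not tied to any specific scanner listed here.

```python
# Minimal sketch of a static scanner that flags LLM calls receiving f-string
# prompts (a common prompt-injection / unvetted-prompt smell). The call names
# it looks for are assumptions, not any particular tool's rules.
import ast
from typing import List

LLM_CALL_NAMES = {"create", "generate", "complete", "invoke"}  # assumed heuristics

def scan_source(source: str) -> List[str]:
    findings: List[str] = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Attribute):
            if node.func.attr in LLM_CALL_NAMES:
                args = list(node.args) + [kw.value for kw in node.keywords]
                if any(isinstance(a, ast.JoinedStr) for a in args):
                    findings.append(
                        f"line {node.lineno}: f-string passed directly to '{node.func.attr}'"
                    )
    return findings

if __name__ == "__main__":
    sample = 'client.generate(f"Summarize: {user_input}")\n'
    print(scan_source(sample))
```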