Skip to content

Purdue-M2/Detect-LAIM-generated-Multimedia-Survey

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 

Repository files navigation

Detect-LAIM-generated-Multimedia-Survey

This repository contains a collection of resources and papers on Detecting Multimedia Generated by Large AI Models: A Survey

timeline

The references of those works are displayed in Generation Works and Detection Works.

Please let us know if you find a mistake, or if we have missed your wonderful work by e-mail: lin1785@purdue.edu, hu968@purdue.edu, gupt1031@purdue.edu

If you find our survey useful for your research, please cite the following Paper

@article{lin2024detecting,
  title={Detecting Multimedia Generated by Large AI Models: A Survey},
  author={Lin, Li and Gupta, Neeraj and Zhang, Yue and Ren, Hainan and Liu, Chun-Hao and Ding, Feng and Wang, Xin and Li, Xin and Verdoliva, Luisa and Hu, Shu},
  journal={arXiv preprint arXiv:2402.00045},
  year={2024}
}

💻 Contents

📈 Related Work

         - A Survey on Detection of LLMs-Generated Content Paper GitHub

         - A Survey on LLM-generated Text Detection: Necessity, Methods, and Future Directions Paper GitHub

         - Towards possibilities & impossibilities of ai-generated text detection: A survey Paper

         - Machine-generated text: A comprehensive survey of threat models and detection methods Paper

         - The Age of Synthetic Realities: Challenges and Opportunities Paper

         - GenAI against humanity: Nefarious applications of generative artificial intelligence and large language models Paper

         - A Comprehensive Survey of Fake Text Detection on Misinformation and LM-Generated Texts Paper

         - Recent Advances on Generalizable Diffusion-generated Image Detection Paper

         - Survey on AI-Generated Media Detection: From Non-MLLM to MLLM Paper

         - Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey Paper

Generation

Generation Processes

Illustrations of different types of multimedia generation process based on LAIMs.

Public Datasets for Detection

Please read the column I20(Input-to-Output) with these abbreviations:

  • T2T: Text-to-Text
  • V2T: Video-to-Text
  • T2I: Text-to-Image
  • I2I: Image-to-Image
  • T2A: Text-to-Audio
  • I.A2V: (Image conditioned with Audio)-to-Video
Modality Dataset Year BM Content Link I2O #Real #Generated Source of Real Media Generative Method
Text TuringBench 2021 News Link T2T 8,854 159,758 News Media GPT-1&2&3, CTRL, GROVER
Paraphrase 2022 Essays Link T2T 98,280 163,710 Arxiv, Wikipedia, Theses GPT-3, T5
SynSCiPass 2022 Passages Link T2T 99,989 99,989 Scientific papers GPT-2, BLOOM
MAGE 2023 General Link T2T 154,078 294,381 Reddit, EL15, Yelp, XSum 27 LLMs
Stu.Essays 2023 Essays Link T2T 1,000 6,000 Ivy Panda ChatGPT
Writing 2023 Stories Link T2T 1,000 6,000 Reddit WritingPrompts ChatGPT
News 2023 News Link T2T 1,000 6,000 Reuters 50-50 ChatGPT
OUTFOX 2023 News Link T2T 15,400 15,400 Feedback Prize ChatGPT, GPT-3.5, T5
MULTITuDE 2023 Essays Link T2T 7,992 16,005 MassiveSumm GPT-3&4, ChatGPT
MGTDetect-CoCo 2023 News Link T2T 10,486 10,484 News Outlets GPT-3.5
HPPT 2023 Abstracts Link T2T 1,000 1,000 ACL Anthology ChatGPT
HC-Var 2023 General Link T2T 90,096 90,096 XSum , IMDb, Yelp, FiQA ChatGPT
HC3 2023 General Link T2T 26,903 58,546 FiQA , EL15 , MediaDialog ChatGPT
M4 2023 General Link T2T 32,799 58,803 WikiHow , Arxiv, Reddit ChatGPT, GPT-3.5, LLaMA, T5, BLOOM
F3 2023 Social Media Link T2T 12,723 27,667 Politifact , Snopes GPT-3.5
MixSet 2024 General Link T2T 3,600 3,600 Email , BBC News, ArXiv GPT-4, LLaMA2
GPABench 2024 Writing Link T2T 150,000 450,000 Arxiv GPT-3.5
M4GT-Bench 2024 General Link T2T 119,771 119,388 Wikipedia, WikiHow, Reddit, ArXiv, News 10 LLMs
RAID 2024 General Link T2T 14,917 6,287,820 Public datasets from 8 domains 11 LLMs
DetectRL 2024 General Link T2T 100,800 134,400 Writing Prompts , Yelp GPT-3.5, PaLM2, Claude, LLaMA2
MultiSocial 2024 Social Media - T2T 58,000 414,000 Gab, Discord, WhatsApp 7 LLMs
SM-D 2024 Social Media - T2T - - Medium, Quora, Reddit Sourced from social media
Image DFF 2023 Face Link T/I2I 30,000 90,000 IMDB-WIKI SDMs, InsightFace
RealFaces 2023 Face Link T2I 258 258 Prompts SDMs
OHImg 2023 Overhead Link T/I2I 6,475 6,675 MapBox , Google Maps GLIDE, DDPM
Western Blot 2022 Biology Link T/I2I ~14,000 - Western Blot DDPM, Pix2pix, CycleGAN
Synthbuster 2023 General Link T2I - 9,000 Raise-1K DALL-E 2&3, Midjourney, SDMs, GLIDE
GenImage 2023 General Link T/I2I 1,331,167 1,350,000 ImageNet SDMs, Midjourney, BigGAN
CIFAKE 2023 General Link T/I2I 60,000 60,000 CIFAR-10 SD-V1.4
AutoSplice 2023 General Link T2I 2,275 3,621 Visual News DALL-E 2
DiffusionDB 2023 General Link T/I2I 3,300,000 16,000,000 DiscordChatExporter SD
Artifact 2023 General Link T/I2I 1,749 960,894 COCO, FFHQ , COCO, LSUN SDMs, DDPM, LDM, CIPS
HiFi-FIDL 2023 General Link T/I2I ~60,000 1,300,000 FFHQ , COCO, LSUN DDPM, GLIDE, LDM, GANs
DiffForensics 2023 General Link T/I2I 232,000 232,000 LSUN, ImageNet LDM, DDPM, VQDM, ADM
CocoGlide 2023 General Link T/I2I 512 512 COCO GLIDE
LSUNDB 2023 General Link T/I2I 250,000 250,000 LSUN DDPM, LDM, StyleGAN
UniFake 2023 General Link T/I2I 8,000 8,000 LAION-400M LDM, GLIDE
REGM 2023 General Link T/I2I 116,000 116,000 CelebA , LSUN 116 publicly available GMs
DMImage 2023 General Link T/I2I 200,000 200,000 COCO, LSUN LDM
AIGCD 2023 General Link T/I2I 360,000 580,000 LSUN, COCO, FFHQ SDMs, GANs, ADM, DALL-E 2, GLIDE
DIF 2023 General Link T/I2I 34,800 54,500 LAION-585 SDMs, DALL-E 2, GLIDE, GANS
Fake2M 2024 General Link T/I2I 2,300,000 - CC3M SD-V1.5, IOI, IF , StyleGAN3
SID-Set 2024 Social Media Link T/I2I 100,000 100,000 COCO, Flickr30K, MagicBrush FLUX
Chameleon 2024 Social Media Link T/I2I 14,863 11,170 Unsplash GANs, SDMs, DALL-E 2, GLIDE
DF40 2024 Face Link T/I2I ~1,100 ~1,000,000 FF++, CDF, FFHQ, CelebA SDMs, GANs, Midjourney, DDPM
FakeBench 2024 General Link T/I2I 3,000 3,000 10 Public Datasets 10 Generative Models
AI-Face 2024 Face Link T/I2I 400,885 1,245,660 6 Public datasets SDMs, GANs, Midjourney, IF
Video WildDeepfake 2021 Face Link I.A2V 3,805 3,509 Social Media Social Media
DiffHead 2023 Face Link I.A2V 820 - CREMA Diffused Heads: build on DDPM
DVF 2024 General Link I/T2V 2,750 3,938 Intervid , Youtube-8M 8 Diffusion Models
GenVideo 2024 General Link I/T2V 1,223,511 1,078,838 Kinetics-400 , Youku-mPLUG, MSR-VTT 20 Generative Models
GenVidBench 2025 General Link I/T2V 33,931 110,400 Vript , HDL-VG-130M 8 Generative Models
PDID 2024 Social Media Link - - - Social Media Social Media
Audio In-the-Wild 2022 Speech Link T2A 20.7 hours 17.2 hours Social Media, Video Streaming Platforms Social Media, Video Streaming Platforms
LibriSeVoc 2023 Speech Link T2A 13,201 79,206 LibriTTS DiffWave, WaveNet
SONAR 2024 Speech Link T2A - 2,274 LibriTTS OpenAI, Seed-TTS, AudioGen
ASVspoof 2024 2024 Speech Link T/A2A ~289,527 ~1,211,186 MLS-English 32 Manipulation Methods
Multi-modal DGM^4 2023 News Link T/I2T 77,426 152,574 Visual News B-GST, StyleCLIP, HFGI
COCOFake 2023 General Link T/I2T 113,287 566,435 COCO SDMs
AV-Deepfake1M 2023 Face Link T2A 286,721 860,039 Voxceleb2 VITS, YourTTS, TalkLip
2024 General Link T2I ~2,300,000 ~9,200,000 LAION-400M SDMs, IF
M³A 2024 News - T2T/I/T/V/V/A/T
T/I/V/V/A2T
708,425 6,566,386 60 News Outlets LLaMA2, GPT-4, GLIDE, SD, Tango
LOKI 2024 General Link T2T/I/T/V/V/T
T/I/V/V/A2T
~9,000 ~9,000 21 Public Datasets 43 Generative Models
MMFakeBench 2024 Social Media Link T2T/I - ~11,000 MS-COCO, VisualNews, Reddit, FEVER GPT-3.5, SD-XL, DALL-E 3, Midjourney
Deepfake-Eval 2024 Social Media Link T2T/A/V 3,390 2,441 Social Media Social Media
ILLUSION 2025 General Link T2A/I
I2I
139,740 1,232,246 CelebV-Text [158], COCO, MusicCaps , Social Media 28 Generative Methods

🔎 Detection 🔥

📄 Text


Pure Detection

text_pure

Illustrations of pure detection methodologies for LAIM-generated text.

  ♣️ Easy Explainable Methods

        ▶️ Watermarking

         - Distillation-Resistant Watermarking for Model Protection in NLP Paper

         - Three bricks to consolidate watermarks for large language models Paper GitHub

         - Robust multi-bit natural language watermarking through invariant features Paper

         - Undetectable Watermarks for Language Models Paper

         - Robust distortion-free watermarks for language models Paper

         - Provable robust watermarking for ai-generated text Paper GitHub

         - A Private Watermark for Large Language Models Paper

        ▶️ Non-watermarking

         - Unraveling the mystery of artifacts in machine generated text Paper

         - Stylometric detection of ai-generated text in twitter timelines Paper

         - CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning Paper

         - Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT Paper GitHub

         - Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore Paper GitHub

  ♣️ Hard Explainable Methods

         - HowkGPT: Investigating the Detection of ChatGPT-generated University Student Homework through Context-Aware Perplexity Analysis Paper

         - GPTZero Tool

         - Detectgpt: Zero-shot machine-generated text detection using probability curvature Paper GitHub

         - Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text Paper GitHub

         - Multiscale Positive-Unlabeled Detection of AI-Generated Texts Paper GitHub

Beyond Detection

text_beyond

Illustrations of beyond detection methodologies for LAIM-generated text.

  ♣️ Efficiency

         - Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model Paper

         - Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature Paper GitHub

         - DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text Paper GitHub

         - SeqXGPT: Sentence-Level AI-Generated Text Detection Paper GitHub

         - Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection Paper GitHub

  ♣️ Attribution

         - TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text Generation Paper Turingbench

         - Whodunit? Learning to Contrast for Authorship Attribution Paper

         - Through the looking glass: Learning to attribute synthetic text generated by language models Paper

         - TopRoBERTa: Topology-Aware Authorship Attribution of Deepfake Texts Paper

         - Authorship attribution for neural text generation Paper GitHub

         - Gpt-who: An information density-based machine-generated text detector Paper

         - LLMDet: A Third Party Large Language Models Generated Text Detection Tool Paper GitHub

         - Few-Shot Detection of Machine-Generated Text using Style Representations Paper

         - Origin Tracing and Detecting of LLMs Paper

  ♣️ Generalization

         - Ghostbuster: Detecting Text Ghostwritten by Large Language Models Paper

         - Conda: Contrastive domain adaptation for ai-generated text detection Paper GitHub

         - Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features Paper GitHub

         - DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning Paper GitHub

         - Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts Paper GitHub

  ♣️ Interpretability

         - DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text Paper GitHub

         - A Watermark for Large Language Models Paper GitHub

         - Chatgpt or human? detect and explain. explaining decisions of machine learning model for detecting short chatgpt-generated text Paper

         - Check Me If You Can: Detecting ChatGPT-Generated Academic Writing using CheckGPT Paper

         - Is chatgpt involved in texts? measure the polish ratio to detect chatgpt-generated text Paper

  ♣️ Robustness

        ▶️ Adversarial Attack Robustness

         - Red Teaming Language Model Detectors with Language Models Paper

         - Radar: Robust ai-text detection via adversarial learning Paper Project Page

         - J-guard: Journalism guided adversarially robust detection of ai-generated news Paper

         - Outfox: Llm-generated essay detection through in-context learning with adversarially generated examples Paper

        ▶️ LAIM-Polished Robustness

         - Is chatgpt involved in texts? measure the polish ratio to detect chatgpt-generated text Paper

  ♣️ Empirical Study

         - ChatLog: Recording and Analyzing ChatGPT Across Time Paper GitHub

         - On the Zero-Shot Generalization of Machine-Generated Text Detectors Paper

         - On the Generalization of Training-based ChatGPT Detection Methods Paper

         - Supervised Machine-Generated Text Detectors: Family and Scale Matters Paper GitHub

         - Deepfake Text Detection in the Wild Paper GitHub

         - How large language models are transforming machine-paraphrased plagiarism Paper

         - Paraphrase Detection: Human vs. Machine Content Paper

         - MGTBench: Benchmarking Machine-Generated Text Detection Paper GitHub

         - How close is chatgpt to human experts? comparison corpus, evaluation, and detection Paper GitHub

         - Can LLM-Generated Misinformation Be Detected? Paper GitHub

         - From Text to Source: Results in Detecting Large Language Model-Generated Content Paper

📸 Image


Pure Detection

image_pure

Illustrations of pure detection methodologies for LAIM-generated image.

  ♣️ Physical/Physiological based Methods

         - Qualitative Failures of Image Generation Models and Their Application in Detecting Deepfakes Paper

         - Perspective (in) consistency of paint by text Paper

         - Lighting (in) consistency of paint by text Paper

  ♣️ Diffuser Fingerprints based Methods

         - Deep Image Fingerprint: Accurate And Low Budget Synthetic Image Detector Paper

         - DIRE for Diffusion-Generated Image Detection Paper GitHub

         - Exposing the Fake: Effective Diffusion-Generated Images Detection Paper

         - LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection Paper GitHub

         - Aligned Datasets Improve Detection of Latent Diffusion-Generated Images Paper GitHub

         - Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images Paper GitHub

  ♣️ Spatial-based Methods

         - Rich and Poor Texture Contrast: A Simple yet Effective Approach for AI-generated Image Detection Paper Project Page

         - Unmasking The Artist: Discriminating Human-Drawn And AI-Generated Human Face Art Through Facial Feature Analysis Paper

         - Detecting images generated by deep diffusion models using their local intrinsic dimensionality Paper

  ♣️ Frequency-based Methods

         - Wavelet-packets for deepfake image analysis and detection Paper GitHub

         - AUSOME: authenticating social media images using frequency analysis Paper

         - AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network Paper

         - Synthbuster: Towards Detection of Diffusion Model Generated Images Paper

         - Faster Than Lies: Real-time Deepfake Detection using Binary Neural Networks Paper GitHub

  ♣️ Distribution-based Methods

         - Zero-Shot Detection of AI-Generated Images Paper GitHub

Beyond Detection

image_beyond

Illustrations of beyond detection methodologies for LAIM-generated image.

  ♣️ Attribution and Model Parsing

        ▶️ Attribution and Model Parsing

         - Level up the deepfake detection: a method to effectively discriminate images generated by gan architectures and diffusion models Paper

         - Reverse engineering of generative models: Inferring model hyperparameters from generated images Paper

  ♣️ Generalization

         - Online Detection of AI-Generated Images Paper

         - Towards universal fake image detectors that generalize across generative models Paper GitHub

         - Raising the Bar of AI-generated Image Detection with CLIP Paper

         - Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection Paper

         - Fingerprintnet: Synthesized fingerprints for generated image detection Paper

         - Detecting Deepfakes Without Seeing Any Paper GitHub

         - Improving Synthetically Generated Image Detection in Cross-Concept Settings Paper

         - Diffusion Noise Feature: Accurate and Fast Generated Image Detection Paper

         - Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities Paper GitHub

         - HRR: Hierarchical Retrospection Refinement for Generated Image Detection Paper

         - A Sanity Check for AI-generated Image Detection Paper GitHub

         - Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection Paper GitHub

         - A Bias-Free Training Paradigm for More General AI-generated Image Detection Paper GitHub

         - Breaking Semantic Artifacts for Generalized AI-generated Image Detection Paper GitHub

         - Dual Data Alignment Makes AI‑Generated Image Detector Easier Generalizable Paper

  ♣️ Interpretability

         - Interpretable-through-prototypes deepfake detection for diffusion models Paper GitHub

         - Did You Note My Palette? Unveiling Synthetic Images Through Color Statistics Paper

  ♣️ Localization

        ▶️ Fully-supervised

         - Hierarchical fine-grained image forgery detection and localization Paper GitHub

         - Perceptual Artifacts Localization for Image Synthesis Tasks Paper GitHub

         - TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization Paper GitHub

         - UnionFormer: Unified-Learning Transformer with Multi-View Representation for Image Manipulation Detection and Localization Paper

        ▶️ Weakly-supervised

         - Weakly-supervised deepfake localization in diffusion-generated images Paper

  ♣️ Robustness

        ▶️ Adversarial Attack Robustness

         - D4: Detection of Adversarial Diffusion Deepfakes Using Disjoint Ensembles Paper

         - Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection Paper

         - All Patches Matter, More Patches Better: Enhance AI‑Generated Image Detection via Panoptic Patch Learning Paper

        ▶️ Post-Processing Robustness

         - GLFF: Global and Local Feature Fusion for AI-synthesized Image Detection Paper

         - Exposing fake images generated by text-to-image diffusion models Paper

         - Local Statistics for Generative Image Detection Paper

  ♣️ Empirical Study

         - On the detection of synthetic images generated by diffusion models Paper GitHub

         - Intriguing properties of synthetic images: from generative adversarial networks to diffusion models Paper

         - Towards the detection of diffusion model deepfakes Paper

         - Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis Paper

         - On the use of Stable Diffusion for creating realistic faces: from generation to detection Paper

         - Finding AI-Generated Faces in the Wild Paper

         - Forensic analysis of synthetically generated western blot images Paper

         - Beyond Human Forgeries: An Investigation into Detecting Diffusion-Generated Handwriting Paper

🎞️ Video


Video Detection

Illustration of detection methodology in generalization task for LAIM-generated video.

Pure Detection

  ♣️ Spatial & Temporal based Methods

         - Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features Paper

         - Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method Paper

Beyond Detection

  ♣️ Generalization

         - Revisiting Generalizability in Deepfake Detection: Improving Metrics and Stabilizing Transfer Paper

  ♣️ Empirical Study

         - Beyond Deepfake Images: Detecting AI-Generated Videos Paper

🎵 Audio


Pure Detection

Audio Detection

The artifacts introduced by DM-based neural vocoders (WaveGrad and DiffWave) to a voice signal. The differences in mel-spectrograms between real and generated ones are illustrated in the third and fifth columns.

  ♣️ Vocoder-based

         - AI-Synthesized Voice Detection Using Neural Vocoder Artifacts Paper GitHub

Beyond Detection

  ♣️ Generalization

         - Improving Generalization for AI-Synthesized Voice Detection Paper GitHub

🍯 Multimodal


Pure Detection

Multimodal Detection

Illustrations of pure detection methodologies for LAIM-generated multimodal media.

  ♣️ Prompt-guided

         - Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images Paper

         - On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection Paper GitHub

         - Human Action CLIPS: Detecting AI-generated Human Motion Paper

  ♣️ Text-image Inconsistency

         - Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News Paper GitHub

         - Exposing Text-Image Inconsistency Using Diffusion Models Paper

Beyond Detection

Multimodal Detection

Illustrations of beyond detection methodologies for LAIM-generated multimodal media.

  ♣️ Attribution

         - De-fake: Detection and attribution of fake images generated by text-to-image generation models Paper

         - FIDAVL: Fake Image Detection and Attribution using Vision-Language Model Paper GitHub

  ♣️ Generalization

        ▶️ Prompt Tuning

         - AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors Paper GitHub

        ▶️ Contrastive Learning

         - Generalizable Synthetic Image Detection via Language-guided Contrastive Learning Paper GitHub

  ♣️ Interpretability

         - Combating Misinformation in the Era of Generative AI Models Paper

         - FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant Paper GitHub

         - X^2-DFD: A framework for eXplainable and eXtendable deepfake detection Paper

  ♣️ Localization

        ▶️ Spatial-based

         - Detecting and grounding multi-modal media manipulation Paper

         - Exploiting Modality-Specific Features For Multi-Modal Manipulation Detection And Grounding Paper

        ▶️ Frequency-based

         - Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation Paper

        ▶️ MLLM-based

         - FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models Paper GitHub

         - ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization Paper

  ♣️ Empirical Study

         - Detecting Images Generated by Diffusers Paper GitHub

         - CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Paper

         - VERITE: a Robust benchmark for multimodal misinformation detection accounting for unimodal bias Paper GitHub

         - Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics Paper

Detection Tools

Modality Tool Company Link Type Open Source Cost
Text AI Content Detector Copyleaks Link Webapp & API Limited free usage
AI Content Detector, ChatGPT detector ZeroGPT Link Webapp & API Free usage
AI Detector GPTZero Link Multi-platform Limited free usage
AI Content Detector Winston AI Link Webapp & API Limited free usage
AI Content Detector Crossplag Link Webapp Limited free usage
Giant Language model Test Room GLTR Link Webapp Free usage
The AI Detector Brandwell Link Webapp Free usage
AI Checker Originality ai Link Webapp & API Limited free usage
Advanced AI Detector and Humanizer Undetectable ai Link Webapp & API Limited free usage
AI Content Detector Writer Link Webapp & API Limited free usage
AI Content Detector Conch Link Webapp Limited free usage
Illuminarty Text Illuminarty Link Webapp & API Limited free usage
AI-Generated Text Detector Is it AI Link Webapp & API Limited free usage
Image Liveness Detection, Facial Recognition Incode Link Multi-platform Paid
AI or Not image AI or Not Link Webapp & API Limited free usage
AI-Generated Image Detector Is it AI Link Webapp & API Limited free usage
Illuminarty Image Illuminarty Link Webapp & API Limited free usage
AI Image Detector Undetectable ai Link Webapp & API Limited free usage
SynthID Google Link Webapp Free usage
The AI image detector Winston Link Webapp & API Limited free usage
Advanced AI Image Detector Brandwell Link Webapp Limited free usage
Video Deepware Scanner Deepware Link Webapp & API Free usage
Attestiv Deepfake Video Detection Attestiv Link Webapp & API Limited free usage
Audio Pulse Inspect Pindrop Link Multi-platform Paid
AI Voice Detector AI Voice Detector Link Webapp & API Limited free usage
AI Speech Classifier ElevenLabs Link Webapp & API Limited free usage
AI or Not audio AI or Not Link Multi-platform Limited free usage
Multi-modal Video, Image, and Audio Detector Deep Media Link Multi-platform Limited free usage
Deepfake Detection Sensity AI Link Multi-platform Paid
Hive AI’s Deepfake Detection API Hive AI Link API Limited free usage
Resemble Detect Resemble AI Link Webapp & API Limited free usage
DuckDuckGoose AI (Phocus) DuckDuckGoose AI Link Webapp Paid
Sentinel Sentinel Link Webapp Paid
Deepfake Detector Deepfake Detector Link Multi-platform Free usage
DeepFake-o-meter U of Buffalo Link Webapp Free usage
BioID BioID Link Webapp & API Limited free usage
Get Real Protect Get Real Link Multi-platform Paid
Reality Defender Reality Defender Link Multi-platform Paid

Generation Works

Works Time Modality Links
T5 Q4 2019 Text Link
GPT-3 Q2 2020 Text Link
Wave-Grad2 Q1 2021 Audio Link
PanGu Q2 2021 Text Link
LDMs Q4 2021 Image Link
GLIDE Q4 2021 Image Link
Imagen Q2 2022 Image Link
PaLM Q2 2022 Text Link
OPT Q2 2022 Text Link
Make-A-Video Q3 2022 Video Link
GLM Q3 2022 Text Link
HuggingGPT Q3 2022 Multimodal Link
Whisper Q3 2022 Audio Link
ChatGPT Q4 2022 Text Link
DALL-E 2 Q4 2022 Image Link
SD Q4 2022 Image Link
mT0 Q4 2022 Text Link
BLOOM Q4 2022 Text Link
Make-An-Audio Q1 2023 Audio Link
GPT-4 Q1 2023 Multimodal Link
Bard Q1 2023 Text Link
LLaMA Q1 2023 Text Link
GEN-1 Q1 2023 Video Link
ImageReward Q2 2023 Image Link
PaLM2 Q2 2023 Text Link
CodeGen2 Q2 2023 Text Link
IF Q2 2023 Image Link
VideoGen Q3 2023 Video Link
DALL-E 3 Q3 2023 Image Link
LLaMA 2 Q3 2023 Text Link
Gemini Q4 2023 Text Link
Emu Edit Q4 2023 Image Link
Emu Video Q4 2023 Video Link
Titan Q4 2023 Image Link
Stable Video Q4 2023 Video Link
MidjourneyV6 Q4 2023 Image Link
Imagen 2 Q4 2023 Image Link
Claude 3.5 Q1 2024 Multimodal Link
aMUSEd Q1 2024 Image Link
Synthesia 2 Q2 2024 Video Link
MultiBooth Q2 2024 Image Link
GPT-4o Q2 2024 Multimodal Link
LLaMA 3 Q2 2024 Multimodal Link
GLM-4 Q2 2024 Multimodal Link
CustomCrafter Q3 2024 Video Link
MegaFusion Q3 2024 Image Link
Qwen 2 Q3 2024 Multimodal Link
Tri-Ergon Q4 2024 Audio Link
Veo 2 Q4 2024 Video Link
Sora Q4 2024 Video Link
AudioX Q1 2025 Audio Link
Grok 3 Q1 2025 Multimodal Link
DeepSeek-V3 Q1 2025 Multimodal Link
Gemini 2.5 Pro Q1 2025 Multimodal Link
LLaMA 4 Q2 2025 Multimodal Link
Qwen 3 Q2 2025 Multimodal Link

Detection Works

Works Time Modality Links
Linguistic Q4 2020 Text Link
XLNet-FT Q1 2021 Text Link
Turing-Bench Q4 2021 Text Link
Unraveling Q2 2022 Text Link
Wavelet Q3 2022 Image Link
Whodunit Q3 2022 Text Link
De-Fake Q4 2022 Multimodal Link
Towards Q4 2022 Image Link
TruFor Q4 2022 Image Link
DIRE Q1 2023 Image Link
GPTZero Q1 2023 Text Link
DetectGPT Q1 2023 Text Link
HAMMER Q2 2023 Multimodal Link
DetectVocoder Q2 2023 Audio Link
SeDID Q3 2023 Image Link
RADAR Q3 2023 Text Link
OUTFOX Q3 2023 Image Link
SeqXGPT Q4 2023 Text Link
RevisitVideo Q4 2023 Video Link
Raising Q4 2023 Image Link
Binoculars Q1 2024 Text Link
AI Face Q2 2024 Image Link
DuB3D Q2 2024 Video Link
GECScore Q2 2024 Text Link
FFAA Q3 2024 Multimodal Link
Breaking Q4 2024 Image Link
B-Free Q4 2024 Image Link
ForgeryGPT Q4 2024 Multimodal Link
FakeShield Q4 2024 Multimodal Link
GenVidBench Q1 2025 Video Link

About

This repository contains a collection of resources and papers on Detecting Multimedia Generated by Large AI Models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •