Detect-LAIM-generated-Multimedia-Survey

This repository contains a collection of resources and papers on Detecting Multimedia Generated by Large AI Models: A Survey

The references of those works are displayed in Generation Works and Detection Works.

Please let us know if you find a mistake, or if we have missed your wonderful work by e-mail: lin1785@purdue.edu, hu968@purdue.edu, gupt1031@purdue.edu

If you find our survey useful for your research, please cite the following Paper

@article{lin2024detecting,
  title={Detecting Multimedia Generated by Large AI Models: A Survey},
  author={Lin, Li and Gupta, Neeraj and Zhang, Yue and Ren, Hainan and Liu, Chun-Hao and Ding, Feng and Wang, Xin and Li, Xin and Verdoliva, Luisa and Hu, Shu},
  journal={arXiv preprint arXiv:2402.00045},
  year={2024}
}

💻 Contents

📈 Related Work

- A Survey on Detection of LLMs-Generated Content Paper GitHub

- A Survey on LLM-generated Text Detection: Necessity, Methods, and Future Directions Paper GitHub

- Towards possibilities & impossibilities of ai-generated text detection: A survey Paper

- Machine-generated text: A comprehensive survey of threat models and detection methods Paper

- The Age of Synthetic Realities: Challenges and Opportunities Paper

- GenAI against humanity: Nefarious applications of generative artificial intelligence and large language models Paper

- A Comprehensive Survey of Fake Text Detection on Misinformation and LM-Generated Texts Paper

- Recent Advances on Generalizable Diffusion-generated Image Detection Paper

- Survey on AI-Generated Media Detection: From Non-MLLM to MLLM Paper

- Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey Paper

Generation

Illustrations of different types of multimedia generation process based on LAIMs.

Public Datasets for Detection

Please read the column I20(Input-to-Output) with these abbreviations:

T2T: Text-to-Text
V2T: Video-to-Text
T2I: Text-to-Image
I2I: Image-to-Image
T2A: Text-to-Audio
I.A2V: (Image conditioned with Audio)-to-Video

Modality	Dataset	Year	BM	Content	Link	I2O	#Real	#Generated	Source of Real Media	Generative Method
Text	TuringBench	2021	✔	News	Link	T2T	8,854	159,758	News Media	GPT-1&2&3, CTRL, GROVER
	Paraphrase	2022		Essays	Link	T2T	98,280	163,710	Arxiv, Wikipedia, Theses	GPT-3, T5
	SynSCiPass	2022		Passages	Link	T2T	99,989	99,989	Scientific papers	GPT-2, BLOOM
	MAGE	2023	✔	General	Link	T2T	154,078	294,381	Reddit, EL15, Yelp, XSum	27 LLMs
	Stu.Essays	2023		Essays	Link	T2T	1,000	6,000	Ivy Panda	ChatGPT
	Writing	2023		Stories	Link	T2T	1,000	6,000	Reddit WritingPrompts	ChatGPT
	News	2023		News	Link	T2T	1,000	6,000	Reuters 50-50	ChatGPT
	OUTFOX	2023	✔	News	Link	T2T	15,400	15,400	Feedback Prize	ChatGPT, GPT-3.5, T5
	MULTITuDE	2023	✔	Essays	Link	T2T	7,992	16,005	MassiveSumm	GPT-3&4, ChatGPT
	MGTDetect-CoCo	2023		News	Link	T2T	10,486	10,484	News Outlets	GPT-3.5
	HPPT	2023		Abstracts	Link	T2T	1,000	1,000	ACL Anthology	ChatGPT
	HC-Var	2023		General	Link	T2T	90,096	90,096	XSum , IMDb, Yelp, FiQA	ChatGPT
	HC3	2023	✔	General	Link	T2T	26,903	58,546	FiQA , EL15 , MediaDialog	ChatGPT
	M4	2023	✔	General	Link	T2T	32,799	58,803	WikiHow , Arxiv, Reddit	ChatGPT, GPT-3.5, LLaMA, T5, BLOOM
	F3	2023		Social Media	Link	T2T	12,723	27,667	Politifact , Snopes	GPT-3.5
	MixSet	2024		General	Link	T2T	3,600	3,600	Email , BBC News, ArXiv	GPT-4, LLaMA2
	GPABench	2024	✔	Writing	Link	T2T	150,000	450,000	Arxiv	GPT-3.5
	M4GT-Bench	2024		General	Link	T2T	119,771	119,388	Wikipedia, WikiHow, Reddit, ArXiv, News	10 LLMs
	RAID	2024	✔	General	Link	T2T	14,917	6,287,820	Public datasets from 8 domains	11 LLMs
	DetectRL	2024		General	Link	T2T	100,800	134,400	Writing Prompts , Yelp	GPT-3.5, PaLM2, Claude, LLaMA2
	MultiSocial	2024		Social Media	-	T2T	58,000	414,000	Gab, Discord, WhatsApp	7 LLMs
	SM-D	2024		Social Media	-	T2T	-	-	Medium, Quora, Reddit	Sourced from social media
Image	DFF	2023		Face	Link	T/I2I	30,000	90,000	IMDB-WIKI	SDMs, InsightFace
	RealFaces	2023		Face	Link	T2I	258	258	Prompts	SDMs
	OHImg	2023		Overhead	Link	T/I2I	6,475	6,675	MapBox , Google Maps	GLIDE, DDPM
	Western Blot	2022		Biology	Link	T/I2I	~14,000	-	Western Blot	DDPM, Pix2pix, CycleGAN
	Synthbuster	2023		General	Link	T2I	-	9,000	Raise-1K	DALL-E 2&3, Midjourney, SDMs, GLIDE
	GenImage	2023	✔	General	Link	T/I2I	1,331,167	1,350,000	ImageNet	SDMs, Midjourney, BigGAN
	CIFAKE	2023		General	Link	T/I2I	60,000	60,000	CIFAR-10	SD-V1.4
	AutoSplice	2023		General	Link	T2I	2,275	3,621	Visual News	DALL-E 2
	DiffusionDB	2023		General	Link	T/I2I	3,300,000	16,000,000	DiscordChatExporter	SD
	Artifact	2023		General	Link	T/I2I	1,749	960,894	COCO, FFHQ , COCO, LSUN	SDMs, DDPM, LDM, CIPS
	HiFi-FIDL	2023	✔	General	Link	T/I2I	~60,000	1,300,000	FFHQ , COCO, LSUN	DDPM, GLIDE, LDM, GANs
	DiffForensics	2023		General	Link	T/I2I	232,000	232,000	LSUN, ImageNet	LDM, DDPM, VQDM, ADM
	CocoGlide	2023		General	Link	T/I2I	512	512	COCO	GLIDE
	LSUNDB	2023		General	Link	T/I2I	250,000	250,000	LSUN	DDPM, LDM, StyleGAN
	UniFake	2023		General	Link	T/I2I	8,000	8,000	LAION-400M	LDM, GLIDE
	REGM	2023		General	Link	T/I2I	116,000	116,000	CelebA , LSUN	116 publicly available GMs
	DMImage	2023		General	Link	T/I2I	200,000	200,000	COCO, LSUN	LDM
	AIGCD	2023	✔	General	Link	T/I2I	360,000	580,000	LSUN, COCO, FFHQ	SDMs, GANs, ADM, DALL-E 2, GLIDE
	DIF	2023	✔	General	Link	T/I2I	34,800	54,500	LAION-585	SDMs, DALL-E 2, GLIDE, GANS
	Fake2M	2024	✔	General	Link	T/I2I	2,300,000	-	CC3M	SD-V1.5, IOI, IF , StyleGAN3
	SID-Set	2024		Social Media	Link	T/I2I	100,000	100,000	COCO, Flickr30K, MagicBrush	FLUX
	Chameleon	2024		Social Media	Link	T/I2I	14,863	11,170	Unsplash	GANs, SDMs, DALL-E 2, GLIDE
	DF40	2024		Face	Link	T/I2I	~1,100	~1,000,000	FF++, CDF, FFHQ, CelebA	SDMs, GANs, Midjourney, DDPM
	FakeBench	2024		General	Link	T/I2I	3,000	3,000	10 Public Datasets	10 Generative Models
	AI-Face	2024	✔	Face	Link	T/I2I	400,885	1,245,660	6 Public datasets	SDMs, GANs, Midjourney, IF
Video	WildDeepfake	2021		Face	Link	I.A2V	3,805	3,509	Social Media	Social Media
	DiffHead	2023		Face	Link	I.A2V	820	-	CREMA	Diffused Heads: build on DDPM
	DVF	2024		General	Link	I/T2V	2,750	3,938	Intervid , Youtube-8M	8 Diffusion Models
	GenVideo	2024	✔	General	Link	I/T2V	1,223,511	1,078,838	Kinetics-400 , Youku-mPLUG, MSR-VTT	20 Generative Models
	GenVidBench	2025	✔	General	Link	I/T2V	33,931	110,400	Vript , HDL-VG-130M	8 Generative Models
	PDID	2024		Social Media	Link	-	-	-	Social Media	Social Media
Audio	In-the-Wild	2022		Speech	Link	T2A	20.7 hours	17.2 hours	Social Media, Video Streaming Platforms	Social Media, Video Streaming Platforms
	LibriSeVoc	2023		Speech	Link	T2A	13,201	79,206	LibriTTS	DiffWave, WaveNet
	SONAR	2024		Speech	Link	T2A	-	2,274	LibriTTS	OpenAI, Seed-TTS, AudioGen
	ASVspoof 2024	2024	✔	Speech	Link	T/A2A	~289,527	~1,211,186	MLS-English	32 Manipulation Methods
Multi-modal	DGM^4	2023		News	Link	T/I2T	77,426	152,574	Visual News	B-GST, StyleCLIP, HFGI
	COCOFake	2023		General	Link	T/I2T	113,287	566,435	COCO	SDMs
	AV-Deepfake1M	2023	✔	Face	Link	T2A	286,721	860,039	Voxceleb2	VITS, YourTTS, TalkLip
	D³	2024		General	Link	T2I	~2,300,000	~9,200,000	LAION-400M	SDMs, IF
	M³A	2024		News	-	T2T/I/T/V/V/A/T T/I/V/V/A2T	708,425	6,566,386	60 News Outlets	LLaMA2, GPT-4, GLIDE, SD, Tango
	LOKI	2024		General	Link	T2T/I/T/V/V/T T/I/V/V/A2T	~9,000	~9,000	21 Public Datasets	43 Generative Models
	MMFakeBench	2024	✔	Social Media	Link	T2T/I	-	~11,000	MS-COCO, VisualNews, Reddit, FEVER	GPT-3.5, SD-XL, DALL-E 3, Midjourney
	Deepfake-Eval	2024		Social Media	Link	T2T/A/V	3,390	2,441	Social Media	Social Media
	ILLUSION	2025	✔	General	Link	T2A/I I2I	139,740	1,232,246	CelebV-Text [158], COCO, MusicCaps , Social Media	28 Generative Methods

🔎 Detection 🔥

📄 Text

Pure Detection

Illustrations of pure detection methodologies for LAIM-generated text.

♣️ Easy Explainable Methods

▶️ Watermarking

- Distillation-Resistant Watermarking for Model Protection in NLP Paper

- Three bricks to consolidate watermarks for large language models Paper GitHub

- Robust multi-bit natural language watermarking through invariant features Paper

- Undetectable Watermarks for Language Models Paper

- Robust distortion-free watermarks for language models Paper

- Provable robust watermarking for ai-generated text Paper GitHub

- A Private Watermark for Large Language Models Paper

▶️ Non-watermarking

- Unraveling the mystery of artifacts in machine generated text Paper

- Stylometric detection of ai-generated text in twitter timelines Paper

- CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning Paper

- Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT Paper GitHub

- Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore Paper GitHub

♣️ Hard Explainable Methods

- HowkGPT: Investigating the Detection of ChatGPT-generated University Student Homework through Context-Aware Perplexity Analysis Paper

- GPTZero Tool

- Detectgpt: Zero-shot machine-generated text detection using probability curvature Paper GitHub

- Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text Paper GitHub

- Multiscale Positive-Unlabeled Detection of AI-Generated Texts Paper GitHub

Beyond Detection

Illustrations of beyond detection methodologies for LAIM-generated text.

♣️ Efficiency

- Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model Paper

- Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature Paper GitHub

- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text Paper GitHub

- SeqXGPT: Sentence-Level AI-Generated Text Detection Paper GitHub

- Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection Paper GitHub

♣️ Attribution

- TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text Generation Paper Turingbench

- Whodunit? Learning to Contrast for Authorship Attribution Paper

- Through the looking glass: Learning to attribute synthetic text generated by language models Paper

- TopRoBERTa: Topology-Aware Authorship Attribution of Deepfake Texts Paper

- Authorship attribution for neural text generation Paper GitHub

- Gpt-who: An information density-based machine-generated text detector Paper

- LLMDet: A Third Party Large Language Models Generated Text Detection Tool Paper GitHub

- Few-Shot Detection of Machine-Generated Text using Style Representations Paper

- Origin Tracing and Detecting of LLMs Paper

♣️ Generalization

- Ghostbuster: Detecting Text Ghostwritten by Large Language Models Paper

- Conda: Contrastive domain adaptation for ai-generated text detection Paper GitHub

- Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features Paper GitHub

- DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning Paper GitHub

- Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts Paper GitHub

♣️ Interpretability

- DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text Paper GitHub

- A Watermark for Large Language Models Paper GitHub

- Chatgpt or human? detect and explain. explaining decisions of machine learning model for detecting short chatgpt-generated text Paper

- Check Me If You Can: Detecting ChatGPT-Generated Academic Writing using CheckGPT Paper

- Is chatgpt involved in texts? measure the polish ratio to detect chatgpt-generated text Paper

♣️ Robustness

▶️ Adversarial Attack Robustness

- Red Teaming Language Model Detectors with Language Models Paper

- Radar: Robust ai-text detection via adversarial learning Paper Project Page

- J-guard: Journalism guided adversarially robust detection of ai-generated news Paper

- Outfox: Llm-generated essay detection through in-context learning with adversarially generated examples Paper

▶️ LAIM-Polished Robustness

- Is chatgpt involved in texts? measure the polish ratio to detect chatgpt-generated text Paper

♣️ Empirical Study

- ChatLog: Recording and Analyzing ChatGPT Across Time Paper GitHub

- On the Zero-Shot Generalization of Machine-Generated Text Detectors Paper

- On the Generalization of Training-based ChatGPT Detection Methods Paper

- Supervised Machine-Generated Text Detectors: Family and Scale Matters Paper GitHub

- Deepfake Text Detection in the Wild Paper GitHub

- How large language models are transforming machine-paraphrased plagiarism Paper

- Paraphrase Detection: Human vs. Machine Content Paper

- MGTBench: Benchmarking Machine-Generated Text Detection Paper GitHub

- How close is chatgpt to human experts? comparison corpus, evaluation, and detection Paper GitHub

- Can LLM-Generated Misinformation Be Detected? Paper GitHub

- From Text to Source: Results in Detecting Large Language Model-Generated Content Paper

📸 Image

Pure Detection

Illustrations of pure detection methodologies for LAIM-generated image.

♣️ Physical/Physiological based Methods

- Qualitative Failures of Image Generation Models and Their Application in Detecting Deepfakes Paper

- Perspective (in) consistency of paint by text Paper

- Lighting (in) consistency of paint by text Paper

♣️ Diffuser Fingerprints based Methods

- Deep Image Fingerprint: Accurate And Low Budget Synthetic Image Detector Paper

- DIRE for Diffusion-Generated Image Detection Paper GitHub

- Exposing the Fake: Effective Diffusion-Generated Images Detection Paper

- LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection Paper GitHub

- Aligned Datasets Improve Detection of Latent Diffusion-Generated Images Paper GitHub

- Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images Paper GitHub

♣️ Spatial-based Methods

- Rich and Poor Texture Contrast: A Simple yet Effective Approach for AI-generated Image Detection Paper Project Page

- Unmasking The Artist: Discriminating Human-Drawn And AI-Generated Human Face Art Through Facial Feature Analysis Paper

- Detecting images generated by deep diffusion models using their local intrinsic dimensionality Paper

♣️ Frequency-based Methods

- Wavelet-packets for deepfake image analysis and detection Paper GitHub

- AUSOME: authenticating social media images using frequency analysis Paper

- AI-Generated Image Detection using a Cross-Attention Enhanced Dual-Stream Network Paper

- Synthbuster: Towards Detection of Diffusion Model Generated Images Paper

- Faster Than Lies: Real-time Deepfake Detection using Binary Neural Networks Paper GitHub

♣️ Distribution-based Methods

- Zero-Shot Detection of AI-Generated Images Paper GitHub

Beyond Detection

Illustrations of beyond detection methodologies for LAIM-generated image.

♣️ Attribution and Model Parsing

▶️ Attribution and Model Parsing

- Level up the deepfake detection: a method to effectively discriminate images generated by gan architectures and diffusion models Paper

- Reverse engineering of generative models: Inferring model hyperparameters from generated images Paper

♣️ Generalization

- Online Detection of AI-Generated Images Paper

- Towards universal fake image detectors that generalize across generative models Paper GitHub

- Raising the Bar of AI-generated Image Detection with CLIP Paper

- Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection Paper

- Fingerprintnet: Synthesized fingerprints for generated image detection Paper

- Detecting Deepfakes Without Seeing Any Paper GitHub

- Improving Synthetically Generated Image Detection in Cross-Concept Settings Paper

- Diffusion Noise Feature: Accurate and Fast Generated Image Detection Paper

- Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities Paper GitHub

- HRR: Hierarchical Retrospection Refinement for Generated Image Detection Paper

- A Sanity Check for AI-generated Image Detection Paper GitHub

- Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection Paper GitHub

- A Bias-Free Training Paradigm for More General AI-generated Image Detection Paper GitHub

- Breaking Semantic Artifacts for Generalized AI-generated Image Detection Paper GitHub

- Dual Data Alignment Makes AI‑Generated Image Detector Easier Generalizable Paper

♣️ Interpretability

- Interpretable-through-prototypes deepfake detection for diffusion models Paper GitHub

- Did You Note My Palette? Unveiling Synthetic Images Through Color Statistics Paper

♣️ Localization

▶️ Fully-supervised

- Hierarchical fine-grained image forgery detection and localization Paper GitHub

- Perceptual Artifacts Localization for Image Synthesis Tasks Paper GitHub

- TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization Paper GitHub

- UnionFormer: Unified-Learning Transformer with Multi-View Representation for Image Manipulation Detection and Localization Paper

▶️ Weakly-supervised

- Weakly-supervised deepfake localization in diffusion-generated images Paper

♣️ Robustness

▶️ Adversarial Attack Robustness

- D4: Detection of Adversarial Diffusion Deepfakes Using Disjoint Ensembles Paper

- Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection Paper

- All Patches Matter, More Patches Better: Enhance AI‑Generated Image Detection via Panoptic Patch Learning Paper

▶️ Post-Processing Robustness

- GLFF: Global and Local Feature Fusion for AI-synthesized Image Detection Paper

- Exposing fake images generated by text-to-image diffusion models Paper

- Local Statistics for Generative Image Detection Paper

♣️ Empirical Study

- On the detection of synthetic images generated by diffusion models Paper GitHub

- Intriguing properties of synthetic images: from generative adversarial networks to diffusion models Paper

- Towards the detection of diffusion model deepfakes Paper

- Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis Paper

- On the use of Stable Diffusion for creating realistic faces: from generation to detection Paper

- Finding AI-Generated Faces in the Wild Paper

- Forensic analysis of synthetically generated western blot images Paper

- Beyond Human Forgeries: An Investigation into Detecting Diffusion-Generated Handwriting Paper

🎞️ Video

Illustration of detection methodology in generalization task for LAIM-generated video.

Pure Detection

♣️ Spatial & Temporal based Methods

- Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features Paper

- Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method Paper

Beyond Detection

♣️ Generalization

- Revisiting Generalizability in Deepfake Detection: Improving Metrics and Stabilizing Transfer Paper

♣️ Empirical Study

- Beyond Deepfake Images: Detecting AI-Generated Videos Paper

🎵 Audio

Pure Detection

The artifacts introduced by DM-based neural vocoders (WaveGrad and DiffWave) to a voice signal. The differences in mel-spectrograms between real and generated ones are illustrated in the third and fifth columns.

♣️ Vocoder-based

- AI-Synthesized Voice Detection Using Neural Vocoder Artifacts Paper GitHub

Beyond Detection

♣️ Generalization

- Improving Generalization for AI-Synthesized Voice Detection Paper GitHub

🍯 Multimodal

Pure Detection

Illustrations of pure detection methodologies for LAIM-generated multimodal media.

♣️ Prompt-guided

- Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images Paper

- On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection Paper GitHub

- Human Action CLIPS: Detecting AI-generated Human Motion Paper

♣️ Text-image Inconsistency

- Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News Paper GitHub

- Exposing Text-Image Inconsistency Using Diffusion Models Paper

Beyond Detection

Illustrations of beyond detection methodologies for LAIM-generated multimodal media.

♣️ Attribution

- De-fake: Detection and attribution of fake images generated by text-to-image generation models Paper

- FIDAVL: Fake Image Detection and Attribution using Vision-Language Model Paper GitHub

♣️ Generalization

▶️ Prompt Tuning

- AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors Paper GitHub

▶️ Contrastive Learning

- Generalizable Synthetic Image Detection via Language-guided Contrastive Learning Paper GitHub

♣️ Interpretability

- Combating Misinformation in the Era of Generative AI Models Paper

- FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant Paper GitHub

- X^2-DFD: A framework for eXplainable and eXtendable deepfake detection Paper

♣️ Localization

▶️ Spatial-based

- Detecting and grounding multi-modal media manipulation Paper

- Exploiting Modality-Specific Features For Multi-Modal Manipulation Detection And Grounding Paper

▶️ Frequency-based

- Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-Modal Manipulation Paper

▶️ MLLM-based

- FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models Paper GitHub

- ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization Paper

♣️ Empirical Study

- Detecting Images Generated by Diffusers Paper GitHub

- CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection Paper

- VERITE: a Robust benchmark for multimodal misinformation detection accounting for unimodal bias Paper GitHub

- Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics Paper

Detection Tools

Modality	Tool	Company	Link	Type	Open Source	Cost
Text	AI Content Detector	Copyleaks	Link	Webapp & API	✗	Limited free usage
	AI Content Detector, ChatGPT detector	ZeroGPT	Link	Webapp & API	✗	Free usage
	AI Detector	GPTZero	Link	Multi-platform	✗	Limited free usage
	AI Content Detector	Winston AI	Link	Webapp & API	✗	Limited free usage
	AI Content Detector	Crossplag	Link	Webapp	✗	Limited free usage
	Giant Language model Test Room	GLTR	Link	Webapp	✗	Free usage
	The AI Detector	Brandwell	Link	Webapp	✗	Free usage
	AI Checker	Originality ai	Link	Webapp & API	✗	Limited free usage
	Advanced AI Detector and Humanizer	Undetectable ai	Link	Webapp & API	✗	Limited free usage
	AI Content Detector	Writer	Link	Webapp & API	✗	Limited free usage
	AI Content Detector	Conch	Link	Webapp	✗	Limited free usage
	Illuminarty Text	Illuminarty	Link	Webapp & API	✗	Limited free usage
	AI-Generated Text Detector	Is it AI	Link	Webapp & API	✗	Limited free usage
Image	Liveness Detection, Facial Recognition	Incode	Link	Multi-platform	✗	Paid
	AI or Not image	AI or Not	Link	Webapp & API	✗	Limited free usage
	AI-Generated Image Detector	Is it AI	Link	Webapp & API	✗	Limited free usage
	Illuminarty Image	Illuminarty	Link	Webapp & API	✗	Limited free usage
	AI Image Detector	Undetectable ai	Link	Webapp & API	✗	Limited free usage
	SynthID	Google	Link	Webapp	✗	Free usage
	The AI image detector	Winston	Link	Webapp & API	✗	Limited free usage
	Advanced AI Image Detector	Brandwell	Link	Webapp	✗	Limited free usage
Video	Deepware Scanner	Deepware	Link	Webapp & API	✓	Free usage
	Attestiv Deepfake Video Detection	Attestiv	Link	Webapp & API	✗	Limited free usage
Audio	Pulse Inspect	Pindrop	Link	Multi-platform	✗	Paid
	AI Voice Detector	AI Voice Detector	Link	Webapp & API	✗	Limited free usage
	AI Speech Classifier	ElevenLabs	Link	Webapp & API	✗	Limited free usage
	AI or Not audio	AI or Not	Link	Multi-platform	✗	Limited free usage
Multi-modal	Video, Image, and Audio Detector	Deep Media	Link	Multi-platform	✗	Limited free usage
	Deepfake Detection	Sensity AI	Link	Multi-platform	✗	Paid
	Hive AI’s Deepfake Detection API	Hive AI	Link	API	✗	Limited free usage
	Resemble Detect	Resemble AI	Link	Webapp & API	✗	Limited free usage
	DuckDuckGoose AI (Phocus)	DuckDuckGoose AI	Link	Webapp	✗	Paid
	Sentinel	Sentinel	Link	Webapp	✗	Paid
	Deepfake Detector	Deepfake Detector	Link	Multi-platform	✗	Free usage
	DeepFake-o-meter	U of Buffalo	Link	Webapp	✗	Free usage
	BioID	BioID	Link	Webapp & API	✗	Limited free usage
	Get Real Protect	Get Real	Link	Multi-platform	✗	Paid
	Reality Defender	Reality Defender	Link	Multi-platform	✗	Paid

Generation Works

Works	Time	Modality	Links
T5	Q4 2019	Text	Link
GPT-3	Q2 2020	Text	Link
Wave-Grad2	Q1 2021	Audio	Link
PanGu	Q2 2021	Text	Link
LDMs	Q4 2021	Image	Link
GLIDE	Q4 2021	Image	Link
Imagen	Q2 2022	Image	Link
PaLM	Q2 2022	Text	Link
OPT	Q2 2022	Text	Link
Make-A-Video	Q3 2022	Video	Link
GLM	Q3 2022	Text	Link
HuggingGPT	Q3 2022	Multimodal	Link
Whisper	Q3 2022	Audio	Link
ChatGPT	Q4 2022	Text	Link
DALL-E 2	Q4 2022	Image	Link
SD	Q4 2022	Image	Link
mT0	Q4 2022	Text	Link
BLOOM	Q4 2022	Text	Link
Make-An-Audio	Q1 2023	Audio	Link
GPT-4	Q1 2023	Multimodal	Link
Bard	Q1 2023	Text	Link
LLaMA	Q1 2023	Text	Link
GEN-1	Q1 2023	Video	Link
ImageReward	Q2 2023	Image	Link
PaLM2	Q2 2023	Text	Link
CodeGen2	Q2 2023	Text	Link
IF	Q2 2023	Image	Link
VideoGen	Q3 2023	Video	Link
DALL-E 3	Q3 2023	Image	Link
LLaMA 2	Q3 2023	Text	Link
Gemini	Q4 2023	Text	Link
Emu Edit	Q4 2023	Image	Link
Emu Video	Q4 2023	Video	Link
Titan	Q4 2023	Image	Link
Stable Video	Q4 2023	Video	Link
MidjourneyV6	Q4 2023	Image	Link
Imagen 2	Q4 2023	Image	Link
Claude 3.5	Q1 2024	Multimodal	Link
aMUSEd	Q1 2024	Image	Link
Synthesia 2	Q2 2024	Video	Link
MultiBooth	Q2 2024	Image	Link
GPT-4o	Q2 2024	Multimodal	Link
LLaMA 3	Q2 2024	Multimodal	Link
GLM-4	Q2 2024	Multimodal	Link
CustomCrafter	Q3 2024	Video	Link
MegaFusion	Q3 2024	Image	Link
Qwen 2	Q3 2024	Multimodal	Link
Tri-Ergon	Q4 2024	Audio	Link
Veo 2	Q4 2024	Video	Link
Sora	Q4 2024	Video	Link
AudioX	Q1 2025	Audio	Link
Grok 3	Q1 2025	Multimodal	Link
DeepSeek-V3	Q1 2025	Multimodal	Link
Gemini 2.5 Pro	Q1 2025	Multimodal	Link
LLaMA 4	Q2 2025	Multimodal	Link
Qwen 3	Q2 2025	Multimodal	Link

Detection Works

Works	Time	Modality	Links
Linguistic	Q4 2020	Text	Link
XLNet-FT	Q1 2021	Text	Link
Turing-Bench	Q4 2021	Text	Link
Unraveling	Q2 2022	Text	Link
Wavelet	Q3 2022	Image	Link
Whodunit	Q3 2022	Text	Link
De-Fake	Q4 2022	Multimodal	Link
Towards	Q4 2022	Image	Link
TruFor	Q4 2022	Image	Link
DIRE	Q1 2023	Image	Link
GPTZero	Q1 2023	Text	Link
DetectGPT	Q1 2023	Text	Link
HAMMER	Q2 2023	Multimodal	Link
DetectVocoder	Q2 2023	Audio	Link
SeDID	Q3 2023	Image	Link
RADAR	Q3 2023	Text	Link
OUTFOX	Q3 2023	Image	Link
SeqXGPT	Q4 2023	Text	Link
RevisitVideo	Q4 2023	Video	Link
Raising	Q4 2023	Image	Link
Binoculars	Q1 2024	Text	Link
AI Face	Q2 2024	Image	Link
DuB3D	Q2 2024	Video	Link
GECScore	Q2 2024	Text	Link
FFAA	Q3 2024	Multimodal	Link
Breaking	Q4 2024	Image	Link
B-Free	Q4 2024	Image	Link
ForgeryGPT	Q4 2024	Multimodal	Link
FakeShield	Q4 2024	Multimodal	Link
GenVidBench	Q1 2025	Video	Link

Name		Name	Last commit message	Last commit date
Latest commit History 158 Commits
assets		assets
LICENSE		LICENSE
README.md		README.md

License

Purdue-M2/Detect-LAIM-generated-Multimedia-Survey

Folders and files

Latest commit

History

Repository files navigation

Detect-LAIM-generated-Multimedia-Survey

💻 Contents

📈 Related Work

Generation

Public Datasets for Detection

🔎 Detection 🔥

📄 Text

Pure Detection

♣️ Easy Explainable Methods

▶️ Watermarking

▶️ Non-watermarking

♣️ Hard Explainable Methods

Beyond Detection

♣️ Efficiency

♣️ Attribution

♣️ Generalization

♣️ Interpretability

♣️ Robustness

▶️ Adversarial Attack Robustness

▶️ LAIM-Polished Robustness

♣️ Empirical Study

📸 Image

Pure Detection

♣️ Physical/Physiological based Methods

♣️ Diffuser Fingerprints based Methods

♣️ Spatial-based Methods

♣️ Frequency-based Methods

♣️ Distribution-based Methods

Beyond Detection

♣️ Attribution and Model Parsing

▶️ Attribution and Model Parsing

♣️ Generalization

♣️ Interpretability

♣️ Localization

▶️ Fully-supervised

▶️ Weakly-supervised

♣️ Robustness

▶️ Adversarial Attack Robustness

▶️ Post-Processing Robustness

♣️ Empirical Study

🎞️ Video

Pure Detection

♣️ Spatial & Temporal based Methods

Beyond Detection

♣️ Generalization

♣️ Empirical Study

🎵 Audio

Pure Detection

♣️ Vocoder-based

Beyond Detection

♣️ Generalization

🍯 Multimodal

Pure Detection

♣️ Prompt-guided

♣️ Text-image Inconsistency

Beyond Detection

♣️ Attribution

♣️ Generalization

▶️ Prompt Tuning

▶️ Contrastive Learning

♣️ Interpretability

♣️ Localization

▶️ Spatial-based

▶️ Frequency-based

▶️ MLLM-based

♣️ Empirical Study

Detection Tools

Generation Works

Detection Works

About

Resources

License

Uh oh!

Stars

Packages