Testing TIFA for my master thesis

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

[Yushi Hu], [Benlin Liu], [Jungo Kasai], [Yizhong Wang], [Mari Ostendorf], [Ranjay Krishna], [Noah A. Smith]

What is TIFA?

Images synthesized by text-to-image models (e.g. Stable Diffusion) often do not follow the text inputs well. TIFA is a simple tool to evaluate the fine-grained alignment between the text and the image. This repository contains the code and models for our paper TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering. This paper is also accepted to ICCV 2023. Please refer to the project page for a quick overview.

Quick start

conda create --name tifa python=3.8
conda activate tifa
pip install -r requirements.txt

Image evaluation

The repository expects the following structure for the folder containing the images to be evaluated:

evaluation/
│── prompt_collection1/
│   │── prompt_collection1-model_name1/
│   │   ├── 000_A bus/
│   │   ├── 001_A bus and a bench/
│   │   ├── ...
│   │── prompt_collection1-model_name2/
│   │   ├── 000_A bus/
│   │   ├── 001_A bus and a bench/
│   │   ├── ...
│   │── prompt_collection1.csv

where:

evaluation: a folder in the root of the project
prompt_collection1: a folder containing everything is needed for the evaluation of that specific prompt collection
prompt_collection1-model_name: a folder containing the generated images, divided by prompt (e.g. 000_A bus, 001_A bus and a bench etc.) by a specific model. More than one folder (and thus more than one model) can be present, all of them will be evaluated.
prompt_collection1.csv: a file containing the data about prompts and bounding boxes used to generate the images

Configuration

In the root of the project there is a file named config.py containing the configuration. In particular, the prompt_collection field should be the same used for the .csv file and the folder in evaluation/ (e.g. prompt_collection1). Also, the tifa_version field defines which version of TIFA should be used:

REGULAR: only the TIFA score (text-alignment) is returned
EXTENDED: both the TIFA score (text-alignment) and AuC (layout-score) is returned

Citation

@article{hu2023tifa,
  title={TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering},
  author={Hu, Yushi and Liu, Benlin and Kasai, Jungo and Wang, Yizhong and Ostendorf, Mari and Krishna, Ranjay and Smith, Noah A},
  journal={arXiv preprint arXiv:2303.11897},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 117 Commits
tifascore		tifascore
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.py		config.py
font.ttf		font.ttf
logger.py		logger.py
requirements.txt		requirements.txt
setup.py		setup.py
tifa_test.py		tifa_test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Testing TIFA for my master thesis

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

What is TIFA?

Quick start

Image evaluation

Configuration

Citation

About

Uh oh!

Releases

Packages

Languages

License

Nikura3/tifa

Folders and files

Latest commit

History

Repository files navigation

Testing TIFA for my master thesis

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

What is TIFA?

Quick start

Image evaluation

Configuration

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages