Replies: 5 comments 13 replies
-
Do you think that object detection datasets with a single label can be useful for DALL-E? It seems that the information content is quite low; but it could still learn the object names and various views and their typical backgrounds. |
Beta Was this translation helpful? Give feedback.
-
YFCC100m could also be added to the list: #110 (comment) as well as the conceptual captions dataset consisting of 3,318,333 image/caption pairs: https://ai.google.com/research/ConceptualCaptions/ |
Beta Was this translation helpful? Give feedback.
-
@afiaka87 good point with the with the copyright issue - I will always also add the license agreements for new datasets i find: 2.5 million images from 205 scene categories under the Create Common License One million labeled images for each of 10 scene categories and 20 object categories. (maybe a little bit too reductionist), I didn't find any information on license agreements despite "If you find LSUN dataset useful in your research, please consider citing"... 2,686,419 of AI-generated faces, e.g. "A white male with long black hair facing left" or "A black woman facing right with short white hair."
Bulk download request under: work.with@generated.photos 8,456,240 (or 6,464,018 cleaned) images of 94,682 celebrities under GNU General Public License v3.0 |
Beta Was this translation helpful? Give feedback.
-
Here is the resized coco dataset as a torrent: https://academictorrents.com/details/eea5a532dd69de7ff93d5d9c579eac55a41cb700 |
Beta Was this translation helpful? Give feedback.
-
https://github.com/rom1504/kaggle-fashion-dalle/releases/tag/1.0.0 contains preprocessed for dalle of https://www.kaggle.com/paramaggarwal/fashion-product-images-dataset |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
We'll need lots of data to train dalle-pytorch to the level OpenAI has with DALLE. If you find any new or interesting datasets that are either captioned or could have captions generated for them using class-name, etc. then please post here and I'll update the list:
We'll download these with aria2c, gdown and wget (for the WIT links). Make sure to:
apt install wget
python3 -m pip install aria2c gdown
COCO 2014 Resized to 256x256
edit: If someone could rehost this for me I'd appreciate it. That drive account isnt exactly production ready.
gdown "https://drive.google.com/file/d/1d7_N0Uxf4xYSSS-VcIVt4lBXFsUIbjvP/view"
Visual Genome
aria2c https://academictorrents.com/download/1bfe6871046860a2ff8c0cc1414318beb35dc916.torrent;
imagenet
aria2c https://academictorrents.com/download/96816a530ee002254d29bf7a61c0c158d3dedc3b.torrent;
STL-10
aria2c https://academictorrents.com/download/a799a2845ac29a66c07cf74e2a2838b6c5698a6a.torrent;
food-101
aria2c https://academictorrents.com/download/470791483f8441764d3b01dbc4d22b3aa58ef46f.torrent;
indoor CVPR
aria2c https://academictorrents.com/download/59aa0ad684e5d849f68bad9a6d43a9000a927164.torrent;
SVHN
aria2c https://academictorrents.com/download/6f4caf3c24803d114c3cae3ab9cb946cd23c7213.torrent;
OpenImagesV6 (only downloads the 256 px versions)
WIT
Here are the links to download the 10 files.
wit_v1.train.all-00000-of-00010.tsv.gz
wit_v1.train.all-00001-of-00010.tsv.gz
wit_v1.train.all-00002-of-00010.tsv.gz
wit_v1.train.all-00003-of-00010.tsv.gz
wit_v1.train.all-00004-of-00010.tsv.gz
wit_v1.train.all-00005-of-00010.tsv.gz
wit_v1.train.all-00006-of-00010.tsv.gz
wit_v1.train.all-00007-of-00010.tsv.gz
wit_v1.train.all-00008-of-00010.tsv.gz
wit_v1.train.all-00009-of-00010.tsv.gz
Beta Was this translation helpful? Give feedback.
All reactions