First things first
This repo is a convenient clone of AttnGAN by MSFT Research. You can find the original repo at here.
AttnGAN original repo contains the training and testing code. But this repo is solely intended to use for generating images from text using MSCOCO pretrained weights.
Pytorch implementation for reproducing AttnGAN results in the paper AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks by Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He. (This work was performed when Tao was an intern with Microsoft Research).
Python version
python 2.7
Dependencies
pytorchpython-dateutileasydictpandastorchfilenltkscikit-image
YML files
*.ymlfiles are example configuration files for training/evaluation our models.
Pretrained Model
- DAMSM for coco. Download and save it to
DAMSMencoders/coco/ - AttnGAN for coco. Download and save it to
models/
Setup script
Run the following script for setting up the dependencies.
./setup.sh
Sampling
- Run the command from project folder.
python2 gen_art.py --gpu 0 --input_text "Mary had a little lamb" --data_dir data/coco --model_path models/coco_AttnGAN2.pth --textencoder_path DAMSMencoders/coco/text_encoder100.pth --output_dir output`
- Output will be stored into
output_dirfolder. - Change the
eval_*.ymlfiles to generate images from other pre-trained models. Default config file iscfg/eval_coco.yml.
