LoRA Adaptation for Stable Diffusion Containerized with Docker

This README serves as a comprehensive guide for understanding, implementing, and evaluating the LoRA-adapted Stable Diffusion model with Docker containerization. For detailed instructions and insights, refer to the respective sections in the pdf documentation: Documentation_Stable_Diffusion_LoRA_Project.pdf.

Check out my Medium article here: https://medium.com/@weimaychen2/revolutionizing-digital-creativity-harnessing-lora-adapted-stable-diffusion-within-docker-for-3f0419c6612e

Overview

This project focuses on adapting the Stable Diffusion model through LoRA integration using Docker. The adaptation aims to enhance the generation of images based on textual prompts. This README provides an extensive guide on utilizing and understanding the adapted model.

Quick Demo

landscape.ipynb and single_object.ipynb explore Single LoRa and multiple LoRA integration with Stable Diffusion model

Files and Folders

Files

config.yaml: Configuration file for base_model and LoRAs. Dockerfile: Used to build and run the Docker image. main.py: Main pipeline for generating images. landscape.ipynb: Jupyter notebook for demo with the prompt "A beautiful sky". single_object.ipynb: Jupyter notebook for demo with the prompt "A green pokemon with blue eyes". Evaluation_img_generation.ipynb: Jupyter notebook for evaluating generated images.

Folders

base_model: Contains the Stable diffusion + LoRA integration model class. generated_images: Stores generated images, organized by text prompts. LoRAs: Contains LoRA model weights in .safetensors format.

Specific Task(s) within Game Creation

The adapted model can be applied in game creation for:

Facilitating Asset Creation: Converting text into 2D images, which can further be transformed into 3D assets.
Concept Ideation: Generating visual themes for story concepts and ideas.
Stable Diffusion Model and LoRA Integration Process

General Overview

The main process of generating images involves:

Loading pretrained Stable Diffusion model.
Loading LoRA model weights into the Stable Diffusion model.
Generating images based on input text prompts.

Extended Detailed Process

The process includes loading LoRA weights, parameter adjustments, and image generation. Optional steps for adjusting LoRA impact are also explained.

LoRA Integration Results

Integration results for single and multiple LoRAs are discussed. It's noted that including LoRA tags in text prompts generally improves results.

Examples:

"A green pokemon with blue eyes", pixel LoRA:

"A beautiful sky", easter and jellyfish forest LoRA

Parameter Adjustments: StableDifusionPipeline

Key parameters include: text prompt, num_inference_steps, and guidance_scale.

Optimal balance requires experimentation.

Running the Code

Download LoRAs in safetensors format and save them to LoRAs folder in the current directory
Instructions for setting up the environment and running the code via examples are provided. Refer to config.yaml for the

List of base_models (currently only 1): runwayml/stable-diffusion-v1-5

List of LoRA model names: easter_egg, basepixel, jellyfish_forest, wanostyle, moxinstyle

The following example defines all possible available arguments (does not use any default arguments)

python3 main.py --text-prompt "An underwater adventure" --lora-name "jellyfish_forest" --fuse-lora-scale 0.9 --height 1024 --width 640 --num-inference-steps 17 --guidance-scale 8.4 --seed-num 13 --num-imgs 3

Docker containerization steps are also outlined.

docker build -t stable-diffusion-lora .

docker run --gpus all -p 8080:8080 stable-diffusion-lora

docker save stable-diffusion-lora:latest > stable-diffusion-lora.tar

docker load < stable-diffusion-lora.tar

docker run --gpus all -v $(pwd)/generated_images:/app/generated_images stable-diffusion-lora:latest --text-prompt "An underwater adventure" --lora-name "jellyfish_forest" --fuse-lora-scale 0.9 --height 1024 --width 640 --num-inference-steps 17 --guidance-scale 8.4 --seed-num 13 --num-imgs 3

Evaluation Strategy for the Adapted Model

Comprehensive evaluation strategies, including qualitative (creativity/novelty, visual quality) and quantitative (Inception Score) approaches, are detailed in the documentation and evaluate_img_generation.ipynb notebook.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LoRA Adaptation for Stable Diffusion Containerized with Docker

Overview

Quick Demo

Files and Folders

Files

Folders

Specific Task(s) within Game Creation

General Overview

Extended Detailed Process

LoRA Integration Results

Examples:

Parameter Adjustments: StableDifusionPipeline

Running the Code

Evaluation Strategy for the Adapted Model

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
base_model		base_model
demo_images		demo_images
generated_images		generated_images
Dockerfile		Dockerfile
Documentation_Stable_Diffusion_LoRA_Project.pdf		Documentation_Stable_Diffusion_LoRA_Project.pdf
README.md		README.md
config.yaml		config.yaml
evaluate_img_generation.ipynb		evaluate_img_generation.ipynb
landscape.ipynb		landscape.ipynb
main.py		main.py
requirements.txt		requirements.txt
single_object.ipynb		single_object.ipynb

Folders and files

Latest commit

History

Repository files navigation

LoRA Adaptation for Stable Diffusion Containerized with Docker

Overview

Quick Demo

Files and Folders

Files

Folders

Specific Task(s) within Game Creation

General Overview

Extended Detailed Process

LoRA Integration Results

Examples:

Parameter Adjustments: StableDifusionPipeline

Running the Code

Evaluation Strategy for the Adapted Model

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages