
Image Captioner

Getting Started

Image Captioner generates a caption for each image you give it.

File descriptions

app.py Main script; run it to start the server.
generate_captions.py Python module that compiles the AI model and makes predictions.
embedding_matrix.pkl Matrix of word embeddings for the vocabulary.
train_descriptions.pkl Dictionary that maps image names to their captions in the training data.
word_to_index.pkl Dictionary that maps each word in the vocabulary to its index number.
index_to_word.pkl Dictionary that maps each index number to its word in the vocabulary.
results Contains sample results from testing.
static Stores the images uploaded by the user for captioning.
templates Contains index.html, the template for the UI.
preparing_data.ipynb Jupyter Notebook to prepare the data for training.
training_model.ipynb Jupyter Notebook to train the model.
generate_captions.ipynb Jupyter Notebook to import all the essentials and generate the captions.
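
The two vocabulary dictionaries above are inverses of each other, and a captioning loop typically uses both: words are fed to the model as indices, and predicted indices are turned back into words. The following is a minimal sketch of greedy decoding under stated assumptions, not the code in generate_captions.py. The `startseq` and `endseq` marker tokens, the `predict_next` callable (a stand-in for the trained model), and the `max_len` limit are all hypothetical.

```python
from typing import Callable, Dict, List, Sequence


def invert_vocab(word_to_index: Dict[str, int]) -> Dict[int, str]:
    """Build an index -> word mapping from a word -> index mapping."""
    return {idx: word for word, idx in word_to_index.items()}


def greedy_caption(
    predict_next: Callable[[List[int]], Sequence[float]],
    word_to_index: Dict[str, int],
    index_to_word: Dict[int, str],
    max_len: int = 20,
    start: str = "startseq",  # assumed start marker, not confirmed by the repo
    end: str = "endseq",      # assumed end marker, not confirmed by the repo
) -> str:
    """Pick the most probable next word until the end marker or max_len."""
    seq = [word_to_index[start]]
    words: List[str] = []
    for _ in range(max_len):
        probs = predict_next(seq)  # one score per vocabulary index
        next_idx = max(range(len(probs)), key=probs.__getitem__)
        word = index_to_word.get(next_idx)
        if word is None or word == end:
            break
        words.append(word)
        seq.append(next_idx)
    return " ".join(words)


def make_scripted_predictor(script: List[int], vocab_size: int):
    """Toy stand-in for a model: always predicts the next index in `script`."""
    def predict_next(seq: List[int]) -> List[float]:
        probs = [0.0] * vocab_size
        probs[script[len(seq) - 1]] = 1.0
        return probs
    return predict_next
```

In real use, `predict_next` would wrap the trained model together with the image feature vector, and the dictionaries would be loaded from word_to_index.pkl and index_to_word.pkl with `pickle.load`.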

External Data

model_weights Folder that contains the models saved over the 40 epochs of training. (Link: https://drive.google.com/open?id=1EzkEjTSQAAlKejyJAwRfKbKqBtDNHwv7)
glove.6B.50d.txt Text file that maps words to their corresponding 50-dimensional vectors. (Link: https://drive.google.com/open?id=1mqHRTOyF87fHoiuRZwOlgcYwcCynQ5Ki)
encoding_train_features.pkl Dictionary that maps training images to their corresponding 2048-dimensional vectors. (Link: https://drive.google.com/open?id=1qO4fgm8qUu0eIslMpg6oqqmcZil5qs9k)
flickr30k_images Training images and their captions. (Link: https://www.kaggle.com/hsankesara/flickr-image-dataset)
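
As an illustration of how glove.6B.50d.txt relates to embedding_matrix.pkl, here is a rough sketch of building an embedding matrix from GloVe text lines. Each line of a GloVe file is a word followed by its space-separated vector components. The function names, the use of plain lists instead of NumPy arrays, and the choice to leave out-of-vocabulary rows as zeros are assumptions made for this example; the notebooks in this repository may do it differently.

```python
from typing import Dict, Iterable, List


def parse_glove(lines: Iterable[str]) -> Dict[str, List[float]]:
    """Parse GloVe text lines of the form 'word v1 v2 ... vN'."""
    vectors: Dict[str, List[float]] = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def build_embedding_matrix(
    word_to_index: Dict[str, int],
    vectors: Dict[str, List[float]],
    dim: int = 50,
) -> List[List[float]]:
    """Return one row per vocabulary index; words without a vector stay all-zero."""
    size = max(word_to_index.values()) + 1
    matrix = [[0.0] * dim for _ in range(size)]
    for word, idx in word_to_index.items():
        vec = vectors.get(word)
        if vec is not None and len(vec) == dim:
            matrix[idx] = list(vec)
    return matrix


# Possible usage once the file has been downloaded:
# with open("glove.6B.50d.txt", encoding="utf-8") as f:
#     vectors = parse_glove(f)
```

Row 0 is left as zeros here on the assumption that index 0 is reserved for padding.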

Installation

Clone the repository. git clone https://www.github.com/parask11/image-captioner
Go into the directory. cd image-captioner
Install requirements. pip install -r requirements.txt

Running

Run the Python script. python app.py

It will start a server.

Open the link in a browser. localhost:5000

The UI will appear. Upload images and generate the captions!
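
If the page does not load, it can help to check whether anything is listening on the port at all. The snippet below is a small sketch using only the standard library; the helper name `is_server_up` and the one-second timeout are choices made for this example, and the default port 5000 is taken from the step above.

```python
import socket


def is_server_up(host: str = "localhost", port: int = 5000, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False


if __name__ == "__main__":
    print("server reachable:", is_server_up())
```

This only confirms that the port accepts connections, not that the app behind it is working correctly.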

About

Generates suitable captions for user-supplied images of people and animals.
