Skip to content

deepakshroff/Image-captioning-app

Repository files navigation

🖼️ AI Image Captioning App

The AI Image Captioning App is a Flask-based web application that uses a deep learning model to generate intelligent captions for uploaded images. It analyzes the image and provides the best possible description along with a confidence score.


🔍 Screenshots


🚀 Features

  • 📷 Upload any JPG/PNG image
  • 🧠 Automatically generates a descriptive caption
  • 📊 Displays prediction confidence percentage
  • 🎨 Smooth, responsive, and aesthetic UI
  • 💡 Built using Flask, Python, and a custom ML model

🧪 Tech Stack

💻 Frontend

  • HTML5
  • CSS3
  • Jinja2 (Flask template rendering)

🧠 Backend

  • Python
  • Flask
  • Pre-trained image captioning model
  • Custom module: caption_model.py

💡 How to Run Locally

🛠 Prerequisites

  • Python 3.x
  • Pip
  • Virtualenv (optional but recommended)

⚙️ Steps

  1. Clone the repository
    git clone https://github.com/your-username/AI-Image-Captioning.git
    cd AI-Image-Captioning
    

🔍 How It Works

  • User uploads an image through the UI
  • The image is saved to the /static/uploads/ directory
  • The model (in caption_model.py) processes the image
  • The generated caption and confidence score are returned and displayed

🔮 Future Enhancements

  • 🖼️ Support for drag-and-drop image upload
  • 🧠 Use more powerful models like BLIP or Vision Transformers
  • 🌐 Add language translation for captions
  • ☁️ Deploy on Render or HuggingFace Spaces

About

👨‍🏫This project was built under the mentorship of Mr. Lokesh Sir as part of the AI & ML Capstone Project. It combines computer vision and deep learning for practical image understanding.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors