Skip to content

Randika00/VisionGPT-Extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

𝐕𝐢𝐬𝐢𝐨𝐧𝐆𝐏𝐓 𝐄𝐱𝐭𝐫𝐚𝐜𝐭𝐨𝐫

Built an AI-powered system for structured data extraction, enabling highly optimized and intelligent processing of PDFs and image-based documents.

VisionGPT Extractor is a Generative AI-powered OCR tool designed to extract structured data from image-based PDFs and images with high accuracy. Leveraging the GPT-5-mini model via Azure API, this tool not only performs OCR but also intelligently understands table layouts and text context, reducing the need for manual data entry.

Technologies: Python, GPT-5-mini, Azure API, OCR, JSON, AI data validation, automation

Features

  • Extracts text and tables from image-based PDFs and images.
  • Utilizes GPT-5-mini for high-quality AI-based data extraction.
  • Performs multiple validation cycles to ensure data reliability:
    • Runs three separate extraction cycles for each file.
    • Compares results from all cycles.
    • Includes only data rows that appear in at least two out of three cycles in the final output.
  • Generates a clean JSON output ready for downstream processing.
  • Supports large-scale document processing with minimal manual effort.

About

An AI-powered tool designed to extract structured data from documents, optimized for accurate and intelligent extraction from PDFs and images.”

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages