Skip to content

Latest commit

 

History

History
15 lines (11 loc) · 1.09 KB

File metadata and controls

15 lines (11 loc) · 1.09 KB

Abstract

InkWave is all about developing a machine learning model capable of converting handwritten notes into digitized ones. The digitized text would be used to be formatted into various files, including pdf and markdown. Ultimately we would be building a UI for the ML model so that we can see it in works and use it when required easily. We would be using Computer Vision, Natural Language Processing and Large Language Model in order to develop it.

InkWaveBasicWorkingPlan drawio

Dataset

We will be using these kaggle datasets to train our models: https://www.kaggle.com/datasets/landlord/handwriting-recognition/data (Learning CV Model Dataset) https://www.kaggle.com/datasets/vipin20/nlp-word-correction (Learning NLP/LLM Model Dataset) https://writeoff.cs.byu.edu/censustree/dataset.tar.gz (CV dataset) https://www.kaggle.com/datasets/jpmiller/layoutlm/data(Tentative NLP/LLM Model dataset)

Tech Doc Link

https://docs.google.com/document/d/1btfWL-6h_YLID7R_LKtT4L6QeBYJlxQ6F1TWeHM_u-8/edit?usp=sharing