This innovative project leverages cutting-edge Natural Language Processing (NLP) techniques to automate the extraction and organization of data from medical PDF reports.
The impact of this project spans across various domains.
The system currently achieves 85% accuracy — an impressive benchmark for a zero-shot classification model that doesn’t require task-specific training.
Future plans include integrating Optical Character Recognition (OCR) for image-based text extraction and fine-tuning the model with domain-specific datasets to improve accuracy.