Agentic VQA System - Anas Mhana

Framework: Built with PyTorch, ensuring efficient tensor operations and GPU acceleration where available.
Model Library: Integrated with the Hugging Face transformers library to utilize the BlipForQuestionAnswering architecture.
Pre-trained Model: Defaults to Salesforce/blip-vqa-base, which is optimized for visual question answering tasks.
Processing: Uses BlipProcessor for multimodal input handling (image + text).