[1] “Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion,” JITI, vol. 2, no. 3, pp. 19–46, Oct. 2024. Accessed: Jul. 16, 2025. [Online]. Available: https://www.itip-submit.com/index.php/JITI/article/view/65