MyUEVision: an application generating image caption for assisting visually impaired people
Journal of Enabling Technologies
ISSN: 2398-6263
Article publication date: 3 September 2024
Issue publication date: 8 November 2024
Abstract
Purpose
Visually impaired people usually struggle with doing daily tasks due to a lack of visual cues. For image captioning assistive applications, most applications require an Internet connection for the image captioning generation function to work properly. In this study, we developed MyUEVision, an application that assists visually impaired people by generating image captions that can work with and without the Internet. This work also involves reviewing some image captioning models for this application.
Design/methodology/approach
The author has selected and experimented with three image captioning models for online models and two image captioning models for offline models. The user experience (UX) design was designed based on the problems faced by visually impaired users when using mobile applications. The application is developed for the Android platform, and the offline model is integrated into the application for the image captioning generation function to work offline.
Findings
After conducting experiments for selecting online and offline models, ExpansionNet V2 is chosen for the online model and VGG16 + long short-term memory (LSTM) is chosen for the offline model. The application is then developed and assessed, and the results show that the application can generate image captions with or without the Internet, providing the best result when having an Internet connection, and the image is captured in good lighting with a few objects.
Originality/value
MyUEVision stands out for its both online and offline functionality. This approach ensures the image captioning generator works with or without the Internet, setting it apart as a unique solution to address the needs of visually impaired individuals.
Keywords
Citation
Nguyen, H., Huynh, T., Tran, N. and Nguyen, T. (2024), "MyUEVision: an application generating image caption for assisting visually impaired people", Journal of Enabling Technologies, Vol. 18 No. 4, pp. 248-264. https://doi.org/10.1108/JET-03-2024-0024
Publisher
:Emerald Publishing Limited
Copyright © 2024, Emerald Publishing Limited