Literature Review: Recent Advances in Computer Vision and Language AI
DOI:
https://doi.org/10.47363/4jymvb15Keywords:
Computer Vision, Natural Language Processing, Deep Learning, Multimodal Learning, Visual Question Answering, Scene Understanding, Context Modelling, Generalization, Reasoning, Human-AI InteractionAbstract
This comprehensive literature review examines the latest breakthroughs in computer vision and natural language processing (NLP), two rapidly evolving fields with applications across search, human-computer interaction, robotics, and more. It synthesizes key findings, trends, limitations, and open challenges from cutting-edge research at their intersection. The dramatic progress driven by deep neural networks is analysed in depth, along with issues like generalization, context handling, reasoning, uncertainty, and human-centric evaluation. Although remarkable advances have been made, especially in computer vision, core problems remain to be addressed. This review provides a thorough overview of the state-of-the-art, reflecting the most recent innovations, and promising future directions in this dynamic research domain.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Journal of Artificial Intelligence & Cloud Computing

This work is licensed under a Creative Commons Attribution 4.0 International License.