Literature Review: Recent Advances in Computer Vision and Language AI

Suresh Babu  Rajasekaran

doi:10.47363/4jymvb15

Authors

Suresh Babu Rajasekaran USA Author

DOI:

https://doi.org/10.47363/4jymvb15

Keywords:

Computer Vision, Natural Language Processing, Deep Learning, Multimodal Learning, Visual Question Answering, Scene Understanding, Context Modelling, Generalization, Reasoning, Human-AI Interaction

Abstract

This comprehensive literature review examines the latest breakthroughs in computer vision and natural language processing (NLP), two rapidly evolving fields with applications across search, human-computer interaction, robotics, and more. It synthesizes key findings, trends, limitations, and open challenges from cutting-edge research at their intersection. The dramatic progress driven by deep neural networks is analysed in depth, along with issues like generalization, context handling, reasoning, uncertainty, and human-centric evaluation. Although remarkable advances have been made, especially in computer vision, core problems remain to be addressed. This review provides a thorough overview of the state-of-the-art, reflecting the most recent innovations, and promising future directions in this dynamic research domain.

Author Biography

Suresh Babu Rajasekaran, USA

Suresh Babu Rajasekaran, NVIDIA. USA

Literature Review: Recent Advances in Computer Vision and Language AI

Authors

DOI:

Keywords:

Abstract

Author Biography

Downloads

Published

Issue

Section

License

How to Cite

Similar Articles

issn

Make a Submission

Information

Latest publications