About this Research Topic
The integration of computer vision and natural language processing is one of the holy grails of machine learning and artificial intelligence research, with important and wide-reaching applications in both real-world and digital lives. Progress in vision and language (V&L) research has been swift, driven by algorithms that produce increasingly strong results on various V&L tasks, such as visual question answering, image captioning, image retrieval, visual dialogue, and several other visual reasoning tasks. However, despite this stimulating progress, recent studies have shown that the majority of current datasets contain a non-negligible amount of spurious correlations and/or dataset biases. Moreover, several existing models mirror or amplify those biases, and current evaluation metrics may be insufficient to identify these issues. In this context, it is imperative to strive towards the proper design, evaluation, and analysis of the data and models used for V&L research.
Possible solutions may lie in a three-pronged approach involving improvements to the datasets, algorithms, and evaluation metrics used for V&L research. Recently, advances have been made in each of these areas, including the creation of synthetic datasets (NLVR, CLEVR), new variations on existing datasets (VQA-CP, TDIUC, nocaps), and altogether new tasks (GQA, FOIL, Social-IQ). In parallel, a plethora of new algorithms, evaluation metrics, and critical analyses have emerged regarding dataset bias, spurious correlations, interpretability, out-of-distribution performance, and related issues.
By bringing together researchers from machine learning, computer vision, natural language processing areas, and experts from a variety of application domains, this Research Topic aims at representing the state-of-the-art in V&L research and at fostering new foundational research towards robust, fair, and interpretable AI for V&L.
Therefore, we seek a broad range of original contributions from researchers and practitioners across the disciplines within the V&L domain. We welcome submissions of novel algorithms, datasets, analyses, and other innovations that advance the field by highlighting and addressing challenges in vision and language research, particularly those demonstrating improved algorithmic fairness, interpretability, and robustness to bias, spurious correlations, and long-tailed or out-of-distribution data.
Submissions may include, but are not limited to, the following topics:
- Novel algorithms and techniques that help improve the state-of-the-art in existing V&L tasks;
- Novel V&L algorithms that are less prone to dataset bias and spurious correlations, enforce demographic fairness and/or are more interpretable and explainable;
- Novel datasets, sub-tasks, and challenges that help test for new capabilities and/or highlight shortcomings with existing datasets and algorithms in V&L;
- Controlled test sets aimed at evaluating specific abilities involved in language-grounded visual understanding;
- Probing tasks aimed at evaluating the quality of multimodal V&L representations;
- Novel evaluation metrics that enable accurate and fair evaluation of V&L algorithms with respect to dataset bias, label imbalance, lack of compositionality, and other issues related to V&L tasks;
- Previously unreported analyses, key observations, discussions, and insights about bias and related issues in existing V&L datasets and algorithms;
- Negative or critical results regarding practices currently used in mainstream V&L research;
- Successes or challenges of integrating vision and language in a novel application domain.
We also welcome well-formulated survey articles, opinion pieces, position papers, and commentaries on the current state and future prospects of V&L research, as long as they fit within the theme of the Research Topic.
Keywords: vision and language, bias and fairness, explainable visual grounding, probing tasks for vision and language, evaluation of vision and language models
Important Note: All contributions to this Research Topic must be within the scope of the section and journal to which they are submitted, as defined in their mission statements. Frontiers reserves the right to guide an out-of-scope manuscript to a more suitable section or journal at any stage of peer review.