Advanced machine learning techniques, especially deep learning methods have greatly enhanced the capabilities of single and multi-modal information processing. Techniques like transfer learning and fine-tuning have allowed CNNs to be pre-trained on large datasets and then fine-tuned for specific tasks, improving performance and reducing the need for extensive labeled data. Attention mechanisms, such as those used in Transformer models, have further enhanced the performance of sequence models by allowing them to focus on relevant parts of the input sequence. RNNs can process sequences by maintaining a hidden state that captures information from previous steps. LSTMs improve upon RNNs by addressing the vanishing gradient problem, enabling them to learn long-term dependencies. Multi-modal learning transfers knowledge between different modalities, such as text, speech, image, and video.
There are broader challenges that are faced by multi-modal machine learning, such as in representation, translation, alignment, fusion, and co-learning. It is a multi-disciplinary field of increasing importance and with extraordinary potential. Therefore, it is important to explore both single and multi-modal learning with advanced machine learning techniques.
This Research Topic will focus on advanced machine learning methods for single or multi-modal information processing. Topics include, but are not limited to, the following:
- Improvements to conventional machine learning
- Ensemble Learning;
- Reinforcement learning;
- Deep neural networks;
- Graph neural networks;
- Multi-modal fusion;
- Cross-modal learning;
- Multi-modal Transformers;
- Multi-modal representation learning;
- Self-Supervised and Unsupervised Learning;
- Efficient Models and Compression;
- Interpretability and Robustness.
Keywords: Machine Learning, Deep Learning, Single modal, Multi-modal Learning, Image/Video/Text/Speech Processing
Important note: All contributions to this Research Topic must be within the scope of the section and journal to which they are submitted, as defined in their mission statements. Frontiers reserves the right to guide an out-of-scope manuscript to a more suitable section or journal at any stage of peer review.