SLP-P14.md

File metadata and controls

54 lines (47 loc) · 5.37 KB

ICASSP-2024-Papers


Multimodal Processing of Language


| Title | Repo | Paper | Video |
|---|---|---|---|
| Cooking-Clip: Context-Aware Language-Image Pretraining for Zero-Shot Recipe Generation | | IEEE Xplore | |
| Exploring Object-Centered External Knowledge for Fine-Grained Video Paragraph Captioning | | IEEE Xplore | |
| Relational Graph-Bridged Image-Text Interaction: A Novel Method for Multi-Modal Relation Extraction | | IEEE Xplore | |
| DialCLIP: Empowering CLIP as Multi-Modal Dialog Retriever | | IEEE Xplore, arXiv | |
| Vector Quantization Knowledge Transfer for End-to-End Text Image Machine Translation | | IEEE Xplore | |
| EmoRED: A Dataset for Relation Extraction in Texts with Emoticons | | IEEE Xplore | |
| MSG-BART: Multi-granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-grounded Dialogue Generation | | IEEE Xplore, arXiv | |
| CausalME: Balancing bi-modalities in Visual Question Answering | | IEEE Xplore | |
| MHPS: Multimodality-Guided Hierarchical Policy Search for Knowledge Graph Reasoning | | IEEE Xplore | |
| Empowering Vision-Language Models for Reasoning Ability through Large Language Models | | IEEE Xplore, arXiv | |
| PVCG: Prompt-Based Vision-Aware Classification and Generation for Multi-Modal Rumor Detection | | IEEE Xplore | |
| LabCLIP: Label-Enhanced Clip for Improving Zero-Shot Text Classification | GitHub | IEEE Xplore | |
| Context-Aware Dual Attention Network for Multimodal Sarcasm Detection | | IEEE Xplore | |