Our Publications

Selected work

ACM Multimedia

2025

Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection

Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection

ICCV

2025

FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models

FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models

Mainak Singha, Subhankar Roy, Sarthak Mehrotra, Ankit Jha, Moloud Abdar, Biplap Banerjee, Elisa Ricci
On Large Multimodal Models as Open-World Image Classifiers

On Large Multimodal Models as Open-World Image Classifiers

Superpowering Open-Vocabulary Object Detectors for X-ray Vision

Superpowering Open-Vocabulary Object Detectors for X-ray Vision

ICIAP

2025

Automatic benchmarking of large multimodal models via iterative experiment programming

Automatic benchmarking of large multimodal models via iterative experiment programming

Diversified in-domain synthesis with efficient fine-tuning for few-shot classification

Diversified in-domain synthesis with efficient fine-tuning for few-shot classification

Nicola Dall'Asen, Victor G Turrisi da Costa, Yiming Wang, Niculae Sebe, Elisa Ricci

CVPR

2025

Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers

Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers

Compositional Caching for Training-free Open-vocabulary Attribute Detection

Compositional Caching for Training-free Open-vocabulary Attribute Detection