DiMViDA: Diffusion-based multi-view data augmentation

Di Giacomo, Giuseppe; Franzese, Giulio; Cerquitelli, T.; Chiasserini, C. F.; Michiardi, Pietro
CAMAD 2024, IEEE 29th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks, 21-23 October 2024, Athens, Greece

We present DiMViDA, a Diffusion-based Multi-View Data Augmentation method built upon an innovative approach for Novel View Synthesis, which uses an extension of diffusion generative models that accepts any number of input views and that can generate any number of missing output views. In this work, our goal is to analyze the benefits of such a generative model in the context of object classification. Given a single input view, we compare the object classification performance of state-of-the-art models, namely ResNet18 and MobileNetV3, using the input view, versus its application to novel views synthesized by our generative model, using such synthetic views to augment the training set. Notably, differently from other works, we also adopt such a multi-view data augmentation method at inference. Our experimental findings illustrate that novel view synthesis can enhance object classification capabilities.


Type:
Conference
City:
Athens
Date:
2024-10-21
Department:
Digital Security
Eurecom Ref:
7846
Copyright:
© 2024 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/7846