Publications - AImageLab

Appearance and Pose-Conditioned Human Image Generation using Deformable GANs

Authors: Siarohin, Aliaksandr; Lathuilière, Stéphane; Sangineto, Enver; Sebe, Niculae

Published in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

In this paper, we address the problem of generating person images conditioned on both pose and appearance information. Specifically, given … (Read full abstract)

In this paper, we address the problem of generating person images conditioned on both pose and appearance information. Specifically, given an image xa of a person and a target pose P(xb), extracted from an image xb, we synthesize a new image of that person in pose P(xb), while preserving the visual details in xa. In order to deal with pixel-to-pixel misalignments caused by the pose differences between P(xa) and P(xb), we introduce deformable skip connections in the generator of our Generative Adversarial Network. Moreover, a nearest-neighbour loss is proposed instead of the common L1 and L2 losses in order to match the details of the generated image with the target image. Quantitative and qualitative results, using common datasets and protocols recently proposed for this task, show that our approach is competitive with respect to the state of the art. Moreover, we conduct an extensive evaluation using off-the-shell person re-identification (Re-ID) systems trained with person-generation based augmented data, which is one of themain important applications for this task. Our experiments show that our Deformable GANs can significantly boost the Re-ID accuracy and are even better than data-augmentation methods specifically trained using Re-ID losses.

2021 Articolo su rivista

DOI IRIS

Assessing the Role of Boundary-level Objectives in Indoor Semantic Segmentation

Authors: Amoroso, Roberto; Baraldi, Lorenzo; Cucchiara, Rita

Published in: LECTURE NOTES IN COMPUTER SCIENCE

Providing fine-grained and accurate segmentation maps of indoor scenes is a challenging task with relevant applications in the fields of … (Read full abstract)

Providing fine-grained and accurate segmentation maps of indoor scenes is a challenging task with relevant applications in the fields of augmented reality, image retrieval, and personalized robotics. While most of the recent literature on semantic segmentation has focused on outdoor scenarios, the generation of accurate indoor segmentation maps has been partially under-investigated. With the goal of increasing the accuracy of semantic segmentation in indoor scenarios, we focus on the analysis of boundary-level objectives, which foster the generation of fine-grained boundaries between different semantic classes and which have never been explored in the case of indoor segmentation. In particular, we test and devise variants of both the Boundary and Active Boundary losses, two recent proposals which deal with the prediction of semantic boundaries. Through experiments on the NYUDv2 dataset, we quantify the role of such losses in terms of accuracy and quality of boundary prediction and demonstrate the accuracy gain of the proposed variants.

2021 Relazione in Atti di Convegno

DOI IRIS

Automated Artifact Retouching in Morphed Images with Attention Maps

Authors: Borghi, G.; Franco, A.; Graffieti, G.; Maltoni, D.

Published in: IEEE ACCESS

Morphing attack is an important security threat for automatic face recognition systems. High-quality morphed images, i.e. images without significant visual … (Read full abstract)

Morphing attack is an important security threat for automatic face recognition systems. High-quality morphed images, i.e. images without significant visual artifacts such as ghosts, noise, and blurring, exhibit higher chances of success, being able to fool both human examiners and commercial face verification algorithms. Therefore, the availability of large sets of high-quality morphs is fundamental for training and testing robust morphing attack detection algorithms. However, producing a high-quality morphed image is an expensive and time-consuming task since manual post-processing is generally required to remove the typical artifacts generated by landmark-based morphing techniques. This work describes an approach based on the Conditional Generative Adversarial Network paradigm for automated morphing artifact retouching and the use of Attention Maps to guide the generation process and limit the retouch to specific areas. In order to work with high-resolution images, the framework is applied on different facial crops, which, once processed and retouched, are accurately blended to reconstruct the whole morphed face. Specifically, we focus on four different squared face regions, i.e. the right and left eyes, the nose, and the mouth, that are frequently affected by artifacts. Several qualitative and quantitative experimental evaluations have been conducted to confirm the effectiveness of the proposal in terms of, among the others, pixel-wise metrics, identity preservation, and human observer analysis. Results confirm the feasibility and the accuracy of the proposed framework.

2021 Articolo su rivista

DOI IRIS

Avalanche: An end-to-end library for continual learning

Authors: Lomonaco, V.; Pellegrini, L.; Cossu, A.; Carta, A.; Graffieti, G.; Hayes, T. L.; De Lange, M.; Masana, M.; Pomponi, J.; Van De Ven, G. M.; Mundt, M.; She, Q.; Cooper, K.; Forest, J.; Belouadah, E.; Calderara, S.; Parisi, G. I.; Cuzzolin, F.; Tolias, A. S.; Scardapane, S.; Antiga, L.; Ahmad, S.; Popescu, A.; Kanan, C.; Van De Weijer, J.; Tuytelaars, T.; Bacciu, D.; Maltoni, D.

Published in: IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS

Learning continually from non-stationary data streams is a long-standing goal and a challenging problem in machine learning. Recently, we have … (Read full abstract)

Learning continually from non-stationary data streams is a long-standing goal and a challenging problem in machine learning. Recently, we have witnessed a renewed and fast-growing interest in continual learning, especially within the deep learning community. However, algorithmic solutions are often difficult to re-implement, evaluate and port across different settings, where even results on standard benchmarks are hard to reproduce. In this work, we propose Avalanche, an open-source end-to-end library for continual learning research based on PyTorch. Avalanche is designed to provide a shared and collaborative codebase for fast prototyping, training, and reproducible evaluation of continual learning algorithms.

2021 Relazione in Atti di Convegno

DOI IRIS

Circular RNA profiling distinguishes medulloblastoma groups and shows aberrant RMST overexpression in WNT medulloblastoma

Authors: Rickert, Daniel; Bartl, Jasmin; Picard, Daniel; Bernardi, Flavia; Qin, Nan; Lovino, Marta; Puget, Stéphanie; Meyer, Frauke-Dorothee; Mahoungou Koumba, Idriss; Beez, Thomas; Varlet, Pascale; Dufour, Christelle; Fischer, Ute; Borkhardt, Arndt; Reifenberger, Guido; Ayrault, Olivier; Remke, Marc

Published in: ACTA NEUROPATHOLOGICA

2021 Articolo su rivista

DOI IRIS

Coarse-to-fine gaze redirection with numerical and pictorial guidance

Authors: Chen, J.; Zhang, J.; Sangineto, E.; Chen, T.; Fan, J.; Sebe, N.

Gaze redirection aims at manipulating the gaze of a given face image with respect to a desired direction (i.e., a … (Read full abstract)

Gaze redirection aims at manipulating the gaze of a given face image with respect to a desired direction (i.e., a reference angle) and it can be applied to many real life scenarios, such as video-conferencing or taking group photos. However, previous work on this topic mainly suffers of two limitations: (1) Low-quality image generation and (2) Low redirection precision. In this paper, we propose to alleviate these problems by means of a novel gaze redirection framework which exploits both a numerical and a pictorial direction guidance, jointly with a coarse-to-fine learning strategy. Specifically, the coarse branch learns the spatial transformation which warps input image according to desired gaze. On the other hand, the fine-grained branch consists of a generator network with conditional residual image learning and a multi-task discriminator. This second branch reduces the gap between the previously warped image and the ground-truth image and recovers finer texture details. Moreover, we propose a numerical and pictorial guidance module (NPG) which uses a pictorial gazemap description and numerical angles as an extra guide to further improve the precision of gaze redirection. Extensive experiments on a benchmark dataset show that the proposed method outperforms the state-of-the-art approaches in terms of both image quality and redirection precision. The code is available at https://github.com/jingjingchen777/CFGR

2021 Relazione in Atti di Convegno

DOI IRIS

Confidence Calibration for Deep Renal Biopsy Immunofluorescence Image Classification

Authors: Pollastri, Federico; Maroñas, Juan; Bolelli, Federico; Ligabue, Giulia; Paredes, Roberto; Magistroni, Riccardo; Grana, Costantino

Published in: INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION

With this work we tackle immunofluorescence classification in renal biopsy, employing state-of-the-art Convolutional Neural Networks. In this setting, the aim … (Read full abstract)

With this work we tackle immunofluorescence classification in renal biopsy, employing state-of-the-art Convolutional Neural Networks. In this setting, the aim of the probabilistic model is to assist an expert practitioner towards identifying the location pattern of antibody deposits within a glomerulus. Since modern neural networks often provide overconfident outputs, we stress the importance of having a reliable prediction, demonstrating that Temperature Scaling (TS), a recently introduced re-calibration technique, can be successfully applied to immunofluorescence classification in renal biopsy. Experimental results demonstrate that the designed model yields good accuracy on the specific task, and that TS is able to provide reliable probabilities, which are highly valuable for such a task given the low inter-rater agreement.

2021 Relazione in Atti di Convegno

DOI IRIS

DAG-Net: Double Attentive Graph Neural Network for Trajectory Forecasting

Authors: Monti, Alessio; Bertugli, Alessia; Calderara, Simone; Cucchiara, Rita

Published in: INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION

Understanding human motion behaviour is a critical task for several possible applications like self-driving cars or social robots, and in … (Read full abstract)

Understanding human motion behaviour is a critical task for several possible applications like self-driving cars or social robots, and in general for all those settings where an autonomous agent has to navigate inside a human-centric environment. This is non-trivial because human motion is inherently multi-modal: given a history of human motion paths, there are many plausible ways by which people could move in the future. Additionally, people activities are often driven by goals, e.g. reaching particular locations or interacting with the environment. We address the aforementioned aspects by proposing a new recurrent generative model that considers both single agents' future goals and interactions between different agents. The model exploits a double attention-based graph neural network to collect information about the mutual influences among different agents and to integrate it with data about agents' possible future objectives. Our proposal is general enough to be applied to different scenarios: the model achieves state-of-the-art results in both urban environments and also in sports applications.

2021 Relazione in Atti di Convegno

DOI IRIS

Data‐based design of robust fault detection and isolation residuals via LASSO optimization and Bayesian filtering

Authors: Cascianelli, Silvia; Costante, Gabriele; Crocetti, Francesco; Ricci, Elisa; Valigi, Paolo; Luca Fravolini, Mario

Published in: ASIAN JOURNAL OF CONTROL

2021 Articolo su rivista

DOI IRIS

Efficient Training of Visual Transformers with Small-Size Datasets

Authors: Liu, Yahui; Sangineto, Enver; Bi, Wei; Sebe, Nicu; Lepri, Bruno; De Nadai, Marco

Published in: ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS

2021 Relazione in Atti di Convegno

IRIS