Publications

Explore our research publications: papers, articles, and conference proceedings from AImageLab.

Tip: type @ to pick an author and # to pick a keyword.

Exploration of Convolutional Neural Network models for source code classification

Authors: Barchi, F.; Parisi, E.; Urgese, G.; Ficarra, E.; Acquaviva, A.

Published in: ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

The application of Artificial Intelligence is becoming common in many engineering fields. Among them, one of the newest and rapidly … (Read full abstract)

The application of Artificial Intelligence is becoming common in many engineering fields. Among them, one of the newest and rapidly evolving is software generation, where AI can be used to automatically optimise the implementation of an algorithm for a given computing platform. In particular, Deep Learning technologies can be used to the decide how to allocate pieces of code to hardware platforms with multiple cores and accelerators, that are common in high performance and edge computing applications. In this work, we explore the use of Convolutional Neural Networks (CNN)s to analyse the application source code and decide the best compute unit to minimise the execution time. We demonstrate that CNN models can be successfully applied to source code classification, providing higher accuracy with consistently reduced learning time with respect to state-of-the-art methods. Moreover, we show the robustness of the method with respect to source code pre-processing, compiler options and hyper-parameters selection.

2021 Articolo su rivista

Explore and Explain: Self-supervised Navigation and Recounting

Authors: Bigazzi, Roberto; Landi, Federico; Cornia, Marcella; Cascianelli, Silvia; Baraldi, Lorenzo; Cucchiara, Rita

Published in: INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION

Embodied AI has been recently gaining attention as it aims to foster the development of autonomous and intelligent agents. In … (Read full abstract)

Embodied AI has been recently gaining attention as it aims to foster the development of autonomous and intelligent agents. In this paper, we devise a novel embodied setting in which an agent needs to explore a previously unknown environment while recounting what it sees during the path. In this context, the agent needs to navigate the environment driven by an exploration goal, select proper moments for description, and output natural language descriptions of relevant objects and scenes. Our model integrates a novel self-supervised exploration module with penalty, and a fully-attentive captioning model for explanation. Also, we investigate different policies for selecting proper moments for explanation, driven by information coming from both the environment and the navigation. Experiments are conducted on photorealistic environments from the Matterport3D dataset and investigate the navigation and explanation capabilities of the agent as well as the role of their interactions.

2021 Relazione in Atti di Convegno

Extracting accurate long-term behavior changes from a large pig dataset

Authors: Bergamini, L.; Pini, S.; Simoni, A.; Vezzani, R.; Calderara, S.; Eath, R. B. D.; Fisher, R. B.

Visual observation of uncontrolled real-world behavior leads to noisy observations, complicated by occlusions, ambiguity, variable motion rates, detection and tracking … (Read full abstract)

Visual observation of uncontrolled real-world behavior leads to noisy observations, complicated by occlusions, ambiguity, variable motion rates, detection and tracking errors, slow transitions between behaviors, etc. We show in this paper that reliable estimates of long-term trends can be extracted given enough data, even though estimates from individual frames may be noisy. We validate this concept using a new public dataset of approximately 20+ million daytime pig observations over 6 weeks of their main growth stage, and we provide annotations for various tasks including 5 individual behaviors. Our pipeline chains detection, tracking and behavior classification combining deep and shallow computer vision techniques. While individual detections may be noisy, we show that long-term behavior changes can still be extracted reliably, and we validate these results qualitatively on the full dataset. Eventually, starting from raw RGB video data we are able to both tell what pigs main daily activities are, and how these change through time.

2021 Relazione in Atti di Convegno

FashionSearch++: Improving Consumer-to-Shop Clothes Retrieval with Hard Negatives

Authors: Morelli, Davide; Cornia, Marcella; Cucchiara, Rita

Published in: CEUR WORKSHOP PROCEEDINGS

Consumer-to-shop clothes retrieval has recently emerged in computer vision and multimedia communities with the development of architectures that can find … (Read full abstract)

Consumer-to-shop clothes retrieval has recently emerged in computer vision and multimedia communities with the development of architectures that can find similar in-shop clothing images given a query photo. Due to its nature, the main challenge lies in the domain gap between user-acquired and in-shop images. In this paper, we follow the most recent successful research in this area employing convolutional neural networks as feature extractors and propose to enhance the training supervision through a modified triplet loss that takes into account hard negative examples. We test the proposed approach on the Street2Shop dataset, achieving results comparable to state-of-the-art solutions and demonstrating good generalization properties when dealing with different settings and clothing categories.

2021 Relazione in Atti di Convegno

Fast Run-Based Connected Components Labeling for Bitonal Images

Authors: Wonsang, Lee; Allegretti, Stefano; Bolelli, Federico; Grana, Costantino

Connected Components Labeling (CCL) is a fundamental task in binary image processing. Since its introduction in the sixties, several algorithmic … (Read full abstract)

Connected Components Labeling (CCL) is a fundamental task in binary image processing. Since its introduction in the sixties, several algorithmic strategies have been proposed to optimize its execution time. Most CCL algorithms in literature, including the current state-of-the-art, are designed to work on an input stored with 1-byte per pixel, even if the most memory-efficient format for a binary input only uses 1-bit per pixel. This paper deals with connected components labeling on 1-bit per pixel images, also known as 1bpp or bitonal images. An existing run-based CCL strategy is adapted to this input format, and optimized with Find First Set hardware operations and a smart management of provisional labels, giving birth to an efficient solution called Bit-Run Two Scan (BRTS). Then, BRTS is further optimized by merging pairs of consecutive lines through bitwise OR, and finding runs on this reduced data. This modification is the basis for another new algorithm on bitonal images, Bit-Merge-Run Scan (BMRS). When evaluated on a public benchmark, the two proposals outperform all the fastest competitors in literature, and therefore represent the new state-of-the-art for connected components labeling on bitonal images.

2021 Relazione in Atti di Convegno

Foreword by general chairs

Authors: Cucchiara, R.; Del Bimbo, A.; Sclaroff, S.

Published in: LECTURE NOTES IN COMPUTER SCIENCE

2021 Relazione in Atti di Convegno

Foreword by general chairs

Authors: Cucchiara, R.; Del Bimbo, A.; Sclaroff, S.

Published in: LECTURE NOTES IN COMPUTER SCIENCE

2021 Relazione in Atti di Convegno

Foreword by general chairs

Authors: Cucchiara, R.; Del Bimbo, A.; Sclaroff, S.

Published in: LECTURE NOTES IN COMPUTER SCIENCE

2021 Relazione in Atti di Convegno

Foreword by general chairs

Authors: Cucchiara, R.; Del Bimbo, A.; Sclaroff, S.

Published in: LECTURE NOTES IN COMPUTER SCIENCE

2021 Relazione in Atti di Convegno

Foreword by general chairs

Authors: Cucchiara, R.; Bimbo, A. D.; Sclaroff, S.

Published in: LECTURE NOTES IN COMPUTER SCIENCE

2021 Relazione in Atti di Convegno

Page 34 of 106 • Total publications: 1059