Publications - AImageLab

A complete system for garment segmentation and color classification

Authors: Manfredi, Marco; Grana, Costantino; Calderara, Simone; Cucchiara, Rita

Published in: MACHINE VISION AND APPLICATIONS

In this paper, we propose a general approach for automatic segmentation, color-based retrieval and classification of garments in fashion store … (Read full abstract)

In this paper, we propose a general approach for automatic segmentation, color-based retrieval and classification of garments in fashion store databases, exploiting shape and color information. The garment segmentation is automatically initialized by learning geometric constraints and shape cues, then it is performed by modeling both skin and accessory colors with Gaussian Mixture Models. For color similarity retrieval and classification, to adapt the color description to the users’ perception and the company marketing directives, a color histogram with an optimized binning strategy, learned on the given color classes, is introduced and combined with HOG features for garment classification. Experiments validating the proposed strategy, and a free-to-use dataset publicly available for scientific purposes, are finally detailed.

2014 Articolo su rivista

DOI IRIS

A fast and effective ellipse detector for embedded vision applications

Authors: Fornaciari, M.; Prati, A.; Cucchiara, R.

Published in: PATTERN RECOGNITION

Several papers addressed ellipse detection as a first step for several computer vision applications, but most of the proposed solutions … (Read full abstract)

Several papers addressed ellipse detection as a first step for several computer vision applications, but most of the proposed solutions are too slow to be applied in real time on large images or with limited hardware resources. This paper presents a novel algorithm for fast and effective ellipse detection and demonstrates its superior speed performance on large and challenging datasets. The proposed algorithm relies on an innovative selection strategy of arcs which are candidate to form ellipses and on the use of Hough transform to estimate parameters in a decomposed space. The final aim of this solution is to represent a building block for new generation of smart-phone applications which need fast and accurate ellipse detection also with limited computational resources. © 2014 Elsevier Ltd.

2014 Articolo su rivista

DOI IRIS

Benchmarking for Person Re-identification

Authors: Vezzani, Roberto; Cucchiara, Rita

Published in: ADVANCES IN COMPUTER VISION AND PATTERN RECOGNITION

The evaluation of computer vision and pattern recognition systems is usually a burdensome and time-consuming activity. In this chapter all … (Read full abstract)

The evaluation of computer vision and pattern recognition systems is usually a burdensome and time-consuming activity. In this chapter all the benchmarks publicly available for re-identification will be reviewed and compared, starting from the ancestors VIPeR and Caviar to the most recent datasets for 3D modeling such as SARC3d (with calibrated cameras) and RGBD-ID (with range sensors). Specific requirements and constraints are highlighted and reported for each of the described collections. In addition, details on the metrics that are mostly used to test and evaluate the re-identification systems are provided.

2014 Capitolo/Saggio

DOI IRIS

Covariance of Covariance Features for Image Classification

Authors: Serra, Giuseppe; Grana, Costantino; Manfredi, Marco; Cucchiara, Rita

In this paper we propose a novel image descriptor built by computing the covariance of pixel level features on densely … (Read full abstract)

In this paper we propose a novel image descriptor built by computing the covariance of pixel level features on densely sampled patches and encoding them using their covariance. Appropriate projections to the Euclidean space and feature normalizations are employed in order to provide a strong descriptor usable with linear classifiers. In order to remove border effects, we further enhance the Spatial Pyramid representation with bilinear interpolation. Experimental results conducted on two common datasets for object and texture classification show that the performance of our method is comparable with state of the art techniques, but removing any dataset specific dependency in the feature encoding step.

2014 Relazione in Atti di Convegno

DOI IRIS

Detection of static groups and crowds gathered in open spaces by texture classification

Authors: Manfredi, Marco; Vezzani, Roberto; Calderara, Simone; Cucchiara, Rita

Published in: PATTERN RECOGNITION LETTERS

A surveillance system specifically developed to manage crowded scenes is described in this paper. In particular we focused on static … (Read full abstract)

A surveillance system specifically developed to manage crowded scenes is described in this paper. In particular we focused on static crowds, composed by groups of people gathered and stayed in the same place for a while. The detection and spatial localization of static crowd situations is performed by means of a One Class Support Vector Machine, working on texture features extracted at patch level. Spatial regions containing crowds are identified and filtered using motion information to prevent noise and false alarms due to moving flows of people. By means of one class classification and inner texture descriptors, we are able to obtain, from a single training set, a sufficiently general crowd model that can be used for all the scenarios that shares a similar viewpoint. Tests on public datasets and real setups validate the proposed system.

2014 Articolo su rivista

DOI IRIS

Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation

Authors: Baraldi, Lorenzo; Paci, Francesco; Serra, Giuseppe; Benini, Luca; Cucchiara, Rita

Published in: IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS

We present a novel method for monocular hand gesture recognition in ego-vision scenarios that deals with static and dynamic gestures … (Read full abstract)

We present a novel method for monocular hand gesture recognition in ego-vision scenarios that deals with static and dynamic gestures and can achieve high accuracy results using a few positive samples. Specifically, we use and extend the dense trajectories approach that has been successfully introduced for action recognition. Dense features are extracted around regions selected by a new hand segmentation technique that integrates superpixel classification, temporal and spatial coherence. We extensively testour gesture recognition and segmentation algorithms on public datasets and propose a new dataset shot with a wearable camera. In addition, we demonstrate that our solution can work in near real-time on a wearable device.

2014 Relazione in Atti di Convegno

DOI IRIS

Head Pose Estimation in First-Person Camera Views

Authors: Alletto, Stefano; Serra, Giuseppe; Calderara, Simone; Cucchiara, Rita

Published in: INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION

In this paper we present a new method for head pose real-time estimation in ego-vision scenarios that is a key … (Read full abstract)

In this paper we present a new method for head pose real-time estimation in ego-vision scenarios that is a key step in the understanding of social interactions. In order to robustly detect head under changing aspect ratio, scale and orientation we use and extend the Hough-Based Tracker which allows to follow simultaneously each subject in the scene. In an ego-vision scenario where a group interacts in a discussion, each subject's head orientation will be more likely to remain focused for a while on the person who has the floor. In order to encode this behavior we include a stateful Hidden Markov Model technique that enforces the predicted pose with the temporal coherence from a video sequence. We extensively test our approach on several indoor and outdoor ego-vision videos with high illumination variations showing its validity and outperforming other recent related state of the art approaches.

2014 Relazione in Atti di Convegno

DOI IRIS

Human Behavior Understanding: 5th International Workshop, HBU 2014 Zurich, Switzerland, September 12, 2014 Proceedings

Authors: Park, H. S.; Salah, A. A.; Lee, Y. J.; Morency, L. -P.; Sheikh, Y.; Cucchiara, R.

Published in: LECTURE NOTES IN ARTIFICIAL INTELLIGENCE

2014 Relazione in Atti di Convegno

IRIS

Illustrations Segmentation in Digitized Documents Using Local Correlation Features

Authors: Coppi, Dalia; Grana, Costantino; Cucchiara, Rita

Published in: PROCEDIA COMPUTER SCIENCE

In this paper we propose an approach for Document Layout Analysis based on local correlation features. We identify and extract … (Read full abstract)

In this paper we propose an approach for Document Layout Analysis based on local correlation features. We identify and extract illustrations in digitized documents by learning the discriminative patterns of textual and pictorial regions. The proposal has been demonstrated to be effective on historical datasets and to outperform the state-of-the-art in presence of challenging documents with a large variety of pictorial elements.

2014 Relazione in Atti di Convegno

DOI IRIS

Kernelized Structural Classification for 3D Dogs Body Parts Detection

Authors: Pistocchi, Simone; Calderara, Simone; Barnard, S.; Ferri, N.; Cucchiara, Rita

Published in: INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION

Despite pattern recognition methods for human behavioral analysis has flourished in the last decade, animal behavioral analysis has been almost … (Read full abstract)

Despite pattern recognition methods for human behavioral analysis has flourished in the last decade, animal behavioral analysis has been almost neglected. Those few approaches are mostly focused on preserving livestock economic value while attention on the welfare of companion animals, like dogs, is now emerging as a social need. In this work, following the analogy with human behavior recognition, we propose a system for recognizing body parts of dogs kept in pens. We decide to adopt both 2D and 3D features in order to obtain a rich description of the dog model. Images are acquired using the Microsoft Kinect to capture the depth map images of the dog. Upon depth maps a Structural Support Vector Machine (SSVM) is employed to identify the body parts using both 3D features and 2D images. The proposal relies on a kernelized discriminative structural classificator specifically tailored for dogs independently from the size and breed. The classification is performed in an online fashion using the LaRank optimization technique to obtaining real time performances. Promising results have emerged during the experimental evaluation carried out at a dog shelter, managed by IZSAM, in Teramo, Italy.

2014 Relazione in Atti di Convegno

DOI IRIS

Publications by Rita Cucchiara

A complete system for garment segmentation and color classification

A fast and effective ellipse detector for embedded vision applications

Benchmarking for Person Re-identification

Covariance of Covariance Features for Image Classification

Detection of static groups and crowds gathered in open spaces by texture classification

Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation

Head Pose Estimation in First-Person Camera Views

Human Behavior Understanding: 5th International Workshop, HBU 2014 Zurich, Switzerland, September 12, 2014 Proceedings

Illustrations Segmentation in Digitized Documents Using Local Correlation Features

Kernelized Structural Classification for 3D Dogs Body Parts Detection