Publications

Explore our research publications: papers, articles, and conference proceedings from AImageLab.

Tip: type @ to pick an author and # to pick a keyword.

Statistical pattern recognition for multi-camera detection, tracking, and trajectory analysis

Authors: Calderara, S.; Cucchiara, R.; Vezzani, R.; Prati, A.

2009 Capitolo/Saggio

Video Analysis for Ambient intelligence in Urban Environments

Authors: Prati, Andrea; Cucchiara, Rita

Published in: ADVANCED INFORMATION AND KNOWLEDGE PROCESSING

Ambient Intelligence (AmI) is an emerging field of research that comprises new paradigms, techniques and systems for intelligent processing of … (Read full abstract)

Ambient Intelligence (AmI) is an emerging field of research that comprises new paradigms, techniques and systems for intelligent processing of distributed sensing. A challenging arena for AmI framework is represented by urban environments that are characterized by high complexity, numerous sources of data,and spreading of interesting and non-trivial applications. In this context, the project LAICA (Laboratory of Ambient Intelligence for a friendly city) represents a real experiment of the usefulness of AmI for advanced services to citizens. This chapter will address solutions of video analysis that can be directly applied in urban AmI. It describes in details the uniqueness of LAICA approach, focusing in particular on the use of computer vision techniques for monitoring public parks. People surveillanceand web-based video broadcasting will be taken into account.

2009 Capitolo/Saggio

Video surveillance and multimedia forensics: an application to trajectory analysis

Authors: Calderara, Simone; Prati, Andrea; Cucchiara, Rita

This paper reports an application of trajectory analysis in which forensics and video surveillance techniques are jointly employed for providing … (Read full abstract)

This paper reports an application of trajectory analysis in which forensics and video surveillance techniques are jointly employed for providing a new tool of multimedia forensics. Advanced video surveillance techniques are used to extract from a multi-camera system the trajectories of the moving people which are then modelled by either their positions (projected on the ground plane) or their directions of movement. Both these two representations can be very suitable for querying large video repositories, by searching for similar trajectories in terms of either sequences of positions or trajectory shape (encoded as sequence of angles, where positions do not care). Preliminary examples of the possible use of this approach are shown.

2009 Relazione in Atti di Convegno

"Inside the Bible": Segmentation, Annotation and Retrieval for a New Browsing Experience

Authors: Grana, Costantino; Borghesani, Daniele; Calderara, Simone; Cucchiara, Rita

In this paper we present a system for automatic segmentation, annotation and image retrieval based on content, focused on illuminated … (Read full abstract)

In this paper we present a system for automatic segmentation, annotation and image retrieval based on content, focused on illuminated manuscripts and in particular the Borso D'Este Holy Bible. To enhance the interaction possibilities with this work, full of decorations and illustrations, we exploit some well known document analysis techniques in addition to some new approaches, in order to achieve good segmentation of pages into meaningful visual objects with the relative annotation. We wanted to extend the standard keyword-based retrieval approach in a commentary with a modern visual-based retrieval by appearance similarity: an entire software user interface for exploration and visual search of illuminated manuscripts.

2008 Relazione in Atti di Convegno

A Markerless Approach for Consistent Action Recognition in a Multi-camera System

Authors: Calderara, Simone; Prati, Andrea; Cucchiara, Rita

This paper presents a method for recognizing human actions in a multi-camera setup. The proposed method automatically extracts significant points … (Read full abstract)

This paper presents a method for recognizing human actions in a multi-camera setup. The proposed method automatically extracts significant points on the human body, without the need of artificial markers. A sophisticated appearance-based tracking able to cope with occlusions is exploited to extract a probability map for each moving object. A segmentation technique based on mixture of Gaussians is then employed to extract and track significant points on this map, corresponding to significant regions on the human silhouette. The point tracking produces a set of 3D trajectories that are compared with other trajectories by means of global alignment and dynamic programming techniques. Preliminary experiments showed the potentiality of the proposed approach.

2008 Relazione in Atti di Convegno

Action Signature: a Novel Holistic Representation for Action Recognition

Authors: Calderara, Simone; Cucchiara, Rita; Prati, Andrea

Recognizing different actions with a unique approach can be a difficult task. This paper proposes a novel holistic representation of … (Read full abstract)

Recognizing different actions with a unique approach can be a difficult task. This paper proposes a novel holistic representation of actions that we called "action signature". This 1D trajectory is obtained by parsing the 2D image containing the orientations of the gradient calculated on the motion feature map called motion-history image. In this way, the trajectory is a sketch representation of how the object motion varies in time. A robust statistical framework based on mixtures of von Mises distributions and dynamic programming for sequence alignment are used to compare and classify actions/trajectories. The experimental results show a rather high accuracy in distinguishing quite complicated actions, such as drinking, jumping, or abandoning an object.

2008 Relazione in Atti di Convegno

AD-HOC: Appearance Driven Human tracking with Occlusion Handling

Authors: Vezzani, Roberto; Cucchiara, Rita

AD-HOC copes with the problem of multiple people tracking in video surveillance in presence of large occlusions. The main novelty … (Read full abstract)

AD-HOC copes with the problem of multiple people tracking in video surveillance in presence of large occlusions. The main novelty is the adoption of an appearance-based approach in a formal Bayesian framework: the status of each object is defined at pixel level, where each pixel is characterized by the appearance, i.e. the color (integrated along the time) and the likelihood to belong to the object. With these data at pixel-level and a probability of non-occlusion at object-level, the problem of occlusions is addressed. The method does not aim at detecting the presence of an occlusion only, but classifies the type of occlusion at a sub-region level and evolve the status of theobject in a selective way. The AD-HOC tracking has been tested in many application for indoor and outdoor surveillance. Results on PETS2006 test set are reported where many people and abandoned objects are detected and tracked.

2008 Relazione in Atti di Convegno

Adaptive Course Generation through Learning Styles Representation

Authors: Sangineto, E; N., Capuano; M., Gaeta; A., Micarelli

Published in: UNIVERSAL ACCESS IN THE INFORMATION SOCIETY

2008 Articolo su rivista

Annotation Collection and Online Performance Evaluation for Video Surveillance: the ViSOR Project

Authors: Vezzani, Roberto; Cucchiara, Rita

This paper presents the Visor (VIdeo Surveillance Online Repository) project designed with the aim of establishing anopen platform for collecting, … (Read full abstract)

This paper presents the Visor (VIdeo Surveillance Online Repository) project designed with the aim of establishing anopen platform for collecting, annotating, retrieving, sharingsurveillance videos, and of evaluating the performanceof automatic surveillance systems. The main idea is to exploitthe collaborative paradigm spreading in the web communityto join together the ontology based annotation andretrieval concepts and the requirements of the computer visionand video surveillance communities. The ViSOR openrepository is based on a reference ontology which integratesmany concepts, also coming from LSCOM and MediaMillontologies. The web interface allows video browse, queryby annotated concepts or by keywords, compressed videopreview, media download and upload. The repository containsmetadata annotations, which can be either manuallycreated as ground truth or automatically generated by videosurveillance systems. Their automatic annotations can becompared each other or with the reference ground-truth exploitingan integrated on-line performance evaluator.

2008 Relazione in Atti di Convegno

Artificial vision for the surveillance video

Authors: Cucchiara, R.

Published in: MONDO DIGITALE

2008 Articolo su rivista

Page 86 of 106 • Total publications: 1059