Research into multi-modal analysis
for Digital Video Content Description


Researcher: Ciarán Ó Conaire B.Eng
Date: August 2004


Currently, I am using infrared video, along with a stereo pair of visible spectrum CCTV cameras.
The project aims to show the benefits of using multiple capture devices to gather information about a scene and to investigate how best to combine these data sources to accurately describe the scene.
Applications include :
  • Intelligent Surveillance
  • Smart capture devices
  • Digital Video analysis


I am working on various aspects of a cognitive vision system:

  • Background Modelling
  • Object Detection and Tracking
  • Stereo Depth Estimation
  • Infrared/Visible Image Alignment






Video Examples



Stereo Depth Estimation

The mpeg video here shows the view from the left camera of a stereo pair, alongside the estimated disparity-map.
Two videos were captured simultaneously with a pair of parallel identical video cameras.
The disparity map was calculated using the Lhuillier method of finding point correspondances.
Further research will make use of temporal information between frames to make the process more efficient and more accurate.



Background Modelling: Illumination Problems

This video (below) demonstrates the illumination problem in background modelling.
Even if brightness is ignored, the actual colour of regions changes depending on the lighting conditions.



Background Modelling: YUV Models

Using a one dimentional mixture of Gaussians model for each of the Y, U and V colour components of each video frame, this video was generated. Blue corresponds to foreground of Y, Green to U and Red to V; Mixtures of Red and Blue (magenta) corresponds to foreground detected in V and Y, etc.
Shadows can be seen as luminance (Y) foreground. There are obvious problems with the movement of trees and bushes and with the illumination variations.



Image Examples




Infrared Image Segmentation: Person Tracking

Segmentation of infrared imagery is important in identifying people and other hot objects, such as vehicles and animals.
Since the infrared camera itself is non-cooled, it gives off heat and this results in very noisy image data.
The very low contrast in the infrared image is due to the use of plain glass as a reflector. This glass absorbs a portion of the infrared

The image on the right shows the same scene captured in infrared and visible light. The contrast in this infrared image is clearer than the infrared image above. This is because it was captured directly and not reflected off glass. Infrared imagery has the added advantage that it is invariant to lighting conditions, so works in the pitch dark. Visible images have much greater resolution and show much more detail for static objects. Things such as bottles, pens, books etc. that are at room temperature do not appear very clear at all in infrared images. Notice the infrared reflection in the computer monitor.