Summary

Object detection consists in automatically localizing and recognizing in images the presence of objects belonging to a predefined set of categories, and it is still a challenging computer vision task. This is due to the presence of partial occlusions, dynamic backgrounds, and foreground clutter, like in people detection (Tu et al., 2008). The problem becomes more complex when there is the simultaneous presence of multiple objects and object categories (Lim et al., 2010) (Lim et al., 2011). In these cases, exploiting the knowledge about the spatial context of an object with respect to another can make the detection more robust (Siyahjani & Doretto, 2012). There are also cases where detections must happen regardless of the orientation of an object, and therefore designing rotation invariant representations may become important (Doretto & Yao, 2010). When instead there is the interest in detecting only one specific object, it is very easy to overfit the detector. A more robust approach is to leverage data from multiple objects and train a detector that can be adapted to be effective on a target with only one data sample, a.k.a. one-shot training or transfer learning (Yao & Doretto, 2010). Sometimes, instead, auxiliary information might be available during the detector training, and this information could be used to design a more robust training procedure (Motiian et al., 2016).

People Detection

3 minute read

People detection and tracking in video are fundamental Computer Vision capabilities that still constitute a research challenge. Important difficulties are du...

Dictionary Learning

2 minute read

Recent successes in the use of sparse coding for many computer vision applications have triggered the attention towards the problem of how an over-complete d...

Exemplar-based Object Layout

2 minute read

Recognizing the presence of object classes in an image, or image classification, has become an increasingly important topic of interest. Equally important, h...

Aerial Video Analysis

8 minute read

In aerial video moving objects of interest are typically very small, and being able to detect them is key to enable tracking. There are detection methods tha...

References

  1. CVPR
    Information bottleneck learning using privileged information for visual recognition Motiian, S., Piccirilli, M., Adjeroh, D., and Doretto, G. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2016. abstract bibTeX pdf
  2. ACCV
    Learning a Context Aware Dictionary for Sparse Representation Siyahjani, F., and Doretto, G. In Proceedings of The Asian Conference on Computer Vision, 2012. Oral abstract bibTeX pdf
  3. ISVC
    Multi-class Object Layout with Unsupervised Image Classification and Object Localization Lim, S., Doretto, G., and Rittscher, J. In International Symposium on Visual Computing, 2011. Oral abstract bibTeX pdf
  4. CVPR
    Region Moments: Fast invariant descriptors for detecting small image structures Doretto, G., and Yao, Y. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. abstract bibTeX pdf
  5. ECCVW
    Object Constellations: Scalable, simultaneous detection and recognition of multiple specific objects Lim, S., Doretto, G., and Rittscher, J. In Proceedings of the ECCV Workshop on Vision for Cognitive Tasks, 2010. Oral abstract bibTeX pdf
  6. CVPR
    Boosting for transfer learning with multiple sources Yao, Y., and Doretto, G. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010. abstract bibTeX pdf
  7. ECCV
    Unified crowd segmentation Tu, P., Sebastian, T., Doretto, G., Krahnstoever, N., Rittscher, J., and Yu, T. In Proceedings of European Conference on Computer Vision, 2008. abstract bibTeX pdf