Antonio
Torralba


Associate Professor
Esther and Harold E. Edgerton Associate Professor

Computer Science and Artificial Intelligence Laboratory
Dept. of Electrical Engineering and Computer Science
Massachussetts Institute of Technology

Office: 32-D432, 32 Vassar Street
             Cambridge, MA 02139

 

 
 

My research is in the areas of computer vision, machine learning and human visual perception. I am interested in scene and object recognition, among other things. Scene and object recognition are two related visual tasks generally studied separately. However, by devising systems that solve these tasks in an integrated fashion I believe it is possible to build more efficient and robust recognition systems.

 
 

Resources

LabelMe: the open annotation tool: Help us building a large database of annotated images. Explore the online query tool, Matlab toolbox, Wordnet hierarchy, and the LabelMe gallery.
Bryan Russell, Antonio Torralba and William T. Freeman

80 Million tiny images: explore a dense sampling of the visual world
Antonio Torralba, Rob Fergus, William T. Freeman

Scene Understanding Symposium (SUnS)
Aude Oliva, Thomas Serre, Antonio Torralba
February 2006, 2007.

Course on Recognizing and Learning Object Categories
Li Fei-Fei, Rob Fergus, Antonio Torralba
ICCV 2005, CVPR 2007.

The context challenge: How far can you go before having to run an object detector?


Gallery



Code

Gist, scene recognition


A simple object detector with boosting


Eye movements and attention


LabelMe toolbox and database



Publications


Recent publications

SIFT flow: dense correspondence across different scenes
C. Liu, J. Yuen, A. Torralba, J. Sivic, and W. T. Freeman
European Conference on Computer Vision (ECCV), 2008.

Small codes and large databases for recognition
A. Torralba, R. Fergus, Y. Weiss
IEEE Computer Vision and Pattern Recognition, June 2008.
Project page

Object Recognition by Scene Alignment
B. C. Russell, A. Torralba, C. Liu, R. Fergus, W. T. Freeman.
Advances in Neural Information Processing Systems, 2007.
Project page

80 million tiny images: a large dataset for non-parametric object and scene recognition
A. Torralba, R. Fergus, W. T. Freeman
In press, IEEE Transactions on Pattern Analysis and Machine Intelligence.
Project page

The role of context in object recognition
A. Oliva, A. Torralba
Trends in Cognitive Sciences, vol. 11(12), pp. 520-527. December 2007.

LabelMe: a database and web-based tool for image annotation
B. Russell, A. Torralba, K. Murphy, W. T. Freeman
International Journal of Computer Vision, pages 157-173, Volume 77, Numbers 1-3, May, 2008.
Project page


By topics

Large image databases for recognition

Object Recognition by Scene Alignment
B. C. Russell, A. Torralba, C. Liu, R. Fergus, W. T. Freeman.
Advances in Neural Information Processing Systems, 2007.

80 million tiny images: a large dataset for non-parametric object and scene recognition
A. Torralba, R. Fergus, W. T. Freeman
In press, IEEE Transactions on Pattern Analysis and Machine Intelligence.
Project page

LabelMe: a database and web-based tool for image annotation
B. Russell, A. Torralba, K. Murphy, W. T. Freeman
To appear in the International Journal of Computer Vision, 2007.
Project page

Dataset Issues in Object Recognition
J. Ponce, T. L. Berg, M. Everingham, D. A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid,
B. C. Russell, A. Torralba, C. K. I. Williams, J. Zhang, and A. Zisserman.
In Toward Category-Level Object Recognition. Springer-Verlag Lecture Notes in Computer Science, J. Ponce, M. Hebert, C. Schmid, and A. Zisserman (eds.), 2006.

Integrated generative models of scenes and objects

Describing Visual Scenes Using Transformed Objects and Parts
E. Sudderth, A. Torralba, W. T. Freeman, and A. Willsky.
To appear in the International Journal of Computer Vision, 2007.

Depth from Familiar Objects: A Hierarchical Model for 3D Scenes
E. Sudderth, A. Torralba, W. T. Freeman, and A. Wilsky
CVPR, June 2006.
Dataset

Describing Visual Scenes using Transformed Dirichlet Processes
E. Sudderth, A. Torralba, W. T. Freeman, and A. Wilsky
NIPS 2005.

Learning Hierarchical Models of Scenes, Objects, and Parts
E. Sudderth, A. Torralba, W. T. Freeman, and A. Wilsky
ICCV 2005.

Object recognition in context

Object detection and localization using local and global features
K. Murphy, A. Torralba, D. Eaton, W. T. Freeman
Lecture Notes in Computer Science (unrefeered). Sicily workshop on object recognition, 2005.

Contextual Models for Object Detection using Boosted Random Fields
A. Torralba, K. P. Murphy and W. T. Freeman
Adv. in Neural Information Processing Systems 17 (NIPS), pp. 1401-1408, 2005.
pdf | bibtex

Contextual priming for object detection
A. Torralba
International Journal of Computer Vision, Vol. 53(2), 169-191, 2003.

Context-based vision system for place and object recognition
A. Torralba, K. P. Murphy, W. T. Freeman and M. A. Rubin
IEEE Intl. Conference on Computer Vision (ICCV), Nice, France, October 2003.
Code and datasets

Using the forest to see the trees: a graphical model relating features, objects and scenes
P. Murphy, A. Torralba and W. T. Freeman
Adv. in Neural Information Processing Systems 16 (NIPS), Vancouver, BC, MIT Press, 2003.

Contextual modulation of target saliency.
A. Torralba
Adv. in Neural Information Processing Systems 14 (NIPS), MIT Press, 2001.

Statistical context priming for object detection
A. Torralba, P. Sinha
Proceedings of the International Conference on Computer Vision (ICCV), pp. 763-770, Vancouver, Canada, 2001.

Detecting faces in impoverished images
A. Torralba, P. Sinha
AI Memo 2001-028, CBCL Memo 208, 2001.

Sharing features for object detection

Sharing visual features for multiclass and multiview object detection
A. Torralba, K. P. Murphy and W. T. Freeman
IEEE Transactions on Pattern Analysis and Machine Intelligence , vol. 29, no. 5, pp. 854-869, May, 2007.
Code | bibtex

Sharing features: efficient boosting procedures for multiclass object detection
A. Torralba, K. P. Murphy and W. T. Freeman
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR). pp 762-769, 2004.

Shared features for multiclass object detection
A. Torralba, K. P. Murphy, W. T. Freeman
Towards Category-Level Object Recognition. Springer Lecture Notes in Computer Science (invited submission).

Scene recognition

Building the Gist of a Scene: The Role of Global Image Features in Recognition
A. Oliva, and A. Torralba
Visual Perception, Progress in Brain Research, vol 155. 2006.

Statistics of natural image categories
A. Torralba and A. Oliva
Network: computation in neural systems, Vol. 14, 391-412. 2003.

Depth estimation from image structure
A. Torralba, A. Oliva
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24(9): 1226-1238. 2003.

Scene-Centered Description from Spatial Envelope Properties
A. Oliva, A. Torralba
In Proc. 2nd Workshop on Biologically Motivated Computer Vision (BMCV'02), Tubingen, Germany. 2002.

Modeling the shape of the scene: a holistic representation of the spatial envelope
A. Oliva, A. Torralba
International Journal of Computer Vision, Vol. 42(3): 145-175, 2001.
Code | Datasets | LabelMe

Semantic organization of scenes using discriminant structural templates
A. Torralba, A. Oliva
Proceedings of the International Conference on Computer Vision, pp. 1253-1258, Korfu, Grece, 1999.

Global depth perception from familiar scene structure
A. Torralba, A. Oliva
AI-Memo 2001-036, CBCL Memo 213, 2001.

Indoor scene recognition
A. Torralba, P. Sinha
AI Memo 2001-015, CBCL Memo 202, 2001

Attention and eye movements on natural scenes

Contextual Guidance of Attention in Natural scenes: The role of Global features on object search
A. Torralba, A. Oliva, M. Castelhano and J. M. Henderson
Psychological Review. Vol 113(4) 766-786, Oct, 2006.
Project page

Human Learning of Contextual Priors for Object Search: Where does the time go?
B. Hidalgo-Sotelo, A. Oliva, and A. Torralba
Proceedings of the 3rd Workshop on Attention and Performance in Computer Vision at the Int. CVPR, 2005.

Contextual Influences on Saliency
A. Torralba
Neurobiology of Attention, Eds. L. Itti, G. Rees and J. Tsotsos. Pages 586-593. Academic Press / Elsevier. 2005

Saliency, objects and scenes: global scene factors in attention and object detection
A. Torralba, A. Oliva, M. Castelhano and J. M. Henderson
Vision Sciences Society Annual Meeting, Sarasota. 2004.

Modeling global scene factors in attention
A. Torralba
Journal of Optical Society of America A. Special Issue on Bayesian and Statistical Approaches to Vision. Vol. 20(7): 1407-1418, 2003.

Top-down control of visual attention in object detection
A. Oliva, A. Torralba, M. S. Castelhano and J. M. Henderson
Proceedings of the IEEE International Conference on Image Processing. Vol. I, pages 253-256; September 14-17, in Barcelona, Spain, 2003.

Computer graphics

Hybrid images
A. Oliva, A. Torralba and P. Schyns
ACM Transactions on Graphics, ACM Siggraph, 25-3, pp. 527-530. 2006.

Motion magnification
C. Liu, A. Torralba, W.T. Freeman, F. Durand and E.H. Adelson
ACM Trans. on Graphics, ACM Siggraph, 24-3, pp. 519-526, 2005.

Random Lens Imaging
R. Fergus, A. Torralba, W. T. Freeman
MIT CSAIL Technical Report 2006-058, 2006.

Shape and motion estimation

Shape from sheen. Three dimensional shape perception
R. W. Fleming, A. Torralba, and E. H. Adelson
(Eds.) Zaidi, Q., Springer

Specular reflections and the perception of shape
R. W. Fleming, A. Torralba and E. H. Adelson
Journal of Vision. Volume 4, Number 9, Article 10, Pages 798-820. 2004.

An Ensemble Prior of Image Structure for Cross-modal Inference
S. Ravela, A. Torralba, W. T. Freeman
ICCV 2005

Properties and applications of shape recipes
A. Torralba and W. T. Freeman
IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Madison, WI, June, 2003.

Shape Recipes: Scene Representations that Refer to the Image
W. T. Freeman, A. Torralba
Adv. in Neural Information Processing Systems 15 (NIPS), MIT Press.

An efficient neuromorphic analog network for motion estimation
A. Torralba, J. Hérault
IEEE Transactions on Circuits and Systems-I. Special Issue on Bio-Inspired Processors and CNNs for Vision. Vol. 46(2): 269-280, 1999.