16-824: Learning-based Methods in Vision (S'15)

Sunday, April 26, 2015

Reading for Monday 4/27

Main Reading:
X. Chen, C. Zitnick. Learning a Recurrent Visual Representation for Image Caption Generation, arXiv preprint arXiv:1411.5654 (2014).
Other reading:
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. Berg and T. Berg. Baby Talk: Understanding and Generating Image Descriptions, CVPR, 2011.
V. Ordonez, G. Kulkarni and T. Berg. Im2Text: Describing Images Using 1 Million Captioned Photographs, NIPS, 2011.

Sunday, April 19, 2015

Reading for Wednesday 4/20

X. Chen, A. Shrivastava and A. Gupta. NEIL: Extracting Visual Knowledge from Web Data, ICCV, 2013.

And additionally:

L. Li and F. Li. Optimol: automatic online picture collection via incremental model learning, IJCV, 88.2 (2010): 147-168.

S. Divvala, A. Farhadi and C. Guestrin. Learning Everything about Anything: Webly-Supervised Visual Concept Learning, CVPR, 2014.

Tuesday, April 14, 2015

Reading for Wednesday 4/15

H. Song, R. Girshick, S. Jegelka, J. Mairal, Z. Harchaoui and T. Darrell. On learning to localize objects with minimal supervision, ICML, 2014.

And additionally:

X. Chen, A. Shrivastava and A. Gupta. Enriching Visual Knowledge Bases via Object Discovery and Segmentation, CVPR, 2014.

C. Wang, W. Ren, K. Huang and T. Tan. Weakly Supervised Object Localization with Latent Category Learning, ECCV, 2014.

Sunday, April 12, 2015

Reading for Monday 4/13

A. Shrivastava, S. Singh and A. Gupta. Constrained Semi-Supervised Learning Using Attributes and Comparative Attributes, ECCV, 2012.

And additionally:

R. Fergus, Y. Weiss and A. Torralba. Semi-supervised Learning in Gigantic Image Collections, NIPS, 2009.

S. Sukhbaatar, J. Bruna, M. Paluri, L. Bourdev, R. Fergus. Training Convolutional Networks with Noisy Labels, arXiv preprint arXiv:1406.2080 (2014).

Saturday, April 4, 2015

Reading for Monday 4/6

Q. Le, M. Ranzato, R. Monga, M. Devin, K. Chen, G. Corrado, J. Dean and A. Ng. Building high-level features using large scale unsupervised learning, ICML, 2012.

And additionally:

B. Russell, A. Efros, J. Sivic, B. Freeman, A. Zisserman. Using Multiple Segmentations to Discover Objects and their Extent in Image Collections, CVPR, 2006.

Y. Lee and K. Grauman. Object-Graphs for Context-Aware Visual Category Discovery, CVPR, 2010.

Monday, March 30, 2015

Reading for Wednesday 4/1

B. Yao and F. Li. Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities, CVPR, 2010.

And additionally:

Z. Tu Auto-context and Its Application to High-level Vision Tasks, CVPR, 2008.

D. Hoiem, A.A. Efros, and M. Hebert, Putting Objects in Perspective, IJCV 2008.

Sunday, March 29, 2015

Reading for Monday 3/30

K. Simonyan, A. Zisserman. Two-Stream Convolutional Networks for Action Recognition in Videos, arXiv preprint arXiv:1406.2199 (2014).

And additionally:

A. Jain, A. Gupta, M. Rodriguez, L. Davis. Representing videos using mid-level discriminative patches, CVPR, 2013.

A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, F. Li. Large-scale Video Classification with Convolutional Neural Networks, CVPR, 2014.