Dr. T.E.J. (Thomas) Mensink

Faculty of Science
Informatics Institute
Photographer: Monique Kooijmans

Visiting address
  • Science Park 904
Postal address
  • Postbus 94323
    1090 GH Amsterdam
Contact details
  • Profile

    Assistant Professor at Computer Vision Group

    Since February 2017, I have been an Assistant Professor in the Computer Vision group (headed by Prof. Theo Gevers) of the Informatics Institute of the Faculty of Science (FNWI). My research focuses on 3DDL: combining 3D computer vision with deep learning.

    Before becoming an assistant professor, I was:

    • Visiting Researcher at UC Berkeley (Prof. T. Darrell) for 3 months in 2016.
    • PostDoc researcher at the ISIS group, since 2012
    • PhD researcher/student at LEAR/TOTH group of INRIA Grenoble and the Computer Vision group of XRCE (2009-2012)
    • MSc AI student at the University of Amsterdam (2002-2007)
  • Publications

    2021

    • Lê, H-A., Mensink, T., Das, P., Karaoglu, S., & Gevers, T. (2021). EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes. In 2021 IEEE Winter Conference on Applications of Computer Vision: proceedings : 5-9 January 2021, virtual event (pp. 1578-1588). (WACV). IEEE Computer Society. https://doi.org/10.1109/WACV48630.2021.00162 [details]

    2020

    2019

    • Cappallo, S., Svetlichnaya, S., Garrigues, P., Mensink, T., & Snoek, C. G. M. (2019). New Modality: Emoji Challenges in Prediction, Anticipation, and Retrieval. IEEE Transactions on Multimedia, 21(2), 402-415. https://doi.org/10.1109/TMM.2018.2862363 [details]
    • Chen, Y., Mensink, T. E. J., & Gavves, E. (2019). 3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation. In International Conference on 3D Vision https://arxiv.org/abs/1910.01460
    • Galama, Y., & Mensink, T. (2019). IterGANs: Iterative GANs to learn and control 3D object transformation. Computer Vision and Image Understanding, 189, [102803]. https://doi.org/10.1016/j.cviu.2019.102803 [details]
    • Ibrahimi, S., Chen, S., Arya, D., Câmara, A., Chen, Y., Crijns, T., ... Mettes, P. (2019). Interactive Exploration of Journalistic Video Footage through Multimodal Semantic Matching. In MM'19: proceedings of the 27th ACM Conference on Multimedia : October 21-25, 2019, Nice, France (pp. 2196-2198). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/3343031.3350597 [details]

    2018

    • Guerriero, S., Caputo, B., & Mensink, T. E. J. (2018). Deep Nearest Class Mean Classifiers. In International Conference on Learning Representations Workshops OpenReview.
    • Le, H-A., Baslamisli, A. S., Mensink, T., & Gevers, T. (2018). Three for one and one for three: Flow, Segmentation, and Surface Normals. In British Machine Vision Conference 2018: BMVC 2018, Newcastle, UK, September 3-6, 2018 [201] BMVA Press. [details]

    2017

    • Bolles, R., Burns, J. B., Graciarena, M., Kathol, A., Lawson, A., McLaren, M., & Mensink, T. (2017). Spotting Audio-Visual Inconsistencies (SAVI) in Manipulated Video. In 30th IEEE Conference on Computer Vision and Pattern Recognition Workshops: CVPRW 2017 : 21-26 July 2017, Honolulu, Hawaii : proceedings (pp. 1907-1914). IEEE Computer Society. https://doi.org/10.1109/CVPRW.2017.238 [details]
    • Habibian, A., Mensink, T., & Snoek, C. G. M. (2017). Video2vec Embeddings Recognize Events when Examples are Scarce. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(10), 2089-2103. https://doi.org/10.1109/TPAMI.2016.2627563 [details]
    • Mensink, T., Jongstra, T., Mettes, P., & Snoek, C. G. M. (2017). Music-Guided Video Summarization using Quadratic Assignments. In ICMR '17: proceedings of the 2017 ACM International Conference on Multimedia Retrieval : June 6-9, 2017, Bucharest, Romania (pp. 58-64). New York, NY: The Association for Computing Machinery. https://doi.org/10.1145/3078971.3079024 [details]

    2016

    • Cappallo, S., Mensink, T., & Snoek, C. G. M. (2016). Video Stream Retrieval of Unseen Queries using Semantic Memory. In R. C. Wilson, E. R. Hancock, & W. A. P. Smith (Eds.), Proceedings of the British Machine Vision Conference: BMVC 2016 [143] BMVA Press. https://doi.org/10.5244/C.30.143 [details]
    • Kordumova, S., Mensink, T., & Snoek, C. G. M. (2016). Pooling Objects for Recognizing Scenes without Examples. In ICMR'16: proceedings of the 2016 ACM International Conference on Multimedia Retrieval: June 6-9, 2016, New York, NY, USA (pp. 143-150). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2911996.2912007 [details]

    2015

    • Cappallo, S., Mensink, T., & Snoek, C. G. M. (2015). Image2Emoji: Zero-shot Emoji Prediction for Visual Media. In MM'15: proceedings of the 2015 ACM Multimedia Conference: October 26-30, 2015, Brisbane, Australia (pp. 1311-1314). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2733373.2806335 [details]
    • Cappallo, S., Mensink, T., & Snoek, C. G. M. (2015). Latent Factors of Visual Popularity Prediction. In ICMR'15: proceedings of the 2015 ACM International Conference on Multimedia Retrieval: June 23-26, 2015, Shanghai, China (pp. 195-202). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2671188.2749405 [details]
    • Cappallo, S., Mensink, T., & Snoek, C. G. M. (2015). Query-by-Emoji Video Search. In MM'15: proceedings of the 2015 ACM Multimedia Conference: October 26-30, 2015, Brisbane, Australia (pp. 735-736). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2733373.2807961 [details]
    • Gavves, E., Mensink, T., Tommasi, T., Snoek, C. G. M., & Tuytelaars, T. (2015). Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks. In Proceedings: 2015 IEEE International Conference on Computer Vision: 11-18 December 2015, Santiago, Chile (pp. 2731-2739). Los Alamitos, CA: IEEE Computer Society. https://doi.org/10.1109/ICCV.2015.313 [details]
    • Habibian, A., Mensink, T., & Snoek, C. G. M. (2015). Discovering Semantic Vocabularies for Cross-Media Retrieval. In ICMR'15: proceedings of the 2015 ACM International Conference on Multimedia Retrieval: June 23-26, 2015, Shanghai, China (pp. 131-138). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2671188.2749403 [details]
    • Jain, M., van Gemert, J. C., Mensink, T., & Snoek, C. G. M. (2015). Objects2action: Classifying and localizing actions without any video example. In Proceedings: 2015 IEEE International Conference on Computer Vision: 11-18 December 2015, Santiago, Chile (pp. 4588-4596). Los Alamitos, CA: IEEE Computer Society. https://doi.org/10.1109/ICCV.2015.521 [details]
    • Mettes, P., van Gemert, J. C., Cappallo, S., Mensink, T., & Snoek, C. G. M. (2015). Bag-of-Fragments: Selecting and encoding video fragments for event detection and recounting. In ICMR'15: proceedings of the 2015 ACM International Conference on Multimedia Retrieval: June 23-26, 2015, Shanghai, China (pp. 427-434). New York, NY: Association for Computing Machinery. https://doi.org/10.1145/2671188.2749404 [details]
    • Nagel, M., Mensink, T., & Snoek, C. G. M. (2015). Event Fisher Vectors: Robust Encoding Visual Diversity of Visual Streams. In X. Xie, M. W. Jones, & G. K. L. Tam (Eds.), Proceedings of the British Machine Vision Conference 2015: BMVC 2015: 7-10 September, Swansea, UK [178] BMVA Press. https://doi.org/10.5244/C.29.178 [details]

    2014

    • Everts, I., van Gemert, J. C., Mensink, T., & Gevers, T. (2014). Robustifying Descriptor Instability using Fisher Vectors. IEEE Transactions on Image Processing, 23(12), 5698-5706. https://doi.org/10.1109/TIP.2014.2365955 [details]
    • Habibian, A., Mensink, T., & Snoek, C. G. M. (2014). Composite Concept Discovery for Zero-Shot Video Event Detection. In ICMR Glasgow 2014: proceedings of the ACM International Conference on Multimedia Retrieval 2014: April 1st-4th, 2014, Glasgow, UK (pp. 17-24). New York: Association for Computing Machinery. https://doi.org/10.1145/2578726.2578746 [details]
    • Habibian, A., Mensink, T., & Snoek, C. G. M. (2014). VideoStory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events. In MM '14: proceedings of the 2014 ACM Conference on Multimedia: November 3-7, 2014, Orlando, Florida, USA (pp. 17-26). New York: ACM. https://doi.org/10.1145/2647868.2654913 [details]
    • Li, Z., Gavves, E., Mensink, T., & Snoek, C. G. M. (2014). Attributes Make Sense on Segmented Objects. In D. Fleet, T. Pajdla, B. Schiele, & T. Tuytelaars (Eds.), Computer Vision – ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014: proceedings (Vol. VI, pp. 350-365). (Lecture Notes in Computer Science; Vol. 8694). Cham: Springer. https://doi.org/10.1007/978-3-319-10599-4_23 [details]
    • Mensink, T., & van Gemert, J. (2014). The Rijksmuseum Challenge: Museum-Centered Visual Recognition. In ICMR Glasgow 2014: proceedings of the ACM International Conference on Multimedia Retrieval 2014: April 1st-4th, 2014, Glasgow, UK (pp. 451-454). New York: Association for Computing Machinery. https://doi.org/10.1145/2578726.2578791 [details]
    • Mensink, T., Gavves, E., & Snoek, C. G. M. (2014). COSTA: Co-Occurrence Statistics for Zero-Shot Classification. In Proceedings: 2014 IEEE Conference on Computer Vision and Pattern Recognition: 23-28 June 2014, Columbus, Ohio (pp. 2441-2448). Los Alamitos, California: IEEE Computer Society. https://doi.org/10.1109/CVPR.2014.313 [details]
    • Snoek, C. G. M., van de Sande, K. E. A., Fontijne, D., Cappallo, S., van Gemert, J., Habibian, A., ... Smeulders, A. W. M. (2014). MediaMill at TRECVID 2014: Searching Concepts, Objects, Instances and Events in Video. In 2014 TREC Video Retrieval Evaluation: notebook papers and slides Gaithersburg, MD: National Institute of Standards and Technology. [details]

    2013

    2007

    • Mensink, T., Kröse, B. J. A., & Zajdel, W. P. (2007). Distributed Appearance Based Tracking using the EM algorithm. In Proceedings of the 2007 First ACM/IEEE International Conference on Distributed Smart Cameras (pp. 178-184). Vienna, Austria: IEEE. [details]

    2013

    • Mensink, T., Verbeek, J., Perronnin, F., & Csurka, G. (2013). Large Scale Metric Learning for Distance-Based Image Classification on Open Ended Data Sets. In G. M. Farinella, S. Battiato, & R. Cipolla (Eds.), Advanced topics in computer vision (pp. 243-276). (Advances in computer vision and pattern recognition). London: Springer. https://doi.org/10.1007/978-1-4471-5520-1_9 [details]

    Award

    • Kordumova, S., Mensink, T. & Snoek, C. G. M. (2016). ICMR Best Paper Prize.
    • Habibian, A., Mensink, T. & Snoek, C. G. M. (2014). ACM Multimedia Best Paper Prize.

    2021

    • Lê, H. -Â. (2021). Outdoor image understanding from multiple vision modalities. [details]
    This list of publications is extracted from the UvA-Current Research Information System.
  • Ancillary activities
    No known ancillary activities