Scenery of Shanghai


Program booklet is available.

VenueShanghai International Convention Center
DateJune 23 TuesdayJune 24 WednesdayJune 25 ThursdayJune 26 Friday
 Tutorial & WorkshopMain ConferenceMain ConferenceMain Conference & Practitioner Day
08:30am Registration  
08:45amRegistrationWelcome and IntroductionRegistrationRegistration
09:00amEMR WorkshopTutorial 1Keynote 1Keynote 2Keynote 3
10:00am  Coffee BreakCoffee BreakCoffee Break
10:20am  Oral 1: Image RetrievalOral 4: Analytical Methods for
Multimedia Retrieval
Keynote 4
11:00amCoffee BreakCoffee Break   
11:20am    Invited Industry Talks
12:00pm  Lunch:
Riverside Hall @
1F, International Convention Center
Riverside Hall @
1F, International Convention Center
Riverside Hall @
1F, International Convention Center
Lunch: on your own   
1:40pm Oral 2: Person and ObjectsBest Paper SessionOral 5: Photo Applications
2:20pmTutorial 2Tutorial 1   
3:00pm  Coffee BreakCoffee BreakCoffee Break
3:20pm  Oral 3: ConceptsPanel DiscussionFull Paper Posters & Special Poster Session
3:40pmCoffee BreakCoffee Break   
4:20pm   Special Oral Session 
4:40pm  Short Paper Posters & Demos 
6:20pm  Banquet:
Shanghai Min Restaurant @
7F, International Convention Center
Conference Close
Pearl Room @
7F, International Convention Center

Best Paper Session
Session Chair: Cees Snoek
15Image Classification and Retrieval are ONE.
Lingxi Xie, Richang Hong, Bo Zhang and Qi Tian.
74Social Event Mining in Large Photo Collections.
Maia Zaharieva, Matthias Zeppelzauer, Manfred Del Fabro and Daniel Schopfhauser.
136Unified YouTube Video Recommendation via Cross-network Collaboration.
Ming Yan, Jitao Sang and Changsheng Xu.
241Bridging the Ultimate Semantic Gap: A Semantic Search Engine for Internet Videos.
Lu Jiang, Shoou-I Yu, Deyu Meng, Teruko Mitamura and Alex Hauptmann.
Oral 1: Image Retrieval
Session Chair: Qi Tian
29Fast Democratic Aggregation and Query Fusion for Image Search.
Zhanning Gao, Jianru Xue, Wengang Zhou, Shanmin Pang and Qi Tian. 
51DeepIndex for Accurate and Efficient Image Retrieval.
Yu Liu and Michael S. Lew. 
127Effective, Efficient, and Scalable Unsupervised Distance Learning in Image Retrieval Tasks.
Lucas Valem, Daniel C. G. Pedronette, Ricardo Torres, Edson Borin and Jurandy Almeida.
138Twin Feature and Similarity Maximal Matching for Image Retrieval.
Lei Wang, Hanli Wang and Fengkuangtian Zhu.
173Fusing Pointwise and Pairwise Labels for Supporting User-adaptive Image Retrieval
Lin Chen, Peng Zhang, Baoxin Li.
Oral 2: Person and Objects
Session Chair: Rita Cucchiara
71Facial Action Unit Classification with Hidden Knowledge under Incomplete Annotation.
Jun Wang, Shangfei Wang and Qiang Ji.
118Extracting 3D Trajectories of Objects from 2D Videos using Particle Filter.
Zeyd Boukhers, Kimiaki Shirahama, Frédéric Li and Marcin Grzegorzek.
123Space-time histograms and their application to person re-identification in TV shows.
Rémi Auguste, Jean Martinet and Pierre Tirilly.
131Temporal-Order Preserved Dynamic Quantization for Human Action Recognition from Multimodal Sensor Streams.
Jun Ye, Guo-Jun Qi and Kien Hua.
Oral 3: Concepts
Session Chair: Alan Smeaton
49Fine-Grained Image Categorization by Localizing Tiny Object Parts from Unannotated Images.
Luming Zhang, Yi Yang and Roger Zimmermann. 
83Robust and Discriminative Concept Factorization for Image Representation.
Yuchen Guo and Guiguang Ding. 
246Encoding Concept Prototypes for Video Event Detection and Summarization.
Masoud Mazloom, Amirhossein Habibian, Dong Liu, Cees Snoek and Shih-Fu Chang.
247Discovering Semantic Vocabularies for Cross-Media Retrieval.
Amirhossein Habibian, Thomas Mensink and Cees Snoek.
Oral 4: Analytical Methods for Multimedia Retrieval
Session Chair: Michele Merler
26Nonnegative Sparse Neighborhood Propagation.
Zhao Zhang, Mingbo Zhao, Li Zhang and Li Fanzhang. 
58Hierarchical Encoding of Binary Descriptors for Image Matching.
Zhendong Mao, Yongdong Zhang and Qi Tian. 
111Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification.
Valentin Leveau, Alexis Joly, Olivier Buisson and Patrick Valduriez.
202High-Dimensional Indexing by Sparse Approximation.
Pedro Borges, André Mourão and Joao Magalhaes.
222Diffusion-on-Manifold Aggregation of Local Features for Shape-based 3D Model Retrieval.
Takahiko Furuya and Ryutarou Ohbuchi.
Oral 5: Photo Applications
Session Chair: Benoit Huet
33Bundling Centre For Landmark Image Discovery.
Qian Zhang and Guoping Qiu. 
213To Keep or not to Keep: An Expectation-oriented Photo Selection Method for Personal Photo Collections.
Andrea Ceroni, Vassilios Solachidis, Claudia Niederée, Olga Papadopoulou, Nattiya Kanhabua and Vasileios Mezaris.
249Latent Factors of Visual Popularity Prediction.
Spencer Cappallo, Thomas Mensink and Cees Snoek.
253Visual Event Summarization on Social Media using Topic Modelling and Graph-based Ranking Algorithms.
Manos Schinas, Symeon Papadopoulos, Yiannis Kompatsiaris and Pericles Mitkas.
Special Oral Session: Weakly Supervised Learning for Big Multimedia Data Analysis
Session Chair: Luming Zhang
128Attribute Guided Dictionary Learning.
Wei Wang, Yan Yan and Nicu Sebe
179Online Multi-modal Co-indexing and Retrieval for Weakly Supervised Web Image Collections.
Lei Meng and Chunyan Miao.
217Weakly Supervised Random Forest for Multi-Label Image Clustering and Segmentation.
Yingjie Xia, Qianqian Zhu and Wei Wei.
223Harvesting Multiple Sources for User Profile Learning: a Big Data Study.
Aleksandr Farseev, Mohammad Akbari and Tat-Seng Chua.
Full Paper Posters (note that all the oral papers above will also be presented in the full paper poster session)
Session Chair: Nicu Sebe
1A Privacy-Preserving Bipartite Graph Matching Framework for Multimedia Analysis and Retrieval.
Wei-Ta Chu and Feng-Chi Chang.
18Describing Images with Hierarchical Concepts and Object Class Localization.
Yahong Han and Guang Li.
23Supervised Multi-scale Locality Sensitive Hashing.
Li Weng, I-Hong Jhuo, Miaojing Shi, Meng Sun, Wen-Huang Cheng and Laurent Amsaleg.
53A Novel Visual-Region-Descriptor-based Approach to Sketch-based Image Retrieval.
Cheng Jin and Yuejie Zhang.
63Location Prediction of Social Images via Generative Model.
Xiaoming Zhang and Zhoujun Li.
69Scalable Multimodal Search with Distributed Indexing by Sparse Hashing.
André Mourão and Joao Magalhaes.
73Insight in Image Collections by Multimedia Pivot Tables.
Marcel Worring and Dennis Koelma.
82Distribution Regularized Nonnegative Matrix Factorization for Transfer Visual Feature Learning.
Yuchen Guo and Guiguang Ding.
94Heterogeneous Semantic Level Features Fusion for Action Recognition.
Junjie Cai, Michele Merler, Sharath Pankanti and Qi Tian.
101Social Friend Recommendation Based on Network Correlation and Feature Co-Clustering.
Shangrong Huang, Jian Zhang, Shiyang Lu and Xian-Sheng Hua.
125Improving Diversity in Image Search via Supervised Relevance Scoring.
Eleftherios Spyromitros-Xioufis, Symeon Papadopoulos, Alexandru Lucian Ginsca, Adrian Popescu, Yiannis Kompatsiaris and Ioannis Vlahavas.
126Unsupervised Distance Learning by Rank Correlation Measures for Image Retrieval.
Cesar Okada, Daniel C. G. Pedronette and Ricardo Torres.
130Exploring Pooling Strategies based on Idiosyncrasies of Spatio-Temporal Interest Points.
Yuancheng Ye, Xiaodong Yang and Yingli Tian.
133Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning.
Jian Wang, Yonghao He, Cuicui Kang, Shiming Xiang and Chunhong Pan.
158Location-Based Parallel Tag Completion for Geo-tagged Social Photo Retrieval.
Jiaming Zhang, Shuhui Wang and Qingming Huang.
161Exploiting Spatial Relationship between Scenes for Hierarchical Video Geotagging.
Yifang Yin, Luming Zhang and Roger Zimmermann.
Prithwi Chakraborty, Ligang Zhang, Dian Tjondronegoro and Vinod Chandran.
189A Deep Neural Network for Modeling Music.
Pengjing Zhang, Xiaoqing Zheng, Wenqiang Zhang, Siyan Li, Sheng Qian, Wenqi He, Shangtong Zhang and Ziyuan Wang.
201Robust Seed Localization and Growing with Deep Convolutional Features for Scene Text Detection.
Hailiang Xu and Feng Su.
214Swap Retrieval: Retrieving images of cats when the query shows a dog.
Amir Ghodrati, Xu Jia, Marco Pedersoli and Tinne Tuytelaars.
224Graph Learning on K Nearest Neighbours for Automatic Image Annotation.
Feng Su and Like Xue.
227Scalable organization of collections of motion capture data via quantitative and qualitative analysis.
Songle Chen, Sun Zhengxing and Yan Zhang.
240Content-Based Video Search over 1 Million Videos with 1 Core in 1 Second.
Shoou-I Yu, Lu Jiang, Zhongwen Xu, Yi Yang and Alexander Hauptmann.
248Bag-of-Fragments: Selecting and encoding video fragments for event detection and recounting.
Pascal Mettes, Jan van Gemert, Spencer Cappallo, Thomas Mensink and Cees Snoek.
251Evaluating Two-Stream CNN for Video Classification.
Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang and Xiangyang Xue.
Short Paper Posters
Session Chair: Marcel Worring
3Content-based Image Retrieval Using Rotation-invariant Histograms of Oriented Gradients.
Jinhui Chen, Toru Nakashika, Tetsuya Takiguchi and Yasuo Ariki.
4Augmented Feature Fusion for Image Retrieval System.
Yang Zhou, Dan Zeng, Shiliang Zhang and Qi Tian.
30Parallel AP Clustering and Re-ranking for Automatic Image-Text Alignment and Large-Scale Web Image Search.
Jianping Fan.
37Accio: A Data Set for Face Track Retrieval in Movies Across Age.
Esam Ghaleb, Makarand Tapaswi, Ziad Al-Halah, Hazim Kemal Ekenel and Rainer Stiefelhagen.
38A Two-step Approach to Cross-modal Hashing.
Kaiye Wang, Wei Wang, Liang Wang and Ran He.
39Cross-Scenario Eyeglasses Retrieval via EGYPT Model.
Xiaoling Gu, Pai Peng, Mengwen Li, Lidan Shou and Gang Chen.
52People News Search via Name-Face Association Analysis.
Cheng Jin and Yuejie Zhang.
57Discovering the Latent Similarities of the KNN Graph by Metric Transformation.
Zhenzhong Kuang, Zongmin Li, Yujie Liu and Jianping Fan.
59Formation period matters: Towards socially consistent group detection via dense subgraph seeking.
Yanhao Zhang, Lei Qin, Shengping Zhang, Hongxun Yao and Qingming Huang.
60Memory vectors for particular object retrieval with multiple queries.
Ronan Sicre and Hervé Jégou.
62Semantic-aware Hashing for Social Image Retrieval.
Jinhui Tang and Zechao Li.
67Zero-shot Image Categorization via Image Correlation Exploration.
Lianli Gao, Jingkuan Song, Junming Shao, Xiaofeng Zhu and Hengtao Shen.
76Deep Bottleneck Feature for Image Classification.
Yan Song, Mcloughlin Ian and Li-Rong Dai.
78Maximally Visual-Homogeneous Region Detector for Large Scale Image Retrieval.
Gang Wang, Ke Gao, Yongdong Zhang and Jintao Li.
87Rapid Clothing Retrieval via Deep Learning of Binary Codes and Hierarchical Search.
Kevin Lin, Huei-Fang Yang, Kuan-Hsien Liu, Jen-Hao Hsiao and Chu-Song Chen.
93Information gain study for visual vocabulary construction.
Huu Ton Le, Thierry Urruty, Syntyche Gbehounou, Francois Lecellier and Christine Fernandez.
97Discriminative Latent Feature Space Learning for Cross-Modal Retrieval.
Xu Tang, Cheng Deng and Xinbo Gao.
103Image Retrieval by User-oriented Ranking.
Xueming Qian, Dan Lu and Xiaoxiao Liu.
105Spatial Constraint for Image Location Estimation.
Yisi Zhao and Xueming Qian.
115Shape-based Object Matching Using Point Context.
Cong Yang, Christian Feinen, Oliver Tiebe, Kimiaki Shirahama and Marcin Grzegorzek.
117Large Scale Image Annotation via Deep Representation Learning and Tag Embedding Learning.
Yonghao He, Jian Wang, Cuicui Kang, Shiming Xiang and Chunhong Pan.
124Probabilistic Matrix Factorization With Semantic And Visual Neighborhoods For Image Tag Completion.
Dimitrios Rafailidis.
129Exploiting multiple web resources towards collecting positive training samples for visual concept learning.
Olga Papadopoulou and Vasileios Mezaris.
134CRMActive : An Active Learning Based Approach For Effective Video Annotation And Retrieval.
Moitreya Chatterjee and Anton Leuski.
135Personalized Egocentric Video Summarization for Cultural Experience.
Patrizia Varini, Giuseppe Serra and Rita Cucchiara.
140EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval.
Shen Jialie, Mei Tao, Dacheng Tao, Xuelong Li and Yong Rui.
142Specific Person Retrieval via Incomplete Text Description.
Mang Ye, Chao Liang, Zheng Wang, Qingming Leng, Jun Chen and Jun Liu.
146Combining generic and specific information for cross-modal retrieval.
Thi Quynh Nhi Tran, Hervé Le Borgne and Michel Crucianu.
1483D Sketch-Based 3D Model Retrieval.
Bo Li, Yijuan Lu, Azeem Ghumman, Bradley Strylowski, Mario Gutierrez, Safiyah Sadiq, Scott Forster and Natacha Feola, Travis Bugerin.
151Boosting Prediction of Geo-location for Web Images Through Integrating Multiple Knowledge Sources.
Hao Kuang, Shiai Zhu and Abdulmotaleb El Saddik.
162Expression Recognition from Visible Images with the Help of Thermal Images.
Xiaoxiao Shi, Shangfei Wang and Yachen Zhu.
164Multi-facet Learning using Deep Convolutional Neural Network for Person-Related Categories in Photos.
Liangliang Cao, Zhicheng Yan and John R. Smith.
176Sketch-based Image Retrieval via Shape Words.
Changcheng Xiao, Changhu Wang, Liqing Zhang and Lei Zhang.
180Multiple Aesthetic Attribute Assessment by Exploiting Relations Among Aesthetic Attributes.
Zhen Gao, Shangfei Wang and Qiang Ji.
181Emotion recognition from users' eeg signals using hierarchical bayesian model with privileged information.
Zhen Gao and Shangfei Wang.
184Multi-Label Active Learning with Chi-Square Statistics for Image Classification.
Chen Ye, Jian Wu, Victor S. Sheng, Shiquan Zhao, Pengpeng Zhao and Zhiming Cui.
188Multiple Measurements and Joint Dimensionality Reduction for Large Scale Image Search with Short Vectors.
Filip Radenovic, Herve Jegou and Ondrej Chum.
193Exploring EEG for Object Detection and Retrieval.
Eva Mohedano, Amaia Salvador, Sergi Porta, Xavier Giro-I-Nieto, Kevin McGuinness, Graham Healy and Noel O'Connor.
221Kernel Local Descriptors with Implicit Rotation Matching.
Andrei Bursuc, Giorgos Tolias and Herve Jegou.
229Semantic Concept Annotation for User Generated Videos Using Soundtracks.
Qin Jin, Junwei Liang, Xixi He, Gang Yang, Jieping Xu and Xirong Li.
230Automatic Image Annotation using Deep Learning Representations.
Venkatesh N. Murthy, Subhransu Maji and R. Manmatha.
231“My Day in Review” Visually Summarising Noisy Lifelog Data.
Soumyadeb Chowdhury, Philip J. McParlane, Md. Sadek Ferdous and Joemon Jose.
232Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling.
Khalid Ashraf, Benjamin Elizalde, Forrest Iandola, Matthew Moskewicz, Julia Bernd, Gerald Friedland and Kurt Keutzer.
236Learning Binary Codes for Hashing via Feature Decomposition.
Xiao-Jiao Mao, Zhen-Fei Ju and Yu-Bin Yang.
243Multimodal Learning with Deep Boltzmann Machine for Emotion Prediction in User Generated Videos.
Lei Pang and Chong-Wah Ngo.
244Improving Automatic Name-Face Association using Celebrity Images on the Web.
Zhineng Chen, Bailan Feng, Chong-Wah Ngo, Caiyan Jia and Xiangsheng Huang.
Special Poster Session: Person Search and Verification from Rich Media Data
Session Chair: Nicu Sebe
96End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning.
Liliang Zhang, Liang Lin, Xian Wu, Shengyong Ding and Lei Zhang.
166Teaching Video Analytics Based on Human Behavior Mining.
Jinxian Qin, Hong Lu, Yaqian Zhou and Heqing Ya.
254Multi-view Face Detection Using Deep Convolutional Neural Networks.
Sachin Sudhakar Farfade, Mohammad Saberian and Li-Jia Li.
Session Chair: Benoit Huet
100Music Positioning and Annotation For Television Videos.
Gang Yang, Jieping Xu and Xirong Li.
149KinectSBR: A Kinect-Assisted 3D Sketch-Based 3D Model Retrieval System.
Bo Li, Yijuan Lu, Azeem Ghumman, Bradley Strylowski, Mario Gutierrez, Safiyah Sadiq, Scott Forster and Natacha Feola, Travis Bugerin.
153An Improved System For Real-Time Scene Text Recognition.
Haojin Yang, Cheng Wang, Xiaoyin Che, Sheng Luo and Christoph Meinel.
198DigInPix: visual named-entities identification in images and videos.
Pierre Letessier, Nicolas Hervé, Alexis Joly, Hakim Nabi, Mathieu Derval and Olivier Buisson.
256Mobile Media Thumbnailing.
Yingying Chen, Jinqiao Wang, Jing Liu and Hanqing Lu.
257IdeaPanel: A Large Scale Interactive Sketch-based Image Search System.
Changcheng Xiao, Changhu Wang, Liqing Zhang and Lei Zhang.
258A Sparse Ensemble Learning System For Efficient Semantic Indexing.
Sheng Tang, Hui Chen, Yu Li, Jun-Bin Xiao and Jin-Tao Li.
260A Multi-Sensory Gesture-Based Occupational Therapy Environment for Controlling Home Appliances.
Ahmed Khan, Syed Osama Hussain, Ahmad Qamar, Dr. Mohamed Abdur Rahman and Saleh Basalamah.
261Incremental Multimodal Query Construction for Video Search.
Shicheng Xu, Huan Li, Xiaojun Chang, Shoou-I Yu, Xingzhong Du, Xuanchong Li, Lu Jiang, Zexi Mao, Zhenzhong Lan, Susanne Burger and Alexander Hauptmann.
Invited Industry Talks
Session Chair: Jialie Shen
Title: From Perception to Intelligence
Presenter: Yong Zhao (CTO of DeepGlint)
Title: Frontier of iQiyi's Big Data Practices
Presenter: Chen Yang (Chief Architect, iQIYI)
Panel Discussion: Video Search in Big Data Era
Panel Facilitator: Alex Hauptmann
Panelists (tentative): Michael Witbrock, Alan Smeaton, Cees Snoek, Michele Merler
The popularity of smart mobile devices and social networks has resulted in a huge amount of data being shared on the Internet. In addition to YouTube and Flickr, several hugely popular newly emerging sites are largely video-based, including SnapChat and Vine. An increasingly higher proportion of information being shared on Internet is now in images/videos. We are now entering the era of big data for videos. Given the availability of many live, bigger quantity and higher variety of video data, how would this affect the ways we conduct research on video search? What metadata is now available and can be used to improve the analysis and search? Also, what role is there for content analysis if metadata is widely available? Finally, what new novel applications are possible or should be explored to take advantage of this trend?
This panel will examine these issues and discuss directions of video research and applications.

Sponsors and Partners