Best Paper Session
Session Chair: Cees Snoek |
15 | Image Classification and Retrieval are ONE. Lingxi Xie, Richang Hong, Bo Zhang and Qi Tian. |
74 | Social Event Mining in Large Photo Collections. Maia Zaharieva, Matthias Zeppelzauer, Manfred Del Fabro and Daniel Schopfhauser. |
136 | Unified YouTube Video Recommendation via Cross-network Collaboration. Ming Yan, Jitao Sang and Changsheng Xu. |
241 | Bridging the Ultimate Semantic Gap: A Semantic Search Engine for Internet Videos. Lu Jiang, Shoou-I Yu, Deyu Meng, Teruko Mitamura and Alex Hauptmann. |
Oral 1: Image Retrieval
Session Chair: Qi Tian |
29 | Fast Democratic Aggregation and Query Fusion for Image Search. Zhanning Gao, Jianru Xue, Wengang Zhou, Shanmin Pang and Qi Tian. |
51 | DeepIndex for Accurate and Efficient Image Retrieval. Yu Liu and Michael S. Lew. |
127 | Effective, Efficient, and Scalable Unsupervised Distance Learning in Image Retrieval Tasks. Lucas Valem, Daniel C. G. Pedronette, Ricardo Torres, Edson Borin and Jurandy Almeida. |
138 | Twin Feature and Similarity Maximal Matching for Image Retrieval. Lei Wang, Hanli Wang and Fengkuangtian Zhu. |
173 | Fusing Pointwise and Pairwise Labels for Supporting User-adaptive Image Retrieval. Lin Chen, Peng Zhang and Baoxin Li. |
Oral 2: Person and Objects
Session Chair: Rita Cucchiara |
71 | Facial Action Unit Classification with Hidden Knowledge under Incomplete Annotation. Jun Wang, Shangfei Wang and Qiang Ji. |
118 | Extracting 3D Trajectories of Objects from 2D Videos using Particle Filter. Zeyd Boukhers, Kimiaki Shirahama, Frédéric Li and Marcin Grzegorzek. |
123 | Space-time histograms and their application to person re-identification in TV shows. Rémi Auguste, Jean Martinet and Pierre Tirilly. |
131 | Temporal-Order Preserved Dynamic Quantization for Human Action Recognition from Multimodal Sensor Streams. Jun Ye, Guo-Jun Qi and Kien Hua. |
Oral 3: Concepts
Session Chair: Alan Smeaton |
49 | Fine-Grained Image Categorization by Localizing Tiny Object Parts from Unannotated Images. Luming Zhang, Yi Yang and Roger Zimmermann. |
83 | Robust and Discriminative Concept Factorization for Image Representation. Yuchen Guo and Guiguang Ding. |
246 | Encoding Concept Prototypes for Video Event Detection and Summarization. Masoud Mazloom, Amirhossein Habibian, Dong Liu, Cees Snoek and Shih-Fu Chang. |
247 | Discovering Semantic Vocabularies for Cross-Media Retrieval. Amirhossein Habibian, Thomas Mensink and Cees Snoek. |
Oral 4: Analytical Methods for Multimedia Retrieval
Session Chair: Michele Merler |
26 | Nonnegative Sparse Neighborhood Propagation. Zhao Zhang, Mingbo Zhao, Li Zhang and Fanzhang Li. |
58 | Hierarchical Encoding of Binary Descriptors for Image Matching. Zhendong Mao, Yongdong Zhang and Qi Tian. |
111 | Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification. Valentin Leveau, Alexis Joly, Olivier Buisson and Patrick Valduriez. |
202 | High-Dimensional Indexing by Sparse Approximation. Pedro Borges, André Mourão and Joao Magalhaes. |
222 | Diffusion-on-Manifold Aggregation of Local Features for Shape-based 3D Model Retrieval. Takahiko Furuya and Ryutarou Ohbuchi. |
Oral 5: Photo Applications
Session Chair: Benoit Huet |
33 | Bundling Centre For Landmark Image Discovery. Qian Zhang and Guoping Qiu. |
213 | To Keep or not to Keep: An Expectation-oriented Photo Selection Method for Personal Photo Collections. Andrea Ceroni, Vassilios Solachidis, Claudia Niederée, Olga Papadopoulou, Nattiya Kanhabua and Vasileios Mezaris. |
249 | Latent Factors of Visual Popularity Prediction. Spencer Cappallo, Thomas Mensink and Cees Snoek. |
253 | Visual Event Summarization on Social Media using Topic Modelling and Graph-based Ranking Algorithms. Manos Schinas, Symeon Papadopoulos, Yiannis Kompatsiaris and Pericles Mitkas. |
Special Oral Session: Weakly Supervised Learning for Big Multimedia Data Analysis
Session Chair: Luming Zhang |
128 | Attribute Guided Dictionary Learning. Wei Wang, Yan Yan and Nicu Sebe. |
179 | Online Multi-modal Co-indexing and Retrieval for Weakly Supervised Web Image Collections. Lei Meng and Chunyan Miao. |
217 | Weakly Supervised Random Forest for Multi-Label Image Clustering and Segmentation. Yingjie Xia, Qianqian Zhu and Wei Wei. |
223 | Harvesting Multiple Sources for User Profile Learning: a Big Data Study. Aleksandr Farseev, Mohammad Akbari and Tat-Seng Chua. |
Full Paper Posters (note that all the oral papers above will also be presented in the full paper poster session)
Session Chair: Nicu Sebe |
1 | A Privacy-Preserving Bipartite Graph Matching Framework for Multimedia Analysis and Retrieval. Wei-Ta Chu and Feng-Chi Chang. |
18 | Describing Images with Hierarchical Concepts and Object Class Localization. Yahong Han and Guang Li. |
23 | Supervised Multi-scale Locality Sensitive Hashing. Li Weng, I-Hong Jhuo, Miaojing Shi, Meng Sun, Wen-Huang Cheng and Laurent Amsaleg. |
53 | A Novel Visual-Region-Descriptor-based Approach to Sketch-based Image Retrieval. Cheng Jin and Yuejie Zhang. |
63 | Location Prediction of Social Images via Generative Model. Xiaoming Zhang and Zhoujun Li. |
69 | Scalable Multimodal Search with Distributed Indexing by Sparse Hashing. André Mourão and Joao Magalhaes. |
73 | Insight in Image Collections by Multimedia Pivot Tables. Marcel Worring and Dennis Koelma. |
82 | Distribution Regularized Nonnegative Matrix Factorization for Transfer Visual Feature Learning. Yuchen Guo and Guiguang Ding. |
94 | Heterogeneous Semantic Level Features Fusion for Action Recognition. Junjie Cai, Michele Merler, Sharath Pankanti and Qi Tian. |
101 | Social Friend Recommendation Based on Network Correlation and Feature Co-Clustering. Shangrong Huang, Jian Zhang, Shiyang Lu and Xian-Sheng Hua. |
125 | Improving Diversity in Image Search via Supervised Relevance Scoring. Eleftherios Spyromitros-Xioufis, Symeon Papadopoulos, Alexandru Lucian Ginsca, Adrian Popescu, Yiannis Kompatsiaris and Ioannis Vlahavas. |
126 | Unsupervised Distance Learning by Rank Correlation Measures for Image Retrieval. Cesar Okada, Daniel C. G. Pedronette and Ricardo Torres. |
130 | Exploring Pooling Strategies based on Idiosyncrasies of Spatio-Temporal Interest Points. Yuancheng Ye, Xiaodong Yang and Yingli Tian. |
133 | Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning. Jian Wang, Yonghao He, Cuicui Kang, Shiming Xiang and Chunhong Pan. |
158 | Location-Based Parallel Tag Completion for Geo-tagged Social Photo Retrieval. Jiaming Zhang, Shuhui Wang and Qingming Huang. |
161 | Exploiting Spatial Relationship between Scenes for Hierarchical Video Geotagging. Yifang Yin, Luming Zhang and Roger Zimmermann. |
178 | Using Viewer’s Facial Expression and Heart Rate for Sports Video Highlights Detection. Prithwi Chakraborty, Ligang Zhang, Dian Tjondronegoro and Vinod Chandran. |
189 | A Deep Neural Network for Modeling Music. Pengjing Zhang, Xiaoqing Zheng, Wenqiang Zhang, Siyan Li, Sheng Qian, Wenqi He, Shangtong Zhang and Ziyuan Wang. |
201 | Robust Seed Localization and Growing with Deep Convolutional Features for Scene Text Detection. Hailiang Xu and Feng Su. |
214 | Swap Retrieval: Retrieving images of cats when the query shows a dog. Amir Ghodrati, Xu Jia, Marco Pedersoli and Tinne Tuytelaars. |
224 | Graph Learning on K Nearest Neighbours for Automatic Image Annotation. Feng Su and Like Xue. |
227 | Scalable organization of collections of motion capture data via quantitative and qualitative analysis. Songle Chen, Zhengxing Sun and Yan Zhang. |
240 | Content-Based Video Search over 1 Million Videos with 1 Core in 1 Second. Shoou-I Yu, Lu Jiang, Zhongwen Xu, Yi Yang and Alexander Hauptmann. |
248 | Bag-of-Fragments: Selecting and encoding video fragments for event detection and recounting. Pascal Mettes, Jan van Gemert, Spencer Cappallo, Thomas Mensink and Cees Snoek. |
251 | Evaluating Two-Stream CNN for Video Classification. Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang and Xiangyang Xue. |
Short Paper Posters
Session Chair: Marcel Worring |
3 | Content-based Image Retrieval Using Rotation-invariant Histograms of Oriented Gradients. Jinhui Chen, Toru Nakashika, Tetsuya Takiguchi and Yasuo Ariki. |
4 | Augmented Feature Fusion for Image Retrieval System. Yang Zhou, Dan Zeng, Shiliang Zhang and Qi Tian. |
30 | Parallel AP Clustering and Re-ranking for Automatic Image-Text Alignment and Large-Scale Web Image Search. Jianping Fan. |
37 | Accio: A Data Set for Face Track Retrieval in Movies Across Age. Esam Ghaleb, Makarand Tapaswi, Ziad Al-Halah, Hazim Kemal Ekenel and Rainer Stiefelhagen. |
38 | A Two-step Approach to Cross-modal Hashing. Kaiye Wang, Wei Wang, Liang Wang and Ran He. |
39 | Cross-Scenario Eyeglasses Retrieval via EGYPT Model. Xiaoling Gu, Pai Peng, Mengwen Li, Lidan Shou and Gang Chen. |
52 | People News Search via Name-Face Association Analysis. Cheng Jin and Yuejie Zhang. |
57 | Discovering the Latent Similarities of the KNN Graph by Metric Transformation. Zhenzhong Kuang, Zongmin Li, Yujie Liu and Jianping Fan. |
59 | Formation period matters: Towards socially consistent group detection via dense subgraph seeking. Yanhao Zhang, Lei Qin, Shengping Zhang, Hongxun Yao and Qingming Huang. |
60 | Memory vectors for particular object retrieval with multiple queries. Ronan Sicre and Hervé Jégou. |
62 | Semantic-aware Hashing for Social Image Retrieval. Jinhui Tang and Zechao Li. |
67 | Zero-shot Image Categorization via Image Correlation Exploration. Lianli Gao, Jingkuan Song, Junming Shao, Xiaofeng Zhu and Hengtao Shen. |
76 | Deep Bottleneck Feature for Image Classification. Yan Song, Ian McLoughlin and Li-Rong Dai. |
78 | Maximally Visual-Homogeneous Region Detector for Large Scale Image Retrieval. Gang Wang, Ke Gao, Yongdong Zhang and Jintao Li. |
87 | Rapid Clothing Retrieval via Deep Learning of Binary Codes and Hierarchical Search. Kevin Lin, Huei-Fang Yang, Kuan-Hsien Liu, Jen-Hao Hsiao and Chu-Song Chen. |
93 | Information gain study for visual vocabulary construction. Huu Ton Le, Thierry Urruty, Syntyche Gbehounou, Francois Lecellier and Christine Fernandez. |
97 | Discriminative Latent Feature Space Learning for Cross-Modal Retrieval. Xu Tang, Cheng Deng and Xinbo Gao. |
103 | Image Retrieval by User-oriented Ranking. Xueming Qian, Dan Lu and Xiaoxiao Liu. |
105 | Spatial Constraint for Image Location Estimation. Yisi Zhao and Xueming Qian. |
115 | Shape-based Object Matching Using Point Context. Cong Yang, Christian Feinen, Oliver Tiebe, Kimiaki Shirahama and Marcin Grzegorzek. |
117 | Large Scale Image Annotation via Deep Representation Learning and Tag Embedding Learning. Yonghao He, Jian Wang, Cuicui Kang, Shiming Xiang and Chunhong Pan. |
124 | Probabilistic Matrix Factorization With Semantic And Visual Neighborhoods For Image Tag Completion. Dimitrios Rafailidis. |
129 | Exploiting multiple web resources towards collecting positive training samples for visual concept learning. Olga Papadopoulou and Vasileios Mezaris. |
134 | CRMActive: An Active Learning Based Approach For Effective Video Annotation And Retrieval. Moitreya Chatterjee and Anton Leuski. |
135 | Personalized Egocentric Video Summarization for Cultural Experience. Patrizia Varini, Giuseppe Serra and Rita Cucchiara. |
140 | EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval. Jialie Shen, Tao Mei, Dacheng Tao, Xuelong Li and Yong Rui. |
142 | Specific Person Retrieval via Incomplete Text Description. Mang Ye, Chao Liang, Zheng Wang, Qingming Leng, Jun Chen and Jun Liu. |
146 | Combining generic and specific information for cross-modal retrieval. Thi Quynh Nhi Tran, Hervé Le Borgne and Michel Crucianu. |
148 | 3D Sketch-Based 3D Model Retrieval. Bo Li, Yijuan Lu, Azeem Ghumman, Bradley Strylowski, Mario Gutierrez, Safiyah Sadiq, Scott Forster, Natacha Feola and Travis Bugerin. |
151 | Boosting Prediction of Geo-location for Web Images Through Integrating Multiple Knowledge Sources. Hao Kuang, Shiai Zhu and Abdulmotaleb El Saddik. |
162 | Expression Recognition from Visible Images with the Help of Thermal Images. Xiaoxiao Shi, Shangfei Wang and Yachen Zhu. |
164 | Multi-facet Learning using Deep Convolutional Neural Network for Person-Related Categories in Photos. Liangliang Cao, Zhicheng Yan and John R. Smith. |
176 | Sketch-based Image Retrieval via Shape Words. Changcheng Xiao, Changhu Wang, Liqing Zhang and Lei Zhang. |
180 | Multiple Aesthetic Attribute Assessment by Exploiting Relations Among Aesthetic Attributes. Zhen Gao, Shangfei Wang and Qiang Ji. |
181 | Emotion recognition from users' EEG signals using hierarchical Bayesian model with privileged information. Zhen Gao and Shangfei Wang. |
184 | Multi-Label Active Learning with Chi-Square Statistics for Image Classification. Chen Ye, Jian Wu, Victor S. Sheng, Shiquan Zhao, Pengpeng Zhao and Zhiming Cui. |
188 | Multiple Measurements and Joint Dimensionality Reduction for Large Scale Image Search with Short Vectors. Filip Radenovic, Herve Jegou and Ondrej Chum. |
193 | Exploring EEG for Object Detection and Retrieval. Eva Mohedano, Amaia Salvador, Sergi Porta, Xavier Giro-I-Nieto, Kevin McGuinness, Graham Healy and Noel O'Connor. |
221 | Kernel Local Descriptors with Implicit Rotation Matching. Andrei Bursuc, Giorgos Tolias and Herve Jegou. |
229 | Semantic Concept Annotation for User Generated Videos Using Soundtracks. Qin Jin, Junwei Liang, Xixi He, Gang Yang, Jieping Xu and Xirong Li. |
230 | Automatic Image Annotation using Deep Learning Representations. Venkatesh N. Murthy, Subhransu Maji and R. Manmatha. |
231 | “My Day in Review”: Visually Summarising Noisy Lifelog Data. Soumyadeb Chowdhury, Philip J. McParlane, Md. Sadek Ferdous and Joemon Jose. |
232 | Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling. Khalid Ashraf, Benjamin Elizalde, Forrest Iandola, Matthew Moskewicz, Julia Bernd, Gerald Friedland and Kurt Keutzer. |
236 | Learning Binary Codes for Hashing via Feature Decomposition. Xiao-Jiao Mao, Zhen-Fei Ju and Yu-Bin Yang. |
243 | Multimodal Learning with Deep Boltzmann Machine for Emotion Prediction in User Generated Videos. Lei Pang and Chong-Wah Ngo. |
244 | Improving Automatic Name-Face Association using Celebrity Images on the Web. Zhineng Chen, Bailan Feng, Chong-Wah Ngo, Caiyan Jia and Xiangsheng Huang. |
Special Poster Session: Person Search and Verification from Rich Media Data
Session Chair: Nicu Sebe |
96 | End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning. Liliang Zhang, Liang Lin, Xian Wu, Shengyong Ding and Lei Zhang. |
166 | Teaching Video Analytics Based on Human Behavior Mining. Jinxian Qin, Hong Lu, Yaqian Zhou and Heqing Ya. |
254 | Multi-view Face Detection Using Deep Convolutional Neural Networks. Sachin Sudhakar Farfade, Mohammad Saberian and Li-Jia Li. |
Demos
Session Chair: Benoit Huet |
100 | Music Positioning and Annotation For Television Videos. Gang Yang, Jieping Xu and Xirong Li. |
149 | KinectSBR: A Kinect-Assisted 3D Sketch-Based 3D Model Retrieval System. Bo Li, Yijuan Lu, Azeem Ghumman, Bradley Strylowski, Mario Gutierrez, Safiyah Sadiq, Scott Forster, Natacha Feola and Travis Bugerin. |
153 | An Improved System For Real-Time Scene Text Recognition. Haojin Yang, Cheng Wang, Xiaoyin Che, Sheng Luo and Christoph Meinel. |
198 | DigInPix: visual named-entities identification in images and videos. Pierre Letessier, Nicolas Hervé, Alexis Joly, Hakim Nabi, Mathieu Derval and Olivier Buisson. |
256 | Mobile Media Thumbnailing. Yingying Chen, Jinqiao Wang, Jing Liu and Hanqing Lu. |
257 | IdeaPanel: A Large Scale Interactive Sketch-based Image Search System. Changcheng Xiao, Changhu Wang, Liqing Zhang and Lei Zhang. |
258 | A Sparse Ensemble Learning System For Efficient Semantic Indexing. Sheng Tang, Hui Chen, Yu Li, Jun-Bin Xiao and Jin-Tao Li. |
260 | A Multi-Sensory Gesture-Based Occupational Therapy Environment for Controlling Home Appliances. Ahmed Khan, Syed Osama Hussain, Ahmad Qamar, Mohamed Abdur Rahman and Saleh Basalamah. |
261 | Incremental Multimodal Query Construction for Video Search. Shicheng Xu, Huan Li, Xiaojun Chang, Shoou-I Yu, Xingzhong Du, Xuanchong Li, Lu Jiang, Zexi Mao, Zhenzhong Lan, Susanne Burger and Alexander Hauptmann. |
Invited Industry Talks
Session Chair: Jialie Shen |
Title: From Perception to Intelligence
Presenter: Yong Zhao (CTO of DeepGlint) |
Title: Frontier of iQiyi's Big Data Practices
Presenter: Chen Yang (Chief Architect, iQIYI) |
Panel Discussion: Video Search in Big Data Era
Panel Facilitator: Alex Hauptmann
Panelists (tentative): Michael Witbrock, Alan Smeaton, Cees Snoek, Michele Merler |
Abstract:
The popularity of smart mobile devices and social networks has resulted in a huge amount of data being shared on the Internet. In addition to YouTube and Flickr, several hugely popular emerging sites are largely video-based, including SnapChat and Vine. An increasingly high proportion of the information shared on the Internet is now in the form of images and videos. We are entering the era of big data for videos. Given that video data is now available live, in greater quantity and in greater variety, how will this affect the way we conduct research on video search? What metadata is now available and can be used to improve analysis and search? What role is there for content analysis if metadata is widely available? Finally, what novel applications are possible or should be explored to take advantage of this trend?
This panel will examine these issues and discuss directions of video research and applications. |