Scenery of Shanghai

Accepted Papers

Full Papers

1   A Privacy-Preserving Bipartite Graph Matching Framework for Multimedia Analysis and Retrieval
15  Image Classification and Retrieval are ONE
18  Describing Images with Hierarchical Concepts and Object Class Localization
23  Supervised Multi-scale Locality Sensitive Hashing
26  Nonnegative Sparse Neighborhood Propagation
29  Fast Democratic Aggregation and Query Fusion for Image Search
33  Bundling Centre For Landmark Image Discovery
49  Fine-Grained Image Categorization by Localizing Tiny Object Parts from Unannotated Images
51  DeepIndex for Accurate and Efficient Image Retrieval
53  A Novel Visual-Region-Descriptor-based Approach to Sketch-based Image Retrieval
58  Hierarchical Encoding of Binary Descriptors for Image Matching
63  Location Prediction of Social Images via Generative Model
69  Scalable Multimodal Search with Distributed Indexing by Sparse Hashing
71  Facial Action Unit Classification with Hidden Knowledge under Incomplete Annotation
73  MInsight in Image Collections by Multimedia Pivot Tables
74  Social Event Mining in Large Photo Collections
82  Distribution Regularized Nonnegative Matrix Factorization for Transfer Visual Feature Learning
83  Robust and Discriminative Concept Factorization for Image Representation
94  Heterogeneous Semantic Level Features Fusion for Action Recognition
101 Social Friend Recommendation Based on Network Correlation and Feature Co-Clustering
111 Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification
118 Extracting 3D Trajectories of Objects from 2D Videos using Particle Filter
123 Space-time histograms and their application to person re-identification in TV shows
125 Improving Diversity in Image Search via Supervised Relevance Scoring
126 Unsupervised Distance Learning by Rank Correlation Measures for Image Retrieval
127 Effective, Efficient, and Scalable Unsupervised Distance Learning in Image Retrieval Tasks
130 Exploring Pooling Strategies based on Idiosyncrasies of Spatio-Temporal Interest Points
131 Temporal-Order Preserved Dynamic Quantization for Human Action Recognition from Multimodal Sensor Streams
133 Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning
136 Unified YouTube Video Recommendation via Cross-network Collaboration
138 Twin Feature and Similarity Maximal Matching for Image Retrieval
158 Location-Based Parallel Tag Completion for Geo-tagged Social Photo Retrieval
161 Exploiting Spatial Relationship between Scenes for Hierarchical Video Geotagging
173 Fusing Pointwise and Pairwise Labels for Supporting Personalized Image Retrieval
189 A Deep Neural Network for Modeling Music
201 Robust Seed Detection and Growing with Deep Convolutional Features for Scene Text Detection
202 High-Dimensional Indexing by Sparse Approximation
213 To Keep or not to Keep: An Expectation-oriented Photo Selection Method for Personal Photo Collections
214 Swap Retrieval: Retrieving images of cats when the query shows a dog
222 Diffusion-on-Manifold Aggregation of Local Features for Shape-based 3D Model Retrieval
224 Graph Learning on K Nearest Neighbours for Automatic Image Annotation
227 Scalable organization of collections of motion capture data via quantitative and qualitative analysis
240 Content-Based Video Search over 1 Million Videos with 1 Core in 1 Second
241 Bridging the Ultimate Semantic Gap: A Semantic Search Engine for Internet Videos
246 Encoding Concept Prototypes for Video Event Detection and Summarization
247 Discovering Semantic Vocabularies for Cross-Media Retrieval
248 Bag-of-Fragments: Selecting and encoding video fragments for event detection and recounting
249 Latent Factors of Visual Popularity Prediction
251 Building A Deep Learning System for Video Classification: An In-depth Study           
253 Visual Event Summarization on Social Media using Topic Modelling and Graph-based Ranking Algorithms

Short Papers

3   Content-based Image Retrieval Using Rotation-invariant Histograms of Oriented Gradients
4   Augmented Feature Fusion for Image Retrieval System
30  Parallel AP Clustering and Re-ranking for Automatic Image-Text Alignment and Large-Scale Web Image Search
37  Accio: A Data Set for Face Track Retrieval in Movies Across Age
38  A Two-step Approach to Cross-modal Hashing
39  Cross-Scenario Eyeglasses Retrieval via EGYPT Model
52  People News Search via Name-Face Association Analysis
57  Discovering the Latent Similarities of the KNN Graph by Metric Transformation
59  Formation period matters: Towards socially consistent group detection via dense subgraph seeking
60  Memory vectors for particular object retrieval with multiple queries
62  Semantic-aware Hashing for Social Image Retrieval
67  Zero-shot Image Categorization via Image Correlation Exploration
76  Deep Bottleneck Feature for Image Classification
78  Maximally Visual-Homogeneous Region Detector for Large Scale Image Retrieval
87  Rapid Clothing Retrieval via Deep Learning of Binary Codes and Hierarchical Search
93  Information gain study for visual vocabulary construction
97  Discriminative Latent Feature Space Learning for Cross-Modal Retrieval
103 Image Retrieval by User-oriented Ranking
105 Spatial Constraint for Image Location Estimation
115 Shape-based Object Matching Using Point Context
117 Large Scale Image Annotation via Deep Representation Learning and Tag Embedding Learning
124 Probabilistic Matrix Factorization With Semantic And Visual Neighborhoods For Image Tag Completion
129 Exploiting multiple web resources towards collecting positive training samples for visual concept learning
134 CRMActive : An Active Learning Based Approach For Effective Video Annotation And Retrieval
135 Personalized Egocentric Video Summarization for Cultural Experience
140 EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval
142 Specific Person Retrieval via Incomplete Text Description
146 Combining generic and specific information for cross-modal retrieval
148 3D Sketch-Based 3D Model Retrieval
151 Boosting Prediction of Geo-location for Web Images Through Integrating Multiple Knowledge Sources
162 Expression Recognition from Visible Images with the Help of Thermal Images
164 Multi-facet Learning using Deep Convolutional Neural Network for Person-Related Categories in Photos
176 Sketch-based Image Retrieval via Shape Words
180 Multiple Aesthetic Attribute Assessment by Exploiting Relations Among Aesthetic Attributes
181 Emotion recognition from users' eeg signals using hierarchical bayesian model with privileged information
184 Multi-Label Active Learning with Chi-Square Statistics for Image Classification
188 Multiple Measurements and Joint Dimensionality Reduction for Large Scale Image Search with Short Vectors
193 Exploring EEG for Object Detection and Retrieval
221 Kernel Local Descriptors with Implicit Rotation Matching
229 Semantic Concept Annotation for User Generated Videos Using Soundtracks
230 Automatic Image Annotation using Deep Learning Representations
231 “My Day in Review” Visually Summarising Noisy Lifelog Data
232 Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling
236 Learning Binary Codes for Hashing via Feature Decomposition
243 Multimodal Learning with Deep Boltzmann Machine for Emotion Prediction in User Generated Videos
244 Improving Automatic Name-Face Association using Celebrity Images on the Web


100 Music Positioning and Annotation For Television Videos
149 KinectSBR: A Kinect-Assisted 3D Sketch-Based 3D Model Retrieval System
153 An Improved System For Real-Time Scene Text Recognition
198 DigInPix: visual named-entities identification in images and videos
256 Mobile Media Thumbnailing
257 IdeaPanel: A Large Scale Interactive Sketch-based Image Search System
258 A Sparse Ensemble Learning System For Efficient Semantic Indexing
260 A Multi-Sensory Gesture-Based Occupational Therapy Environment for Controlling Home Appliances
261 Incremental Multimodal Query Construction for Video Search

Special Session Papers

96  End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning
128 Attribute Guided Dictionary Learning
166 Teaching Video Analytics Based on Human Behavior Mining
179 Online Multi-modal Co-indexing and Retrieval for Weakly Supervised Web Image Collections
217 Weakly Supervised Random Forest for Multi-Label Image Clustering and Segmentation
223 Harvesting Multiple Sources for User Profile Learning: a Big Data Study
254 Multi-view Face Detection Using Deep Convolutional Neural Networks          

Sponsors and Partners