The 2nd ACM International Workshop on Interactive Multimedia on Mobile and Portable Devices (IMMPD2012)

In Conjunction with ACM Multimedia 2012, Nara, Japan

Keynote Speaker

Prof. Winston Hsu, National Taiwan University, Taipei, Taiwan

Title: Emerging Challenges and Opportunities in Exploiting Mobile Photos and Videos

Abstract: The prevalence of mobile cameras and social media sharing services has drastically increased the volume of photos and videos, which poses a new challenge for developing efficient retrieval, mining, and visualization approaches. Meanwhile, mobile devices have become almost a must in daily life for capturing important moments, browsing the photos and videos shared by family and friends, or working as Q&A devices for understanding objects of interest (e.g., landmarks, products, video programs, or restaurant reviews) by taking a snapshot and searching for related information. In this talk, we will first look into the emerging opportunities and promising solutions for manipulating such large-scale data through mobile devices. We will point out the properties that we can leverage from mobile devices and go through the challenges (e.g., small screen size, low bandwidth, and limited storage and computing power), which pose new constraints on photo/video content analysis and retrieval on the emerging mobile platform. Instead of treating user-generated photos and videos as independent workloads, we will also demonstrate exciting applications in mobile recommendation and human activity mining that arise from analyzing this sheer amount of photos and videos in a holistic manner; such media generally record people's activities, particularly those most memorable to the users. Finally, we will review emerging industrial and academic research problems and pilot experiments over million-scale photo/video collections.

Bio: Dr. Winston Hsu is an active researcher dedicated to novel algorithms and systems for ultra-large-scale image/video retrieval, social media mining/recommendation, multimedia analytics, and mobile and cloud-based multimedia applications. He is keen on turning advanced research into business deliverables. He has been recognized with technical awards and best paper awards in the multimedia research community. He has delivered keynote speeches and given several state-of-the-art tutorials at top conferences. He serves on the technical committees of top conferences (ACM Multimedia, SIGIR, and WWW) and on the organizing committees of ACM Multimedia and ICME. He is on the Editorial Boards of IEEE Multimedia Magazine and the Journal of Multimedia (JMM). He is an Associate Professor in the Department of Computer Science and Information Engineering, NTU, and the founder of the MiRA (Multimedia indexing, Retrieval, and Analysis) Research Group in the Communication and Multimedia Lab (CMLab). He received his Ph.D. degree (2007) from Columbia University, New York. Before that, he worked for years as a founding engineer at the multimedia software company CyberLink Corp., where he served as Engineer, Project Leader, and R&D Manager. He was recently awarded the 2011 Ta-You Wu Memorial Award, a prestigious national recognition for young researchers.

Technical Program

13:30 - 13:35  Welcome
Minoru Etoh, NTT DOCOMO, Japan

13:35 - 14:30  Keynote: Emerging Challenges and Opportunities in Exploiting Mobile Photos and Videos
Winston Hsu, National Taiwan University, Taipei, Taiwan

Session 1: Oral Session 1 (14:30 - 15:30)
Session Chair: Minoru Etoh, NTT DOCOMO, Japan

14:30 - 14:50  Enabling Portable Animation Browsing by Transforming Animations into Comics
Wei-Ta Chu, National Chung Cheng University, Taiwan
Hsing-Han Wang, National Chung Cheng University, Taiwan

14:50 - 15:10  Error-tolerant Interactive Image Segmentation by Using Dynamic and Iterated Graph-Cuts
Ozan Sener, Middle East Technical University
Kemal Ugur, Nokia Research Center
Aydin Alatan, Middle East Technical University

15:10 - 15:30  Augmented Poselets for Human Body Pose Inference by a Probabilistic Graphical Model
Pol Cirujeda, Universitat Pompeu Fabra, Spain
Xavier Binefa, Universitat Pompeu Fabra, Spain

15:30 - 16:00  Coffee Break

Session 2: Oral Session 2 (16:00 - 17:00)
Session Chair: Minoru Etoh, NTT DOCOMO, Japan

16:00 - 16:20  Mining for Motivation: Using a single wearable accelerometer to detect people’s interests
Hayley Hung, University of Amsterdam, Netherlands
Gwenn Englebienne, University of Amsterdam, Netherlands

16:20 - 16:40  Real-time Mobile Recipe Suggestion System using Food Image Recognition
Takum Maruyama, Univ. Electro-Commu., Japan
Keiji Yanai, Univ. Electro-Commu., Japan

16:40 - 17:00  Effective browsing of long audio recordings
Camille Goudeseune, University of Illinois, USA

Call for Papers

With the development of silicon technologies, mobile and portable electronic devices, such as personal computers, mobile phones, digital cameras, and PDAs, have become ubiquitous in people's daily lives. These devices provide multimedia sources for entertainment, communication, and so on. Designing user interfaces for these products that enable natural, intuitive, and fun interaction is one of the main challenges facing the multimedia community.

Considering that mobile and portable devices are usually equipped with multiple sensors (e.g., camera and microphone), how to employ multimodal information for interaction has recently received much attention in both academia and industry. But interactive multimedia is still an under-explored field. Many challenges exist when moving to multimodal interaction. For example, how can we annotate and search the huge amounts of data acquired with multiple sensors, especially in unconstrained end-user environments? How can we effectively extract and select representative multimedia features for human behavior recognition? And how should we select the fusion strategy for multimodal data in a given application? To address these challenges, we must adapt existing approaches or find new solutions suitable for multimedia interaction on mobile and portable devices.

This workshop will bring together researchers from both academia and industry in domains including computer vision, audio and speech processing, machine learning, pattern recognition, communications, human-computer interaction, and media technology to share and discuss recent advances in interactive multimedia. Topics include, but are not limited to:

  • Multimedia description and markup
  • Multimedia representation and annotation
  • Multimedia search and retrieval
  • Presence and environment sensing
  • Face detection, tracking, and recognition
  • Hand detection, tracking, and recognition
  • Emotion/mood recognition
  • Gesture/action/activity recognition
  • Audio-visual recognition and interaction
  • Novel interaction (accelerometer, touch screen, haptics, voice, etc.)
  • Multimodal data modeling and fusion
  • Multimedia content adaptation
  • Social media computing and interaction 
  • Context-aware services
  • Interaction with depth-sensing (e.g., Kinect)

Important Dates

  • Paper submission deadline: 07 July 2012 (extended)
  • Notification of acceptance: 30 July 2012
  • Camera-ready due: 15 August 2012
  • Workshop: 2 November 2012

Workshop Chairs

  • Ling Shao, The University of Sheffield, UK
  • Caifeng Shan, Philips Research, The Netherlands
  • Minoru Etoh, NTT DOCOMO, Japan

Technical Program Committee

  • Xavier Binefa, University of Barcelona, Spain
  • Yun (Raymond) Fu, SUNY at Buffalo, USA
  • Ling Guan, Ryerson University, Canada
  • Wolfgang Hurst, Utrecht University, The Netherlands   
  • Winston Hsu, National Taiwan University, Taiwan
  • Xiaoyi Jiang, University of Munster, Germany
  • Yu-Gang Jiang, Fudan University, China
  • Shihong Lao, Omron, Japan
  • Alexander C. Loui, Kodak Research Labs, USA
  • Anton Nijholt, University of Twente, The Netherlands
  • Yoichi Sato, University of Tokyo, Japan
  • Ling Shao, The University of Sheffield, UK
  • Jinhui Tang, Nanjing University of Science and Technology, China
  • Kari Pulli, NVIDIA, USA
  • Meng Wang, Hefei University of Technology, China
  • Ning Xu, Dolby Laboratories, USA
  • Hui Zhang, United International College, China
  • Wei-Shi Zheng, Sun Yat-Sen University, China

Paper Submission

  • When submitting a manuscript to this workshop, the authors acknowledge that no paper substantially similar in content has been submitted to another conference, workshop, or journal.
  • Full papers may be up to 6 pages (in ACM format). All papers must follow the general paper submission instructions of ACM Multimedia 2012.
  • Papers must be submitted in PDF format using the ACM Multimedia submission site.

Review and Publication

  • Each submission will be reviewed by at least three TPC members and/or external reviewers for originality, significance, clarity, soundness, relevance, and technical content.
  • The review process will be double-blind.
  • Accepted papers will be published by ACM in the workshop proceedings together with the ACM Multimedia 2012 proceedings. Authors of accepted papers will be invited to submit extended versions to a special issue of a top journal or an edited book.

Contacts

  • Ling Shao (ling dot shao at sheffield dot ac dot uk)
  • Caifeng Shan (caifeng dot shan at gmail dot com)