Tag Archives: Multimedia

CrowdMM 2014 @ ACM MM Call for Papers

The power of crowds – leveraging a large number of human contributors and the capabilities of human computation – has enormous potential to address key challenges in the area of multimedia research. Crowdsourcing offers a time- and resource-efficient method for collecting large volumes of input for system design and evaluation, making it possible to optimize multimedia systems more rapidly and to address human factors more effectively.

At present, crowdsourcing remains notoriously difficult to exploit effectively in multimedia settings: the challenge arises from the fact that a community of users or workers is a complex and dynamic system highly sensitive to changes in the form and the parameterization of their activities.

The third CrowdMM workshop takes place in Orlando, FL, right along ACM Multimedia 2014. For more information, topics and important dates visit: http://www.crowdmm.org/call-for-papers/

CfP: ACM MMSys 2014 Dataset Track

The ACM Multimedia Systems conference (http://www.mmsys.org) provides a forum for researchers, engineers, and scientists to present and share their latest research findings in multimedia systems. While research about specific aspects of multimedia systems is regularly published in the various proceedings and transactions of the networking, operating system, real-time system, and database communities, MMSys aims to cut across these domains in the context of multimedia data types. This provides a unique opportunity to view the intersections and interplay of the various approaches and solutions developed across these domains to deal with multimedia data types. Furthermore, MMSys provides an avenue for communicating research that addresses multimedia systems holistically.

As an integral part of the conference since 2011 2012, the Dataset Track provides an opportunity for researchers and practitioners to make their work available (and citable) to the multimedia community. MMSys encourages and recognizes dataset sharing, and seeks contributions in all areas of multimedia (not limited to MM systems). Authors publishing datasets will benefit by increasing the public awareness of their effort in collecting the datasets.

In particular, authors of datasets accepted for publication will receive:

  • Dataset hosting from MMSys for at least 5 years
  • Citable publication of the dataset description in the proceedings published by ACM
  • 15 minutes oral presentation time at the MMSys 2014 Dataset Track

All submissions will be peer-reviewed by at least two members of the technical program committee of the MMSys 2014. Datasets will be evaluated by the committee on the basis of the collection methodology and the value of the dataset as a resource for the research community.

Submission Guidelines 

Authors interested in submitting a dataset should

(A) Make their data available by providing a public URL for download

(B) Write a short paper describing:

  1. motivation for data collection and intended use of the data set,
  2. the format of the data collected, 
  3. the methodology used to collect the dataset, and 
  4. basic characterizing statistics from the dataset.

Papers should be at most 6 pages long (in PDF format) prepared in the ACM style and written in English.

Important dates

  • Data set paper submission deadline: November 11, 2021
  • Notification: December 20, 2021
  • MMSys conference : March 19 - 21, 2014

MMsys Datasets

Previous accepted datasets can be accessed at

  • http://traces.cs.umass.edu/index.php/MMsys/MMsys (2013)
  • http://web.cs.wpi.edu/~claypool/mmsys-dataset/ (2011-2012)


For further queries and extra information, please contact us at mlux@itec.uni-klu.ac.at. Most recent information can be found on http://www.mmsys.org

2021-07-07 (ml): Updated URLs and “2011”

CBMI 2013 — Deadline extended to March 11, 2021

The 11th International Content Based Multimedia Indexing Workshop is to bring together the various communities involved in all aspects of content-based multimedia indexing, retrieval, browsing and presentation. Following the ten successful previous events of CBMI (Toulouse 1999, Brescia 2001, Rennes 2003, Riga 2005, Bordeaux 2007, London 2008, Chania 2009, Grenoble 2010, Madrid 2011, and Annecy 2012), the University of Pannonia, Hungary organizes the 11th Context Based Multimedia Indexing Workshop on June 17-19 2013 in the historical town of Veszprém, Hungary, near the spectacular Lake Balaton. The workshop will host invited keynote talks and regular, special and demo sessions with contributed research papers.

For more information see http://cbmi2013.mik.uni-pannon.hu/

Call for Papers: WIAMIS 2013: The 14th International Workshop on Image and Audio Analysis for Multimedia Interactive Services

Topics of interest include, but are not limited to:

– Multimedia content analysis and understanding
– Content-based browsing, indexing and retrieval of images, video and audio
– Advanced descriptors and similarity metrics for multimedia
– Audio and music analysis, and machine listening
– Audio-driven multimedia content analysis
– 2D/3D feature extraction
– Motion analysis and tracking
– Multi-modal analysis for event recognition
– Human activity/action/gesture recognition
– Video/audio-based human behavior analysis
– Emotion-based content classification and organization
– Segmentation and reconstruction of objects in 2D/3D image sequences
– 3D data processing and visualization
– Content summarization and personalization strategies
– Semantic web and social networks
– Advanced interfaces for content analysis and relevance feedback
– Content-based copy detection
– Analysis and tools for content adaptation
– Analysis for coding efficiency and increased error resilience
– Multimedia analysis hardware and middleware
– End-to-end quality of service support
– Multimedia analysis for new and emerging applications
– Advanced multimedia applications

Important dates:

- Proposal for Special Sessions: 4th January 2013
- Notification of Special Sessions Acceptance: 11th January 2013
- Paper Submission: 8th March 2013
- Notification of Papers Acceptance: 3rd May 2013
- Camera-ready Papers: 24th May 2013

See http://wiamis2013.wp.mines-telecom.fr/ for more information.

Nice but out of reach: popular multimedia platforms

Netflix was reported last year to be the source of nearly 30% of the North American internet backbone traffic. Well that’s impressive, but that’s something that many non North Americans can’t understand … and there’s a simple reason for that: the service is not available in many countries. Several well known and well received services are restricted to a range of IP adresses that are considered in a geographic location where users have access to this services. Here is a small but still interesting list of services that have obviously impact on the usage of the internet, but cannot be accessed in many European countries.

  • Netflix - major video streaming service (subscription based)
  • Pandora - music streaming service / adaptive online radio (ad supported)
  • Hulu - major video streaming service of already aired TV content (ad supported)
  • Vevo - music video streaming service (ad supported). Most of the music videos on Vevo are available on YouTube for Austrians, but most of these music videos are not accessible of Germans.
  • NBC - video streaming service of already aired NBC TV content.
  • ABC - video streaming service of already aired ABC TV content.


Lire and Lire Demo v 0.9 released

I just released Lire and Lire Demo in version 0.9 on sourceforge.net. Basically it’s the alpha version with additional speed and stability enhancements for bag of visual words (BoVW) indexing. While this has already been possible in earlier versions I re-furbished vocabulary creation (k-means clustering) and indexing to support up to 4 CPU cores. I also integrated a function to add documents to BoVW indexes incrementally. So a list of major changes since Lire 0.8 includes

  • Major speed-up due to change and re-write of indexing strategies for local features
  • Auto color correlation and color histogram features improved
  • Re-ranking filter based on global features and LSA
  • Parallel bag of visual words indexing and search supporting SURF and SIFT including incremental index updates (see also in the wiki)
  • Added functionality to Lire Demo including support for new Lire features and a new result list view

Download and try:

  • Source and binaries (or as tar.bz2)
  • Lire Demo

MMM 2012 Call for Special Session Proposals

International Conference on Multimedia Modeling 2012
Jan. 4-6, 2012, Klagenfurt, Austria

The International MultiMedia Modeling Conference (MMM) is a leading international conference for researchers and industry practitioners to share their new ideas, original research results and practical development experiences from all MMM related areas. MMM2012 welcomes proposals for special sessions focusing on specific new challenges in multimedia research. Topics of interest include, but are not limited to:

  • 3D object and face retrieval
  • Annotation and multimedia metadata
  • Cross-modal and cross-media analysis and modeling
  • Events and actions in multimedia
  • Multimedia in interactive entertainment
  • Music and audio content analysis
  • Modeling user context in multimedia retrieval

Also the topics mentioned in the conference call for papers in the fields of multimedia content analysis, multimedia signal processing and communications, and multimedia applications and services (see also http://mmm2012.org/call-for-papers/) are of interest for the special session.

A typical MMM special session features 5-6 contribution discussing the proposed topic. The proposal should include the following information:

  • Tentative title of the proposed special session
  • Names and affiliations of the organizers (including brief bio and contact information)
  • Session abstract (statement of the significance of topic)
  • List of potential contributors (together with tentative paper titles) who agree to submit a paper if the proposal is accepted

Proposals will be evaluated based on the timeliness and significance of the topic, as well as the qualifications of the organizers and the tentative papers proposed.

Papers of accepted special sessions need to be submitted using the MMM2012 conference submission system. Special session organizers will be responsible for managing the review process of the papers submitted to their special sessions. All special session papers will be included in the conference proceedings.

Important dates:

  • Proposal submission: June 6, 2011
  • Notifications: June 20, 2011
  • Papers submission: July 22, 2011

Special session co-chairs:

  • Marco Bertini, Università di Firenze, Italy, bertini<at>dsi.unifi.it
  • Mathias Lux, Klagenfurt University, Austria, mlux<at>itec.uni-klu.ac.at

For more information, please visit http://www.mmm2012.org/

Final Call for Papers: Special Issue on Searching Speech

ACM Transactions on Information Systems is soliciting contributions to a special issue on the topic of “Searching Speech”. The special issue will be devoted to algorithms and systems that use speech recognition and other types of spoken audio processing techniques to retrieve information, and, in particular, to provide access to spoken audio content or multimedia content with a speech track.

Submission Deadline: 1 March 2021

The field of spoken content indexing and retrieval has a long history dating back to the development of the first broadcast news retrieval systems in the 1990s. More recently, however, work on searching speech has been moving towards spoken audio that is produced spontaneously and in conversational settings. In contrast to the planned speech that is typical for the broadcast news domain, spontaneous, conversational speech is characterized by high variability and the lack of inherent structure. Domains in which researchers face such challenges include: lectures, meetings, interviews, debates, conversational broadcast (e.g., talk-shows), podcasts, call center recordings, cultural heritage archives, social video on the Web, spoken natural language queries and the Spoken Web.
We invite the submission of papers that describe research in the following areas:

  • Integration of information retrieval algorithms with speech recognition and audio analysis techniques
  • Interfaces and techniques to improve user interaction with speech collections
  • Indexing diverse, large scale collections
  • Search effectiveness and efficiency, including exploitation of additional information sources

For more information see http://tois.acm.org/announcement.html

CfP Workshop on Multimedia on the Web 2011

in conjunction with i-Know and i-Semantics 2011
8th Sept. 2011, Graz, Austria

Streaming video has recently surpassed peer-to-peer networks in terms of network capacity hunger. Reports estimate a share of 40% of peak network capacity dedicated to entertainment, mostly streaming video. A large share of this traffic originates from web based services. YouTube alone takes up to 8% of the prime time internet traffic. So multimedia on the web is currently a big issue. While transmission currently works in a best effort system, multimedia information system on the web are far from being perfect. Retrieval, annotation, validated and useful metadata, reliable and trusted services, and user interaction and context-based adaptation are still under discussion and allow improvement. Currently, the Web itself faces dramatic changes, looking for example at the spread of social networks, Linked Data or the impact of HTML5 or WebM. These activities also have a deep effect on multimedia data and content provider. Following this, we aim to bring together researchers from the area of multimedia and the web to discuss innovative ideas and new directions in this workshop.

Continue reading

ACM MM 2010 Open Source Competition started

Currently I’m sitting in this year’s ACM Multimedia open source software competition sessions. Marco Bertini is our chair and we have already seen openSMILE and the Open SVC Decoder. The openSMILE framework is something I’ll definitely revisit as it allows for emotion classification in speech :). Currently the Sonic Visualizer is shown.