<?xml version="1.0" encoding="UTF-8" standalone="yes"?><oembed><version><![CDATA[1.0]]></version><provider_name><![CDATA[The Multimedia Commons Initiative]]></provider_name><provider_url><![CDATA[https://multimediacommons.wordpress.com]]></provider_url><author_name><![CDATA[bjernd]]></author_name><author_url><![CDATA[https://multimediacommons.wordpress.com/author/bjernd/]]></author_url><title><![CDATA[Tools and Demos]]></title><type><![CDATA[link]]></type><html><![CDATA[<p>The Multimedia Commons effort includes interactive demos that show how the dataset is being used, along with analysis and retrieval tools to help researchers take advantage of this massive resource. Some of these tools are included in the resources hosted on Amazon Web Services, while other tools and demos arise from independent efforts and are hosted externally.</p>
<table border="0">
<tbody>
<tr>
<td><strong>Jump to:</strong><br />
<a href="#yfccbrowser">YFCC100M Browsers</a><br />
<a href="#mmcsearch">Multimedia Commons Search</a><br />
<a href="#audiocaffe">audioCaffe: Audio Analysis With Deep Neural Nets</a><br />
<a href="#videosearch100m">videosearch100M: Semantic Search</a><br />
<a href="#visualsearch">YFCC100M Visual Search Utility</a><br />
<a href="#misimilarity">MI-File CBIR: Similarity-Based Image Retrieval</a><br />
<a href="#evento">Evento360: Discovering Social Events</a><br />
<a href="#otherresources">Other Demos and Projects</a></td>
<td style="vertical-align:top;" width="260"><a href="https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png" rel="attachment wp-att-369"><img data-attachment-id="369" data-permalink="https://multimediacommons.wordpress.com/featured-tools/flickr_sample_24/" data-orig-file="https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png" data-orig-size="1039,697" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Flickr_Sample_24" data-image-description="" data-image-caption="" data-medium-file="https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?w=300" data-large-file="https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?w=1024" class="wp-image-369" src="https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?w=250" alt="Sample image: Sunset with casino sign" width="250" srcset="https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?w=250 250w, https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?w=500 500w, https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?w=150 150w, https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?w=300 300w" sizes="(max-width: 250px) 100vw, 250px" /></a></td>
</tr>
</tbody>
</table>
<h2><a name="yfccbrowser"></a><strong>YFCC100M Browsers</strong></h2>
<p>Two browsers make it easy to explore the dataset. For example, they let you visualize the distribution of media with particular tags across users, times, and places; view the original media with those tags on Flickr; and download the metadata for your query results. More details are available on a separate <a href="https://multimediacommons.wordpress.com/yfcc100m-browser/">page</a>.</p>
<hr />
<h2><a name="audiocaffe"></a><strong>audioCaffe: Audio Analysis With Deep Neural Nets</strong></h2>
<p>The audioCaffe audio-content analysis tool and a demonstration experiment may be found in the <code>tools/audioCaffe/</code> directory in our <a href="http://multimedia-commons.s3-website-us-west-2.amazonaws.com/" target="_blank" rel="noopener noreferrer">AWS S3 data store</a>, and are also included in <a href="https://s3-us-west-2.amazonaws.com/multimedia-commons/tools/etc/MultimediaCommons-audioCaffe-v0.2.template" target="_blank" rel="noopener noreferrer">the Multimedia Commons CloudFormation Template</a>. The demonstration experiment, a MED-ium Cup of audioCaffe, uses data from the YLI Multimedia Event Detection (MED) subcorpus. It will give you a taste of what you can do with a big corpus of computed audio features like YLI and a flexible set of analysis tools.</p>
<p>The directory also includes a build of audioCaffe; you can <a href="https://github.com/ashrafk/audioCaffeInitial" target="_blank" rel="noopener noreferrer">check for updates at GitHub</a>. audioCaffe is a deep neural net-based audio content analysis tool that leverages the deep-learning framework Caffe. audioCaffe is an open-source resource being developed as part of the <a href="http://multimedia.icsi.berkeley.edu/scalable-big-data-analysis/smash/" target="_blank" rel="noopener noreferrer">SMASH</a> project, which aims to provide a single software framework for a variety of content-analysis tasks (speech recognition, audio analysis, video event detection, etc.).</p>
<p><strong>Citation:</strong> Khalid Ashraf, Benjamin Elizalde, Forrest Iandola, Matthew Moskewicz, Gerald Friedland, Kurt Keutzer, and Julia Bernd. 2015. Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling. In <i>Proceedings of the 5th ACM International Conference on Multimedia Retrieval (ICMR &#8217;15)</i>. New York: ACM, 611-614.</p>
<p><strong>Cheers to:</strong> This cup of audioCaffe was grown, roasted, ground, and brewed by Khalid Ashraf, Kurt Keutzer, Gerald Friedland, Benjamin Elizalde, and Jia Yangqing.</p>
<p><strong>Funding:</strong> AudioCaffe is funded by a National Science Foundation grant for the SMASH project (grant <a href="http://www.nsf.gov/awardsearch/showAward?AWD_ID=1251276" target="_blank" rel="noopener noreferrer">IIS-1251276</a>). (Any opinions, findings, and conclusions expressed on this website are those of the individual researchers and do not necessarily reflect the views of the funding agency.)</p>
<p><strong>Contacts:</strong> Questions can be directed to <a href="mailto:ashrafkhalid@berkeley.edu" target="_blank" rel="noopener noreferrer">ashrafkhalid[chez]berkeley[stop]edu</a>.</p>
<hr />
<h2><a name="videosearch100m"></a><strong>videosearch100M: Semantic Search</strong></h2>
<p>A team at Carnegie Mellon University has been building a video search system based on matching (relatively) simple semantic concepts like <i>dancing</i> with visual and motion features (including convolutional neural network features and dense trajectories). They added features and annotations extracted from the ~800,000 YFCC100M videos to their system, and have released those resources publicly.</p>
<ul>
<li><strong><a href="https://sites.google.com/site/videosearch100m/ads" target="_blank" rel="noopener noreferrer">Check out the videosearch100m demo here.</a></strong></li>
<li><strong><a href="https://sites.google.com/site/videosearch100m/api" target="_blank" rel="noopener noreferrer">Find out how to use the videosearch100m API here.</a></strong></li>
<li><strong><a href="https://sites.google.com/site/videosearch100m/download" target="_blank" rel="noopener noreferrer">Get the features and concept annotations here.</a></strong></li>
</ul>
<p>Watch this space for updates on (hopefully) getting the full set of new features on AWS!</p>
<hr />
<h2><a name="visualsearch"></a><strong>YFCC100M Visual Search Utility</strong></h2>
<p>Researchers at the Information Technologies Institute in Thermi, Greece, have produced a nearest-neighbor-based visual search utility for the YFCC100M, using an IVFPQ index built on SURF+VLAD features.</p>
<p><strong><a href="http://mklab.iti.gr/project/visual-features-and-search-index-flickr-100m-corpus" target="_blank" rel="noopener noreferrer">Check out the YFCC100M Visual Search Utility here.</a></strong></p>
<p>Available in <a href="http://multimedia-commons.s3-website-us-west-2.amazonaws.com/" target="_blank" rel="noopener noreferrer">the Multimedia Commons S3 data store on AWS</a>:</p>
<ul>
<li>The index file and the learning files for the Visual Search Utility: <code>features/features/image/vgg-vlad-yfcc/vlad/yfcc100m_ivfpq.zip</code> and <code>/learning_files.zip</code></li>
<li>The SURF+VLAD features used as the basis for the Utility: <code>features/features/image/vgg-vlad-yfcc/vlad/full/</code> &#8212; see the <a href="http://www.multimediacommons.org/other-feature-corpora/">Other Feature Corpora</a> page for a description.</li>
</ul>
<p><strong>Citation:</strong> Adrian Popescu, Eleftherios Spyromitros-Xioufis, Symeon Papadopoulos, Hervé Le Borgne, and Yiannis Kompatsiaris. Towards an Automatic Evaluation of Retrieval Performance With Large Scale Image Collections. In <i>Proceedings of the ACM Multimedia 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions (MMCommons &#8217;15), Brisbane, Australia, October 2015</i>. <a href="http://users.auth.gr/espyromi/publications/papers/MMCOMMONS2015.pdf" target="_blank" rel="noopener noreferrer">PDF of Preprint.</a></p>
<hr />
<h2><a name="misimilarity"></a><strong>MI-File CBIR: Similarity-Based Image Retrieval</strong></h2>
<p>A research group at the Istituto di Scienza e Tecnologie dell’Informazione “A. Faedo” (ISTI) has developed a similarity-based search engine for the YFCC100M that retrieves YFCC100M images based on visual similarity to a query image and predicts the most appropriate tag. The system uses deep neural network features and the Metric Inverted File technique.</p>
<p><strong><a href="http://mifile.deepfeatures.org/" target="_blank" rel="noopener noreferrer">Check out the MI-File CBIR Search Engine and Tag Predictor here.</a></strong></p>
<p>The Hybrid-CNN features that were used as the basis for the search engine are available in <a href="http://multimedia-commons.s3-website-us-west-2.amazonaws.com/" target="_blank" rel="noopener noreferrer">the Multimedia Commons S3 data store on AWS</a>, under <code>features/features/image/hybrid-cnn/</code>. See the <a href="http://www.multimediacommons.org/other-feature-corpora/">Other Feature Corpora</a> page for a description.</p>
<p><strong>For More Information:</strong> See <a href="http://www.deepfeatures.org/" target="_blank" rel="noopener noreferrer">the ISTI Deep Feature Corpus website</a>.</p>
<hr />
<h2><a name="evento"></a><strong>Evento360: Discovering Social Events</strong></h2>
<p>The YFCC100M images were used for <a href="http://www.acmmm.org/2015/call-for-contributions/multimedia-grand-challenges/" target="_blank" rel="noopener noreferrer">a Grand Challenge sponsored by Yahoo</a> at ACM Multimedia 2015. Participants were asked to build systems to automatically detect particular social or cultural events in the YFCC100M dataset, analyze their structure, and summarize them. The Grand Challenge provided an example of how having a freely available web-scale multimedia dataset like YFCC100M can move research forward at the level of a whole community.</p>
<p>ICSI researchers participated in the Challenge, producing a retrieval system called &#8220;Evento360&#8221; that uses hierarchical clustering based on both visual and audio information, as well as clustering of metadata. <i>Link to demo coming soon! We&#8217;re having technical difficulties.</i></p>
<hr />
<h2><strong>ImageSnippets: A System for Creating Structured, Transportable Metadata for Your Images</strong></h2>
<p><a href="http://www.imagesnippets.com/" target="_blank" rel="noopener noreferrer">ImageSnippets</a> is a new kind of software tool that lets you compose and attach machine-readable descriptions to images, using cutting-edge Semantic Web and Linked Data technology and standards.</p>
<p>Once created, these descriptions have many uses: apps can do more meaningful searching for images in an image corpus; your described images can be found on the Web in powerful new ways, and seamlessly incorporated into other linked data tools; and your image and copyright information can be held secure on protected servers, safe from metadata stripping in social media. But first you have to create the descriptions, which has been a problem. ImageSnippets makes this easy.</p>
<p>ImageSnippets is a new kind of digital asset management system built on principles from the semantic web. We are a group of designers, cognitive scientists, programmers, and image makers interested in taking image management to a whole new level.</p>
<p>The lightweight image ontology (LIO) used by ImageSnippets was designed by Patrick Hayes and Margaret Warren. It is open and freely usable, and can be found at <a href="http://lov.okfn.org/dataset/lov/vocabs/lio" target="_blank" rel="noopener noreferrer">http://lov.okfn.org/dataset/lov/vocabs/lio</a>.</p>
<p>The ImageSnippets triple-store of image data can be accessed at <a href="http://datahub.io" target="_blank" rel="noopener noreferrer">datahub.io</a>.</p>
<hr />
<h2><a name="otherresources"></a><strong>Other Demos and Projects</strong></h2>
<p><strong>Demos and Tools Using Multimedia Commons Data:</strong></p>
<ul>
<li>The YFCC100M images were used by researchers at University of North Carolina as the basis for <a href="http://www.cs.unc.edu/~jheinly/reconstructing_the_world.html" target="_blank" rel="noopener noreferrer">a tool that creates 3D reconstructions of landmarks</a> from diverse sets of user-generated images.</li>
<li>The ISI Foundation developed <a href="http://www.datainterfaces.org/projects/flickr/" target="_blank" rel="noopener noreferrer">Flickr Cities</a>, a tool for visualizing what types of images get captured in particular cities at different times and seasons.</li>
</ul>
<p>The projects listed here form only a sample of what is available!</p>
<p><strong>Research in the Multimedia Commons:</strong></p>
<ul>
<li>The MediaEval Placing Task is using the YLI-GEO dataset (with features, YFCC100M metadata, and original media) for its 2014, 2015, and 2016 benchmarks. Check out: <a href="http://ceur-ws.org/Vol-1263/" target="_blank" rel="noopener noreferrer">2014 Working Notes Proceedings</a> | <a href="http://ceur-ws.org/Vol-1436/" target="_blank" rel="noopener noreferrer">2015 Working Notes Proceedings</a></li>
<li>The first Multimedia COMMONS workshop featured research that used the YFCC100M dataset and other Multimedia Commons features, annotations, and resources. Check out <a href="https://dl.acm.org/citation.cfm?id=2814815" target="_blank" rel="noopener noreferrer">the Proceedings of MMCommons&#8217;15</a>.</li>
<li>Find <a href="https://scholar.google.com/scholar?q=yfcc100m" target="_blank" rel="noopener noreferrer">other papers referencing the YFCC100M</a>.</li>
</ul>
]]></html><thumbnail_url><![CDATA[https://multimediacommons.files.wordpress.com/2015/12/flickr_sample_24.png?fit=440%2C330]]></thumbnail_url><thumbnail_width><![CDATA[440]]></thumbnail_width><thumbnail_height><![CDATA[295]]></thumbnail_height></oembed>