<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SemanticMetadata.net &#187; Software</title>
	<atom:link href="http://www.semanticmetadata.net/tag/software/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.semanticmetadata.net</link>
	<description></description>
	<lastBuildDate>Wed, 11 Jan 2012 16:35:54 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Searching with Lire in big datasets</title>
		<link>http://www.semanticmetadata.net/2011/10/27/searching-with-lire-in-big-datasets/</link>
		<comments>http://www.semanticmetadata.net/2011/10/27/searching-with-lire-in-big-datasets/#comments</comments>
		<pubDate>Thu, 27 Oct 2011 09:17:34 +0000</pubDate>
		<dc:creator>Mathias Lux</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Lire]]></category>
		<category><![CDATA[tricks]]></category>

		<guid isPermaLink="false">http://www.semanticmetadata.net/?p=798</guid>
		<description><![CDATA[Having received several complaints about the slowness of Lire when searching in 100k+ documents, I took the time to write a short how-to explaining approaches for searching in (relatively) big data sets. Lire can create indexes with lots of different features (descriptors, like RGB color histograms or CEDD). While this [...]]]></description>
			<content:encoded><![CDATA[<p>Having received several complaints about the slowness of Lire when searching in 100k+ documents, I took the time to write a short how-to explaining approaches for searching in (relatively) big data sets.</p>
<p>Lire can create indexes with lots of different features (descriptors, like RGB color histograms or CEDD). While this allows flexibility at search time, as we can select the feature when we create a query, the index tends to get bigger and bigger and searches take longer and longer.</p>
<p>With a data set of 121,379 images, the index created with the features selected by default in Lire Demo has a size of 14.3 <acronym title="Gigabyte">GB</acronym> on disk. In contrast, an index storing just the CEDD feature along with the image identifier has a size of 29 <acronym title="Megabyte">MB</acronym>.</p>
<p>As the index grows, linear search also gets slower. With the index stripped down to the CEDD feature and the identifier, searching takes roughly 0.33 seconds (on an AMD quad-core computer with 4 GB RAM and Java 1.7), while searching the big index takes 7 minutes and 3 seconds.</p>
<p style="text-align: center;"><img class="aligncenter" title="charts" src="http://www.semanticmetadata.net/wiki/lib/exe/fetch.php?media=lire:search-big.png" alt="" width="451" height="135" /></p>
<p>So if you want to index and search big data sets (&gt; 100,000 images, for instance), I recommend that you</p>
<ul>
<li>
<div>select which features you need,</div>
</li>
<li>
<div>create the index with a minimum set of features,</div>
</li>
<li>
<div>possibly split the index per feature and select the index on the fly instead of the feature, and</div>
</li>
<li>
<div>load the index into RAM.</div>
</li>
</ul>
<p>For more on loading the index into RAM and the option to use local features, read on in the <a href="http://www.semanticmetadata.net/wiki/doku.php?id=lire:manydocs">developer wiki</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.semanticmetadata.net/2011/10/27/searching-with-lire-in-big-datasets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lire and Lire Demo v 0.9 released</title>
		<link>http://www.semanticmetadata.net/2011/10/20/lire-and-lire-demo-v-0-9-released/</link>
		<comments>http://www.semanticmetadata.net/2011/10/20/lire-and-lire-demo-v-0-9-released/#comments</comments>
		<pubDate>Thu, 20 Oct 2011 11:37:23 +0000</pubDate>
		<dc:creator>Mathias Lux</dc:creator>
				<category><![CDATA[Dev]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Lire]]></category>
		<category><![CDATA[LireDemo]]></category>
		<category><![CDATA[Release]]></category>
		<category><![CDATA[Retrieval]]></category>

		<guid isPermaLink="false">http://www.semanticmetadata.net/?p=780</guid>
		<description><![CDATA[I just released Lire and Lire Demo in version 0.9 on sourceforge.net. Basically it&#8217;s the alpha version with additional speed and stability enhancements for bag of visual words (BoVW) indexing. While this has already been possible in earlier versions, I refurbished vocabulary creation (k-means clustering) and indexing to support up to 4 CPU cores. I [...]]]></description>
			<content:encoded><![CDATA[<p>I just released Lire and Lire Demo in version 0.9 on sourceforge.net. Basically it&#8217;s the alpha version with additional speed and stability enhancements for <em>bag of visual words (BoVW)</em> indexing. While this has already been possible in earlier versions, I refurbished vocabulary creation (k-means clustering) and indexing to support up to 4 CPU cores. I also integrated a function to add documents to BoVW indexes incrementally. The major changes since Lire 0.8 include</p>
<ul>
<li>Major speed-up due to change and re-write of indexing strategies for local features</li>
<li>Auto color correlogram and color histogram features improved</li>
<li>Re-ranking filter based on global features and LSA</li>
<li>Parallel <em>bag of visual words</em> indexing and search supporting SURF and SIFT including incremental index updates (see also in the <a href="http://www.semanticmetadata.net/wiki/doku.php?id=lire:bovw">wiki</a>)</li>
<li>Added functionality to Lire Demo including support for new Lire features and a new result list view</li>
</ul>
<p>Download and try:</p>
<ul>
<li><a href="http://sourceforge.net/projects/caliph-emir/files/Lire/Lire%200.9/Lire-0.9.zip/download">Source and binaries</a> (or as <a href="http://sourceforge.net/projects/caliph-emir/files/Lire/Lire%200.9/Lire-0.9.tar.bz2/download">tar.bz2</a>)</li>
<li><a href="http://sourceforge.net/projects/caliph-emir/files/Lire/Lire%200.9/LireDemo_v0.9.zip/download">Lire Demo</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.semanticmetadata.net/2011/10/20/lire-and-lire-demo-v-0-9-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lire Demo 0.9 alpha 2 just released</title>
		<link>http://www.semanticmetadata.net/2011/08/05/lire-demo-0-9-alpha-2-just-released/</link>
		<comments>http://www.semanticmetadata.net/2011/08/05/lire-demo-0-9-alpha-2-just-released/#comments</comments>
		<pubDate>Fri, 05 Aug 2011 10:41:46 +0000</pubDate>
		<dc:creator>Mathias Lux</dc:creator>
				<category><![CDATA[Dev]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[demo]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Lire]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[Release]]></category>

		<guid isPermaLink="false">http://www.semanticmetadata.net/?p=786</guid>
		<description><![CDATA[Finally I found some time to go through Lire and fix several of the &#8212; for me &#8212; most annoying bugs. While this is still work in progress I have a preview with the demo uploaded to sf.net. New features are: Auto Color Correlogram and Color Histogram features improved Re-ranking based on different features supported [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.semanticmetadata.net/wp-content/uploads/2011/08/LireDemo_v0.9_a2.png"><img class="alignright size-thumbnail wp-image-789" title="LireDemo_v0.9_a2" src="http://www.semanticmetadata.net/wp-content/uploads/2011/08/LireDemo_v0.9_a2-150x150.png" alt="" width="150" height="150" /></a>Finally I found some time to go through Lire and fix several of the &#8212; for me &#8212; most annoying bugs. While this is still work in progress I have a preview with the demo uploaded to sf.net. New features are:</p>
<ul>
<li>Auto Color Correlogram and Color Histogram features improved</li>
<li>Re-ranking based on different features supported</li>
<li>Enhanced results view</li>
<li>Much faster indexing (parallel, use -server switch for your JVM)</li>
<li>Much faster search (re-write of the search code in Lire)</li>
<li>New developer menu for faster switching of search features</li>
<li>Re-ranking of results based on latent semantic analysis</li>
</ul>
<p>You can find the updated Lire Demo along with a Windows launcher here. Mac and Linux users, please run it using &#8220;java -jar &#8230; &#8221; or double-click it (if your window manager supports actions like that <img src='http://www.semanticmetadata.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />).</p>
<ul>
<li><a href="https://sourceforge.net/projects/caliph-emir/files/Lire/Lire%200.9/LireDemo-v09a2.zip/download">Download Liredemo v0.9 alpha 2</a></li>
</ul>
<p>The source is &#8212; of course &#8212; GPL and available in the <a href="http://caliph-emir.svn.sourceforge.net/viewvc/caliph-emir/">SVN</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.semanticmetadata.net/2011/08/05/lire-demo-0-9-alpha-2-just-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lire 0.8 released</title>
		<link>http://www.semanticmetadata.net/2010/03/11/lire-0-8-released/</link>
		<comments>http://www.semanticmetadata.net/2010/03/11/lire-0-8-released/#comments</comments>
		<pubDate>Thu, 11 Mar 2010 15:15:33 +0000</pubDate>
		<dc:creator>Mathias Lux</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[cbir]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Library]]></category>
		<category><![CDATA[Lire]]></category>
		<category><![CDATA[Release]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://www.semanticmetadata.net/2010/03/11/lire-0-8-released/</guid>
		<description><![CDATA[I just released LIRe v0.8. LIRe &#8211; Lucene Image Retrieval &#8211; is a Java library for easy content based image retrieval. Based on Lucene, it doesn&#8217;t need a database and works reliably and rather fast. The major change in this version is the support of Lucene 3.0.1, which has a changed API and better performance on [...]]]></description>
			<content:encoded><![CDATA[<p>I just released LIRe v0.8. LIRe &#8211; Lucene Image Retrieval &#8211; is a Java library for easy content based image retrieval. Based on Lucene, it doesn&#8217;t need a database and works reliably and rather fast. The major change in this version is the support of Lucene 3.0.1, which has a changed API and better performance on some operating systems. A critical bug was fixed in the Tamura feature implementation. It now definitely performs better <img src='http://www.semanticmetadata.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Hidden in the depths of the code there is an implementation of the approximate fast indexing approach of G. Amato. It copes with the problem of linear search and provides a method for fast approximate retrieval for huge repositories (millions?). Unfortunately I haven&#8217;t tested it with millions of images, just with tens of thousands, which proves that it works but doesn&#8217;t show how fast it is.</p>
<p><strong>Links</strong></p>
<ul>
<li><a href="https://sourceforge.net/projects/caliph-emir/files/">Download Lire &amp; Lire Demo</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.semanticmetadata.net/2010/03/11/lire-0-8-released/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>ZoRa Photo Director</title>
		<link>http://www.semanticmetadata.net/2010/03/04/zora-photo-director/</link>
		<comments>http://www.semanticmetadata.net/2010/03/04/zora-photo-director/#comments</comments>
		<pubDate>Thu, 04 Mar 2010 12:46:26 +0000</pubDate>
		<dc:creator>Mathias Lux</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[asset management]]></category>
		<category><![CDATA[Lire]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://www.semanticmetadata.net/?p=651</guid>
		<description><![CDATA[Just got word from Berthold Daum that he has integrated LIRe in the ZoRa Photo Director. That&#8217;s a desktop asset management application, written in Java, which allows for management of large photo collections. Source and binaries (Win &#38; Linux) are available at http://www.photozora.org. Not to forget: it&#8217;s built on Eclipse! Links http://www.photozora.org]]></description>
			<content:encoded><![CDATA[<p>Just got word from Berthold Daum that he has integrated LIRe in the <em>ZoRa Photo Director</em>. That&#8217;s a desktop asset management application, written in Java, which allows for management of large photo collections. Source and binaries (Win &amp; Linux) are available at <a href="http://www.photozora.org/html/zora_photo_director.html">http://www.photozora.org</a>. Not to forget: it&#8217;s built on Eclipse!</p>
<p><strong>Links</strong></p>
<ul>
<li><a href="http://www.photozora.org/html/zora_photo_director.html">http://www.photozora.org</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.semanticmetadata.net/2010/03/04/zora-photo-director/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Finding duplicate code &#8230;</title>
		<link>http://www.semanticmetadata.net/2008/07/02/finding-duplicate-code/</link>
		<comments>http://www.semanticmetadata.net/2008/07/02/finding-duplicate-code/#comments</comments>
		<pubDate>Wed, 02 Jul 2008 09:58:05 +0000</pubDate>
		<dc:creator>Mathias Lux</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Teaching]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[OpenSource]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://www.semanticmetadata.net/2008/07/02/finding-duplicate-code/</guid>
		<description><![CDATA[I recently found myself in a scenario where I tried to figure out how implementation clusters had formed implicitly within a group of students. All of them were given a task (with 4 sub tasks) for a whole semester. Everyone was meant to do the task alone, but collaboration was allowed. However, I needed [...]]]></description>
			<content:encoded><![CDATA[<p>I recently found myself in a scenario where I tried to figure out how implementation clusters had formed implicitly within a group of students. All of them were given a task (with 4 sub tasks) for a whole semester. Everyone was meant to do the task alone, but collaboration was allowed. However, I needed to know who helped whom and &#8211; of course &#8211; who helped whom with source code.</p>
<p>A colleague had a similar problem and he pointed me to <a href="http://pmd.sourceforge.net/cpd.html">PMD CPD</a> (= PMD Copy &amp; Paste Detector). This tool works lightning fast and has a GUI <img src='http://www.semanticmetadata.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Also it&#8217;s open source -&gt; respect!</p>
<p><strong>Links:</strong></p>
<ul>
<li><a href="http://pmd.sourceforge.net/cpd.html">http://pmd.sourceforge.net/cpd.html</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.semanticmetadata.net/2008/07/02/finding-duplicate-code/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

