<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SemanticMetadata.net &#187; stats</title>
	<atom:link href="http://www.semanticmetadata.net/tag/stats/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.semanticmetadata.net</link>
	<description></description>
	<lastBuildDate>Wed, 11 Jan 2012 16:35:54 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Less than 20% of Flickr images tagged &#8230;</title>
		<link>http://www.semanticmetadata.net/2008/07/03/less-than-20-of-flickr-images-tagged/</link>
		<comments>http://www.semanticmetadata.net/2008/07/03/less-than-20-of-flickr-images-tagged/#comments</comments>
		<pubDate>Thu, 03 Jul 2008 12:42:59 +0000</pubDate>
		<dc:creator>Mathias Lux</dc:creator>
				<category><![CDATA[Tagging]]></category>
		<category><![CDATA[Web2.0]]></category>
		<category><![CDATA[Development]]></category>
		<category><![CDATA[flickr]]></category>
		<category><![CDATA[Imaging]]></category>
		<category><![CDATA[stats]]></category>

		<guid isPermaLink="false">http://www.semanticmetadata.net/2008/07/03/less-than-20-of-flickr-images-tagged/</guid>
		<description><![CDATA[While writing a scientific paper on tag recommendation I checked &#8211; just out of curiosity &#8211; the share of images tagged by their uploaders on Flickr. I found out that 4 out of five images are untagged and that less than 15% of images have 2 or more tags. My method and detailed results: In [...]]]></description>
			<content:encoded><![CDATA[<p>While writing a scientific paper on tag recommendation I checked &#8211; just out of curiosity &#8211; the share of images tagged by their uploaders on <a href="http://flickr.com">Flickr</a>. I found out that 4 out of five images are untagged and that less than 15% of images have 2 or more tags.</p>
<p>My method and detailed results: In general one would need a random sample for such an investigation, but a truly random sample is hard to obtain without access to the data base. Therefore I just grabbed 20,004 images from the RSS feed for recent uploads and counted the number of tagged images. Easy enough I also computed the confidence interval:</p>
<ul>
<li>In my sample 3,650 images were tagged with at least one tag, that makes p1=18.25%</li>
<ul>
<li> With alpha=0.99 p1 is in [16.84, 19.66].</li>
<li>That leaves more than 4 out of 5 images untagged.</li>
</ul>
<li>Also in my sample 2,628 images were tagged with at least two tags, that makes p2=13,14%</li>
<ul>
<li>With alpha=0.99 p2 is in [11.9, 14.37].</li>
<li>That means that less than 15% of the images images have more than one tag.</li>
</ul>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.semanticmetadata.net/2008/07/03/less-than-20-of-flickr-images-tagged/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

