Less than 20% of Flickr images tagged …

July 3, 2008 on 1:42 pm | In Tagging, Web2.0 |

While writing a scientific paper on tag recommendation, I checked, just out of curiosity, the share of Flickr images that are tagged by their uploaders. I found that more than 4 out of 5 images are untagged and that less than 15% of images have 2 or more tags.

My method and detailed results: In general one would need a random sample for such an investigation, but a truly random sample is hard to obtain without access to the database. Therefore I simply grabbed 20,004 images from the RSS feed for recent uploads and counted how many of them were tagged. Since it was easy enough, I also computed the confidence intervals:

  • In my sample, 3,650 images had at least one tag, which gives p1 = 18.25%.
    • At a confidence level of 0.99, p1 lies in [16.84%, 19.66%].
    • That leaves more than 4 out of 5 images untagged.
  • In the same sample, 2,628 images had at least two tags, which gives p2 = 13.14%.
    • At a confidence level of 0.99, p2 lies in [11.9%, 14.37%].
    • That means that less than 15% of the images have more than one tag.
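
For readers who want to redo the arithmetic, here is a minimal sketch in Python. It only uses the counts quoted above. The post does not say which interval method was used, so the normal-approximation (Wald) interval below is an assumption, and the bounds it produces need not coincide with the ones listed. The function name wald_interval is made up for this illustration, and the calculation treats the feed sample as if it were a simple random sample, which it is not strictly.

```python
import math
from statistics import NormalDist  # available from Python 3.8


def wald_interval(successes, n, confidence=0.99):
    """Return (share, lower, upper) for a binomial share.

    Uses the normal-approximation (Wald) interval. This method is an
    assumption; the post does not state how its intervals were computed.
    """
    share = successes / n
    # Two-sided critical value, e.g. about 2.576 for 99% confidence.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    half_width = z * math.sqrt(share * (1 - share) / n)
    return share, share - half_width, share + half_width


SAMPLE_SIZE = 20004  # images taken from the recent-uploads feed

for label, count in [("at least one tag", 3650), ("at least two tags", 2628)]:
    share, low, high = wald_interval(count, SAMPLE_SIZE)
    print(f"{label}: {share:.2%}, interval [{low:.2%}, {high:.2%}]")
```

Whatever interval method is chosen, the main caveat stays the same: the feed of recent uploads is a convenience sample, so the interval describes sampling noise only and says nothing about a possible bias towards freshly uploaded, not yet tagged images.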


1 Comment »

  1. It would be interesting to see whether the tagging ratio differs between recently uploaded images and images that have been on the server for a while, as you might reasonably assume that a Flickr user may need some time (a few weeks?) to go over his/her photos and tag them.

    Comment by Markus — July 3, 2008 #

© 2004-2007 by Mathias Lux
>> Contents of this page are licensed under the CreativeCommons Attribution 2.5 license <<