Number of items: 2.
KohlschÃ¼tter, Christian A Densitometric Analysis of Web Template Content.
What makes template content in the Web so special that we need to remove it? In this paper I present a large-scale aggregate analysis of textual Web content, corroborating statistical laws from the ï¬eld of Quantitative Linguistics. I analyze the idiosyncrasy of template content compared to regular âfull textâ content and derive a simple yet suitable quantitative model.
and Firan, Claudiu S.
and Paiu, Raluca Deriving Music Theme Annotations from User Tags.
Music theme annotations would be really beneï¬cial for supporting retrieval, but are often neglected by users while annotating. Thus, in order to support users in tagging and to ï¬ll the gaps in the tag space, in this paper we develop algorithms for recommending theme annotations. Our methods exploit already existing user tags, the lyrics of music tracks, as well as combinations of both. We compare the results for our recommended theme annotations against genre and style recommendations â a much easier and already studied task. We evaluate the quality of our recommended tags against an expert ground truth data set. Our results are promising and provide interesting insights into possible extensions for music tagging systems to support music search.
About this site
This website has been set up for WWW2009 by Christopher Gutteridge of the University of Southampton, using our EPrints software.
We (Southampton EPrints Project) intend to preserve the files and HTML pages of this site for many years, however we will turn it into flat files for long term preservation. This means that at some point in the months after the conference the search, metadata-export, JSON interface, OAI etc. will be disabled as we "fossilize" the site. Please plan accordingly. Feel free to ask nicely for us to keep the dynamic site online longer if there's a rally good (or cool) use for it... [this has now happened, this site is now static]