Identifying Facets in Query-Biased Sets of Blog Posts
Wouter de Winter and Maarten de Rijke
We investigate the identification of facets of query-biased sets of blog posts. Given a set of blog posts relevant to a topic, we compare several methods for identifying facets of the topic in this set. Building on a clustering of a set of blog posts, we compare several cluster labeling methods, and find that a method that makes use of blog and blog search specific features outperforms other methods. We also present efficiencyimproving feature sets for clustering; our proposed method is fast enough to be deployed online.
Short Paper
Available as PDF