Query sensitive comparative summarization of search results using concept based segmentation
This addresses the need for users to quickly compare information across websites for decision-making, but it is incremental as it builds on existing query-sensitive summarization techniques.
The paper tackles the problem of summarizing multiple web pages from search results for comparison, proposing a method that segments HTML documents into concept blocks and extracts important sentences based on query relevance to generate comparative summaries on the fly, reducing user browsing time and effort.
Query sensitive summarization aims at providing the users with the summary of the contents of single or multiple web pages based on the search query. This paper proposes a novel idea of generating a comparative summary from a set of URLs from the search result. User selects a set of web page links from the search result produced by search engine. Comparative summary of these selected web sites is generated. This method makes use of HTML DOM tree structure of these web pages. HTML documents are segmented into set of concept blocks. Sentence score of each concept block is computed with respect to the query and feature keywords. The important sentences from the concept blocks of different web pages are extracted to compose the comparative summary on the fly. This system reduces the time and effort required for the user to browse various web sites to compare the information. The comparative summary of the contents would help the users in quick decision making.