Universiti Teknologi Malaysia Institutional Repository

A clickstream-based web page significance ranking metric for web crawlers

Selamat, Ali and Ahmadi-Abkenari, Fatemeh (2011) A clickstream-based web page significance ranking metric for web crawlers. In: The 5th Malaysian Software Engineering Conference (Mysec 2011).

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1109/MySEC.2011.6140674

Abstract

The unpredictable fast growing dimension of the World Wide Web and its non-static nature causes considerable obstacles for Web crawlers including the presence of some incorrect and irrelevant answers among search results set and the scaling issues. Hence, solutions that are more promising are in demand to provide more accurate search outcomes. Because implementing existed Web page importance metrics either link based or context based within a parallel crawler can not be an absolute solution for the coverage of authorized fresh Web content and the accuracy concerns, so employing these metrics is not the final approach within search engines' architecture. This paper proposes an analysis on clickstream data in order to discover the popularity of Web pages in crawl frontier through proposing the metric itself and presenting the experimental results on ranking the UTM Web pages based on the proposed discussed metric.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:web page
Divisions:Computing
ID Code:45469
Deposited By: Haliza Zainal
Deposited On:10 Jun 2015 03:00
Last Modified:29 Aug 2017 00:59

Repository Staff Only: item control page