A novel design of hidden web crawler using ontology
This work addresses the problem of accessing structured, unstructured, and dynamic data on the Deep Web, a concern for the database community, but it appears incremental: it builds on existing crawler methods and adds ontology integration.
The paper tackles the challenge of accessing Deep Web content hidden behind HTML forms by developing a crawler that uses ontologies, and it reports promising results in its performance evaluation.
The Deep Web is content hidden behind HTML forms. Because it represents a large portion of the structured, unstructured, and dynamic data on the Web, accessing Deep Web content has been a long-standing challenge for the database community. This paper describes a crawler that accesses the Deep Web using ontologies. Performance evaluation of the proposed approach showed promising results.
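The abstract does not describe the crawler's internals, so the following is only an illustrative sketch of one way an ontology could guide form filling, not the paper's method. It assumes a toy ontology that maps each concept to label synonyms and candidate values; the names `ONTOLOGY`, `normalize`, `match_concept`, and `build_queries` are hypothetical and do not come from the paper.

```python
# Hypothetical toy ontology (an assumption, not taken from the paper):
# each concept lists the form labels it may appear under and sample values.
ONTOLOGY = {
    "author": {
        "synonyms": {"author", "writer", "written by"},
        "values": ["Knuth", "Tolkien"],
    },
    "title": {
        "synonyms": {"title", "book title", "name"},
        "values": ["Algorithms"],
    },
}


def normalize(label):
    """Lowercase a form label and collapse underscores, colons, and spaces."""
    cleaned = label.lower().replace("_", " ").replace(":", " ")
    return " ".join(cleaned.split())


def match_concept(label, ontology=ONTOLOGY):
    """Return the ontology concept whose synonyms contain the label, else None."""
    norm = normalize(label)
    for concept, entry in ontology.items():
        if norm in entry["synonyms"]:
            return concept
    return None


def build_queries(form_labels, ontology=ONTOLOGY):
    """Build one single-field query per candidate value of each recognized label."""
    queries = []
    for label in form_labels:
        concept = match_concept(label, ontology)
        if concept is None:
            continue  # Skip fields the ontology cannot interpret.
        for value in ontology[concept]["values"]:
            queries.append({label: value})
    return queries
```

In this sketch, a form with the fields `Author` and `Price` would yield two queries, one per author value, while `Price` is skipped because the toy ontology has no matching concept. A real crawler would also need to fetch and parse the form and submit each query, which is omitted here.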