IROct 16, 2014

On the Feasibility and Implications of Self-Contained Search Engines in the Browser

arXiv:1410.4500v11 citationsHas Code
Originality Incremental advance
AI Analysis

This demonstrates feasibility for offline and private search applications, potentially enabling new hybrid architectures and an online search marketplace.

The paper investigates building a self-contained JavaScript search engine that runs entirely in the browser, including index construction and query evaluation, and finds it is sufficiently responsive for interactive applications despite performance shortfalls compared to Lucene.

JavaScript engines inside modern browsers are capable of running sophisticated multi-player games, rendering impressive 3D scenes, and supporting complex, interactive visualizations. Can this processing power be harnessed for information retrieval? This paper explores the feasibility of building a JavaScript search engine that runs completely self-contained on the client side within the browser---this includes building the inverted index, gathering terms statistics for scoring, and performing query evaluation. The design takes advantage of the IndexDB API, which is implemented by the LevelDB key-value store inside Google's Chrome browser. Experiments show that although the performance of the JavaScript prototype falls far short of the open-source Lucene search engine, it is sufficiently responsive for interactive applications. This feasibility demonstration opens the door to interesting applications in offline and private search across multiple platforms as well as hybrid split-execution architectures whereby clients and servers collaboratively perform query evaluation. One possible future scenario is the rise of an online search marketplace in which commercial search engine companies and individual users participate as rational economic actors, balancing privacy, resource usage, latency, and other factors based on customizable utility profiles.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes