Robust, Web and Terabyte Retrieval with Hummingbird SearchServer™ at TREC 2004

Stephen Tomlinson. To appear in Proceedings of the Thirteenth Text Retrieval Conference (TREC 2004). Gaithersburg, Maryland, November 2004. NIST Special Publication 500-261.

Abstract

Hummingbird participated in 3 tracks of TREC 2004: the ad hoc task of the Robust Retrieval Track (find at least one relevant document in the first 10 rows from 1.9GB of news and government data), the mixed navigational and distillation task of the Web Track (find the home or named page or key resource pages in 1.2 million pages (18GB) from the .GOV domain), and the ad hoc task of the Terabyte Track (find all the relevant documents with high precision from 25.2 million pages (426GB) from the .GOV domain). In the robustness task, SearchServer found a relevant document in the first 10 rows for 46 of the 49 new short (Title-only) topics. In the web task, SearchServer returned a desired page in the first 10 rows for more than 75% of the 225 queries. In the terabyte task, SearchServer found a relevant document in the first 10 rows for 45 of the 49 short topics.

Full Paper

Related Information


Last Updated: 2005 July 1

Comments are welcome at comments@stephent.com.

Copyright © 2005 Stephen Tomlinson http://www.stephent.com/ir/papers/trec2004.html