|
Nick Craswell 数据挖掘实验室
|
|
Research Overview I am interested in Web search evaluation, mostly on enterprise-scale webs but also the World Wide Web. I built the VLC, VLC2, WT2g and .GOV test collections, which have been made available to research groups around the world. David Hawking and I coordinated the TREC Web Track experiments. I am currently involved in the TREC Terabyte Track and Enterprise Track. Some publications: Book chapter preprint (pdf), IR′01 (citeseer) and CSIRO′01 (pdf). I also work on effective Web search, which means making use of information in pages, link structure and URL structure to generate more useful Web search results. Some papers: SIGIR′05 (pdf), SIGIR′01 (pdf), TOIS′03 (pdf) (copying is by permission of ACM, Inc.) and ADCS′03 (pdf). My PhD was in distributed information retrieval (thesis pdf) which means building a system on top of multiple engines/databases that already exist. My recent work in the area has considered whether (or when) DIR is really practical. Some papers: ADC′99 (ps), DL′00 (pdf), ADC′03 (pdf) and ADC′04 (pdf). |
|
Publications (bibtex) (DBLP) 数据挖掘研究院 (Numbers in square brackets are citation counts from Google scholar, including self citations, at August 2005.) 数据挖掘研究院 2005 Relevance weighting for query independent evidence (pdf)
Focused crawling for both topical relevance and quality of medical information (pdf) 数据挖掘研究院
Quality and Relevance of Domain-specific Search: A Case Study in Mental Health (pdf) 数据挖掘研究院
Very Large Scale Retrieval and Web Search (pdf) 数据挖掘实验室
2004 数据挖掘研究院 Toward Better Weighting of Anchors (pdf) 数据挖掘研究院
Testbed for Information Extraction from Deep Web (pdf) 数据挖掘研究院
How Valuable is External Link Evidence when Searching Enterprise Webs? (pdf) 数据挖掘研究院
Overview of the TREC-2004 Web Track (pdf)
Performance and Cost Tradeoffs in Web Search (pdf)
2003 数据挖掘研究院 [56] Engineering a multi-purpose test collection for Web retrieval experiments (doi)
[10] Query-independent evidence in home page finding (pdf) (copying is by permission of ACM, Inc.) 数据挖掘研究院
[6] Automated Discovery of Search Interfaces on the Web (pdf) 数据挖掘实验室
Overview of the TREC-2003 Web Track (pdf) 数据挖掘研究院
Predicting Fame and Fortune: PageRank or Indegree? (pdf) 数据挖掘研究院
Summary of the SIGIR 2003 workshop on defining evaluation methodologies for terabyte-scale test collections (pdf) 数据挖掘研究院
TREC12 Web Track at CSIRO (pdf)
2002 [37] Overview of the TREC-2002 Web Track (pdf)
Buying bestsellers online: A case study in Search & Searchability (pdf)
CSIRO INEX experiments: XML search using PADRE (pdf)
Enterprise search: What works and what doesn′t (pdf) 数据挖掘研究院
TREC11 Web and Interactive Tracks at CSIRO (pdf) 数据挖掘研究院
XML Document Retrieval with PADRE (pdf)
2001 数据挖掘研究院 [64] Effective site finding using link anchor information (pdf)
[52] Overview of the TREC-2001 Web Track (pdf)
[31] Measuring search engine quality (citeseer) 数据挖掘研究院
[8] Which search engine is best at finding online services? (pdf)
Visual Clustering of Image Search Results (citeseer) 数据挖掘实验室
Panoptic Expert: Searching for experts not just for documents (pdf) 数据挖掘研究院
TREC10 Web and Interactive Tracks at CSIRO (pdf) 数据挖掘研究院
Which search engine is best at finding airline site home pages? (pdf)
2000 数据挖掘研究院 [41] Server Selection on the World Wide Web (pdf)
[9] Dark matter on the Web (pdf) 数据挖掘研究院
Methods for Distributed Information Retrieval (pdf)
An intranet reality check for TREC ad hoc (pdf)
Chart of darkness: Mapping a large intranet (pdf) 数据挖掘研究院
Efficient and flexible search using text and metadata (pdf) 数据挖掘研究院
1999 [79] Results and challenges in Web search evaluation (pdf)
[30] Merging Results from Isolated Search Engines (ps)
Is it fair to evaluate Web systems using TREC ad hoc methods? (pdf) 数据挖掘研究院
ACSys TREC-8 experiments (pdf) 数据挖掘研究院
Overview of TREC-8 Web track (ps) 数据挖掘实验室
1998 数据挖掘研究院 [59] Overview of TREC-7 Very Large Collection Track (pdf) 数据挖掘实验室
[11] ACSys TREC-7 experiments (pdf)
1997 数据挖掘实验室 ANU/ACSys TREC-6 experiments (pdf) 数据挖掘研究院
Aglets: A good idea for spidering? (pdf)
|


