Publications

(also available in BibTeX format and as a formatted bibliography in PDF.)

Important note: Full-text versions of papers provided via this site are author versions. In the case of ACM publications (TOIS, SIGIR, CIKM, etc) they are posted here by permission of ACM for your personal use; in the case of Springer publications (IR, etc) they are posted here by permission of Springer for your personal use. Not for redistribution.

Please cite definitive versions as noted in the bibtex entries.

2009

100. Proceedings of the 14th Australasian Document Computing Symposium bibtex
Judy Kay, Paul Thomas, and Andrew Trotman (eds)
To appear.
99. Web indexing on a diet: Template removal with the sandwich algorithm bibtex
Tom Rowlands, Paul Thomas, and Stephen Wan
in Proceedings of the 14th Australasian Document Computing Symposium. Sydney, Australia. To appear.
98. Stakeholders and their respective costs-benefits in IR evaluation  (pdf) bibtex
Cecile Paris, Nathalie Colineau, Paul Thomas, and Ross Wilkinson
in Proceedings of the SIGIR workshop on the Future of IR Evaluation. Boston, USA.
97. New methods for creating testfiles: Tuning enterprise search with C-TEST  (pdf) bibtex
David Hawking, Paul Thomas, Tom Gedeon, Tim Jones, and Tom Rowlands
in Proceedings of the SIGIR workshop on the Future of IR Evaluation. Boston, USA.
96. C-TEST: Supporting novelty and diversity in testfiles for search evaluation  (pdf) bibtex
David Hawking, Tom Rowlands, and Paul Thomas
in Proceedings of the SIGIR workshop on redundancy, diversity and interdependent document relevance. Boston, USA.
95. Quality-oriented search for depression portals  (pdf) bibtex
Thanh Tang, Ramesh Sankaranarayana, David Hawking, Kathleen Griffiths, and Nick Craswell
in Proceedings of ECIR 2009. Toulouse, France, pp. 637-644.
94. Nullification test collections for web spam and SEO  (pdf) bibtex
Timothy Jones, Ramesh Sankaranarayana, David Hawking, and Nick Craswell
in AIRWeb '09: Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web. Madrid, Spain, pp. 53-60.
93. SUSHI: Scoring scaled samples for server selection  (pdf) bibtex
Paul Thomas and Milad Shokouhi
in Proceedings of ACM SIGIR. Boston, USA.
92. Effective query expansion for federated search  (pdf) bibtex
Milad Shokouhi, Leif Azzopardi, and Paul Thomas
in Proceedings of ACM SIGIR. Boston, USA.
91. Server selection methods in personal metasearch: a comparative empirical study  (pdf) bibtex
Paul Thomas and David Hawking
Information Retrieval, 2009, pp. 581-604.
90. Web indexing on a diet: template removal with the sandwich algorithm  (pdf) bibtex
Stephen Wan, Tom Rowlands, and Paul Thomas
Tech Report: 09120, CSIRO ICT Centre.
89. Quality of language models for distributed information retrieval  (pdf) bibtex
Paul Thomas
Tech Report: 08119, CSIRO ICT Centre.

2008

88. Server Characterisation and Selection for Personal Metasearch  (pdf) bibtex
Paul Thomas
SIGIR Forum 42(2), pp. 108-109.
87. Anonymous folksonomies for small enterprise webs: a case study  (pdf) bibtex
Tom Rowlands, David Hawking, and Ramesh Sankaranarayana
in Proceedings of ADCS '08.
86. Proceedings of the Thirteenth Australasian Document Computing Symposium  (http://es.csiro.au/adcs2008/) bibtex
Rob McArthur, Paul Thomas, Andrew Turpin, and Mingfang Wu (eds)
85. Experiences evaluating personal metasearch  (pdf) bibtex
Paul Thomas and David Hawking
in Proceedings of IIiX.
84. Implementation of PIS  (pdf) bibtex
Paul Thomas
Tech Report: 200802, Department of Computer Science, Australian National University.
83. Server characterisation and selection for personal metasearch  (pdf) bibtex
Paul Thomas
82. Generalising Multiple Capture-Recapture to Non-Uniform Sample Sizes  (pdf) bibtex
Paul Thomas
in Proceedings of ACM SIGIR. Singapore.
81. Relevance Assessment: Are Judges Exchangeable and Does it Matter?  (pdf) bibtex
Peter Bailey, Nick Craswell, Ian Soboroff, Paul Thomas, Arjen de Vries, and Emine Yilmaz
in Proceedings of ACM SIGIR. Singapore.

2007

80. A framework for measuring the impact of Web spam  (pdf) bibtex
Timothy Jones, David Hawking, and Ramesh Sankaranarayana
in Proceedings of ADCS '07.
79. The CSIRO enterprise search test collection  (pdf) bibtex
Peter Bailey, Nick Craswell, Ian Soboroff, and Arjen P. de Vries
SIGIR Forum 41(2), pp. 42-45.
78. Does Brandname Influence Perceived Search Result Quality? Yahoo!,  (pdf) bibtex
Peter Bailey, Paul Thomas, and David Hawking
in Proceedings of ADCS 2007. Melbourne, Australia.
77. TREC 2007 Enterprise Track at CSIRO  (pdf) bibtex
Peter Bailey, Deepak Agrawal, and Anuj Kumar
in The Sixteenth Text Retrieval Conference (TREC 2007) Proceedings. Gaithersburg, USA.
76. Overview of the TREC Enterprise 2007 Track  (pdf) bibtex
Peter Bailey, Nick Craswell, Ian Soboroff, and Arjen P. de Vries
in The Sixteenth Text Retrieval Conference (TREC 2007) Proceedings. Gaithersburg, USA.
75. Towards higher quality health search results: Automated quality rating of depression websites  (pdf) bibtex
David Hawking, Thanh Tang, Ramesh Sankaranarayana, Kathleen Griffiths, Nick Craswell, and Peter Bailey
in Proceedings of Medinfo 2007 Workshop on "Models of trust for health websites". Brisbane, Australia.
74. Characteristics of .au Websites: An Analysis of Large-Scale Web Crawl Data from 2005  (pdf) bibtex
Robert Ackland, Amanda Spink, and Peter Bailey
in Proceedings of the Thirteenth Australasian World Wide Web Conference (AusWeb07). Coffs Harbour, Australia.
73. Understanding the Relationship of Information Need Specificity to Search Query Length  (pdf) bibtex
Nina Phan, Peter Bailey, and Ross Wilkinson
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 709-710.
72. Evaluating sampling methods for uncooperative collections  (pdf) bibtex
Paul Thomas and David Hawking
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 503-510.
71. Fast Generation of Result Snippets in Web Search  (pdf) bibtex
Andrew Turpin, Yohannes Tsegay, David Hawking, and Hugh Williams
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 127-134.
70. Workload sampling for enterprise search evaluation  (pdf) bibtex
Tom Rowlands, David Hawking, and Ramesh Sankaranarayana
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 887-888.
69. Estimating the value of automatic disambiguation bibtex
Paul Thomas and Tom Rowlands
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 719-720.
68. CSIRO  (pdf) bibtex
Alexander Krumpholz and David Hawking
in Advances in XML Information Retrieval and Evaluation: INEX 2006. Springer-Verlag, pp. 73-81.
67. Does Topic Metadata Help with Web Search?  (pdf) bibtex
David Hawking and Justin Zobel
JASIST 58(5), pp. 613-628.

2006

66. Improving rankings in small-scale web search using click-implied descriptions  (pdf) bibtex
David Hawking, Tom Rowlands, and Matt Adcock
Australian Journal of Intelligent Information Processing Systems. ADCS 2006 special issue. 9(2), pp. 17-24.
65. InexBib - Retrieving XML Elements Based on External Evidence  (pdf) bibtex
Alexander Krumpholz and David Hawking
Australian Journal of Intelligent Information Processing Systems. ADCS 2006 special issue. 9(2), pp. 72-79.
64. My Instant Expert  (pdf) bibtex
George Ferizis and Peter Bailey
in Proceedings of ADCS 2006. Brisbane, Australia, pp. 25-32.
63. Possible Approaches to Evaluating Adaptive Question Answering for Mobile Environments  (pdf) bibtex
Peter Bailey and George Ferizis
in Poster Proceedings of the First International Workshop on Adaptive Information Retrieval (AIR). Glasgow, UK.
62. Secure Search in Enterprise Webs: Tradeoffs in Efficient Implementation for Document Level Security  (pdf) bibtex
Peter Bailey, David Hawking, and Brett Matson
in Proceedings of CIKM 2006. Arlington, VA.
61. Evaluation by Comparing Result Sets in Context  (pdf) bibtex
Paul Thomas and David Hawking
in Proceedings of CIKM 2006. Arlington, VA.
60. How things work: Web Search Engines (Part 1)  (pdf) bibtex
David Hawking
IEEE Computer June, 2006, pp. 86-88.
59. How things work: Web Search Engines (Part 2)  (pdf) bibtex
David Hawking
IEEE Computer August, 2006, pp. 88-90.
58. Toward meaningful test collections for information integration benchmarking  (pdf) bibtex
Peter Bailey, David Hawking, and Alexander Krumpholz
in Proceedings of IIWeb 2006 (WWW Workshop). Edinburgh, UK.
57. Towards Practical Genre Classification of Web Documents (Poster)  (pdf) bibtex
George Ferizis and Peter Bailey
in Proceedings of WWW 2006.
56. Quality and Relevance of Domain-specific Search: A Case Study in Mental Health  (pdf) bibtex
Thanh Tang, Nick Craswell, David Hawking, Kathleen Griffiths, and Helen Christensen
Information Retrieval 9(2), pp. 207-225.
55. A Perspective on Web Information Retrieval (Introduction to the Web IR Special Issue)  (pdf) bibtex
Massimo Melucci and David Hawking
Information Retrieval 9(2), pp. 207-225.

2005

54. TREC 14 Enterprise Track at CSIRO and ANU  (pdf) bibtex
Mingfang Wu, Paul Thomas, and David Hawking
in Proceedings of TREC-2005. Gaithersburg, MD.
53. Recommended Reading for IR  (pdf) bibtex
Alistair Moffat, Justin Zobel, and David Hawking (eds)
SIGIR Forum 39(2), pp. 3-14.
52. Automated Assessment of the Quality of Depression Websites  (pdf) bibtex
Kathleen Griffiths, Thanh Tang, David Hawking, and Helen Christensen
Journal of Medical Internet Research, 2005.
51. Server Selection Methods in Hybrid Portal Search  (pdf) bibtex
David Hawking and Paul Thomas
in Proceedings of ACM SIGIR 2005. Salvador, Brazil, pp. 75-82.
50. Context in Enterprise Search and Delivery  (pdf) bibtex
David Hawking and C\'e
in Proceedings of ACM SIGIR 2005 Workshop on Information Retrieval in Context (IRiX). Salvador, Brazil, pp. 14-16.
49. Focused crawling for both relevance and quality of medical information  (pdf) bibtex
Thanh Tang, David Hawking, Nick Craswell, and Kathleen Griffiths
in Proceedings of CIKM'2005. Bremen, Germany, pp. 147-154.
48. The Very Large Collection and Web Tracks  (pdf) bibtex
David Hawking and Nick Craswell
in Ellen Voorhees and Donna Harman (eds), TREC: Experiment. MIT Press.

2004

47. Focused Crawling in Depression Portal Search: A Feasibility Study  (pdf) bibtex
Thanh Tang, David Hawking, Nick Craswell, and Ramesh S. Sankaranarayana
in Proceedings of the Australasian Document Computing Symposium ADCS 2004. Melbourne, Australia, pp. 2-9.
46. Overview of the TREC-2004 Web Track  (pdf) bibtex
Nick Craswell and David Hawking
in Proceedings of TREC-2004. Gaithersburg, MD.
45. Performance and Cost Tradeoffs in Web Search  (pdf) bibtex
Nick Craswell, Francis Crimmins, David Hawking, and Alistair Moffat
in Proceedings of the Australasian Database Conference ADC2004. Dunedin, New Zealand, pp. 161-170.
44. How Valuable is External Link Evidence when Searching Enterprise Webs?  (pdf) bibtex
David Hawking, Nick Craswell, Francis Crimmins, and Trystan Upstill
in Proceedings of the Australasian Database Conference ADC2004. Dunedin, New Zealand, pp. 77-84.
43. Challenges in Enterprise Search  (pdf) bibtex
David Hawking
in Proceedings of the Australasian Database Conference ADC2004. Dunedin, New Zealand, pp. 15-26.
42. Towards Better Weighting of Anchors (Poster)  (pdf) bibtex
David Hawking, Trystan Upstill, and Nick Craswell
in Proceedings of SIGIR'2004. Sheffield, UK, pp. 512-513.

2003

41. TREC12 Web Track  (pdf) bibtex
Nick Craswell, David Hawking, Trystan Upstill, Alistair McLean, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC-2003. Gaithersburg, MD.
40. Overview of the TREC-2003  (pdf) bibtex
Nick Craswell, David Hawking, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC-2003. Gaithersburg, MD.
39. Summary of the SIGIR 2003  (pdf) bibtex
Ian Soboroff, Ellen Voorhees, and Nick Craswell
SIGIR Forum 37(2), pp. 55-58.
38. Predicting Fame and Fortune: PageRank or Indegree?  (pdf) bibtex
Trystan Upstill, Nick Craswell, and David Hawking
in Proceedings of the Australasian Document Computing Symposium, ADCS2003. Canberra, Australia, pp. 31-40.
37. Query-independent evidence in home page finding  (pdf) bibtex
Trystan Upstill, Nick Craswell, and David Hawking
ACM Transactions on Information Systems (TOIS) 21(3), pp. 286-313.
36. Result Merging Strategies for a Current News MetaSearcher  (pdf) bibtex
Yves Rasolofo, David Hawking, and Jacques Savoy
Information Processing and Management, 2003, pp. 581-609.
35. On Collection Size and Retrieval Effectiveness  (html) bibtex
David Hawking and Stephen Robertson
Information Retrieval 6(1), pp. 99-150.
34. Engineering a multi-purpose test collection for Web retrieval experiments  (pdf) bibtex
Peter Bailey, Nick Craswell, and David Hawking
Information Processing and Management 39(6), pp. 853-871.
33. Very Large Scale Information Retrieval bibtex
David Hawking
in Gregory Grefenstette and Steve Renals (eds), Text and Speech Triggered Information Access. Springer.
32. Automated Discovery of Search Interfaces on the Web  (pdf) bibtex
Jared Cope, Nick Craswell, and David Hawking
in The Fourteenth Australasian Database Conference. Adelaide, Australia.
31. A Task Oriented Approach to Delivery in Mobile Environments  (pdf) bibtex
Francois Paradis, Francis Crimmins, and Nadine Ozkan
in 4th International Conference on Mobile Data Management. Melbourne, Australia.

2002

30. TREC11 Web and Interactive Tracks at CSIRO  (pdf) bibtex
Nick Craswell, David Hawking, James Thom, Trystan Upstill, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC-2002. Gaithersburg, MD.
29. Overview of the TREC-2002 Web Track  (pdf) bibtex
Nick Craswell and David Hawking
in Proceedings of TREC-2002. Gaithersburg, MD.
28. CSIRO INEX experiments: XML search using PADRE  (pdf) bibtex
Anne-Marie Vercoustre, James A. Thom, Alexander Krumpholz, Ian Mathieson, Peter Wilkins, Mingfang Wu, Nick Craswell, and David Hawking
in INEX 2002 Workshop. Dagstuhl, Germany.
27. Buying bestsellers online: A case study in Search and Searchability  (pdf) bibtex
Trystan Upstill, Nick Craswell, and David Hawking
in 7th Australasian Document Computing Symposium. Sydney, Australia.
26. XML Document Retrieval with PADRE  (pdf) bibtex
Nick Craswell, David Hawking, Alexander Krumpholz, Ian Mathieson, James A. Thom, Anne-Marie Vercoustre, Peter Wilkins, and Mingfang Wu
in 7th Australasian Document Computing Symposium. Sydney, Australia.
25. Enterprise search: What works and what doesn't  (pdf) bibtex
David Hawking, Nick Craswell, Francis Crimmins, and Trystan Upstill
in Proceedings of the Infonortics Search Engines Meeting. San Francisco, CA.

2001

24. Measuring search engine quality  (pdf) bibtex
David Hawking, Nick Craswell, Peter Bailey, and Kathleen Griffiths
Information Retrieval 4(1), pp. 33-59.
23. Effective site finding using link anchor information  (pdf) bibtex
Nick Craswell, David Hawking, and Stephen Robertson
in Proceedings of ACM SIGIR 2001. New Orleans, LA, pp. 250-257.
22. Which search engine is best at finding airline site home pages?  (pdf) bibtex
Nick Craswell, David Hawking, and Kathleen Griffiths
Tech Report: 01, CSIRO Mathematical and Information Sciences.
21. Which search engine is best at finding online services?  (pdf) bibtex
David Hawking, Nick Craswell, and Kathleen Griffiths
in Poster Proceedings of WWW10. Hong Kong.
20. Overview of the TREC-2001  (pdf) bibtex
David Hawking and Nick Craswell
in Proceedings of TREC. Gaithersburg, MD.
19. TREC10 Web and Interactive Tracks at CSIRO  (pdf) bibtex
Nick Craswell, David Hawking, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC. Gaithersburg, MD.
18. Panoptic Expert: Searching for experts not just for documents  (pdf) bibtex
Nick Craswell, David Hawking, Anne-Marie Vercoustre, and Peter Wilkins
in Ausweb Poster Proceedings. Coffs Harbour, Australia.
17. Visual Clustering of Image Search Results  (pdf) bibtex
Trystan Upstill, Raj Nagappan, and Nick Craswell
in SPIE Visual Data Exploration and Analysis VIII. San Jose, CA.

2000

16. Server Selection on the World Wide Web  (pdf) bibtex
Nick Craswell, Peter Bailey, and David Hawking
in Proceedings of the ACM Digital Libraries Conference. San Antonio, TX, pp. 37-46.
15. Overview of TREC-9 Web Track  (pdf) bibtex
David Hawking
in Proceedings of TREC. Gaithersburg, MD, pp. 131-150.
14. ACSys/CSIRO TREC-9  (pdf) bibtex
David Hawking
in Proceedings of TREC. Gaithersburg, MD.
13. Methods for Distributed Information Retrieval  (pdf) bibtex
Nick Craswell
12. Dark matter on the Web  (pdf) bibtex
Peter Bailey, Nick Craswell, and David Hawking
in WWW-9 Poster Proceedings. Amsterdam, The Netherlands.
11. An intranet reality check for TREC  (pdf) bibtex
David Hawking, Peter Bailey, and Nick Craswell
Tech Report, CSIRO Mathematical and Information Sciences.
10. Chart of darkness: Mapping a large intranet  (pdf) bibtex
Peter Bailey, Nick Craswell, and David Hawking
Tech Report, CSIRO Mathematical and Information Sciences.
9. Efficient and flexible search using text and metadata  (pdf) bibtex
David Hawking, Peter Bailey, and Nick Craswell
Tech Report, CSIRO Mathematical and Information Sciences.

1999

8. Is it fair to evaluate Web systems using TREC  (pdf) bibtex
Nick Craswell, Peter Bailey, and David Hawking
in ACM SIGIR '99 Workshop on Web Retrieval. Berkeley, CA.
7. Merging Results from Isolated Search Engines  (pdf) bibtex
Nick Craswell, David Hawking, and Paul Thistlewaite
in Proceedings of the 10th Australasian Database Conference. Auckland, New Zealand, pp. 189-200.
6. Overview of TREC-8 Web track  (pdf) bibtex
David Hawking, Ellen Voorhees, Nick Craswell, and Peter Bailey
in Proceedings of TREC-8. Gaithersburg, MD, pp. 131-150.
5. ACSys TREC-8 experiments  (pdf) bibtex
David Hawking, Peter Bailey, and Nick Craswell
in Proceedings of TREC-8. Gaithersburg, MD.
4. Results and challenges in Web search evaluation  (pdf) bibtex
David Hawking, Nick Craswell, Paul Thistlewaite, and Donna Harman
in Proceedings of WWW8. Toronto, Canada, pp. 1321-1330.
3. Methods for Information Server Selection  (pdf) bibtex
David Hawking and Paul Thistlewaite
ACM Transactions on Information Systems. 17(1), pp. 40-76.
2. Scaling Up the TREC  (http://dx.doi.org/10.1023/a:1009938405269) bibtex
David Hawking, Paul Thistlewaite, and Donna Harman
Information Retrieval 1(1), pp. 115-137.
1. Plans for the TREC-9 Web Track  (pdf) bibtex
David Hawking
SIGIR Forum 33(2), pp. 17-18.