Publications
(Also available in BibTeX format and as a formatted bibliography in PDF.)
Important note: Full-text versions of papers provided via this site are author versions. In the case of ACM publications (TOIS, SIGIR, CIKM, etc.) they are posted here by permission of ACM for your personal use; in the case of Springer publications (IR, etc.) they are posted here by permission of Springer for your personal use. Not for redistribution.
Please cite definitive versions as noted in the BibTeX entries.
2010
103. Evaluating server selection for federated search
Paul Thomas and Milad Shokouhi
in Proceedings of ECIR. Milton Keynes, UK. To appear.
2009
102. Focused and aggregated search: A perspective from natural language generation
Cecile Paris, Stephen Wan, and Paul Thomas
Information Retrieval, 2009. To appear.
101. Proceedings of the 14th Australasian Document Computing Symposium (http://es.csiro.au/adcs2009/)
Judy Kay, Paul Thomas, and Andrew Trotman (Eds.)
100. Web indexing on a diet: Template removal with the sandwich algorithm
Tom Rowlands, Paul Thomas, and Stephen Wan
in Proceedings of the 14th Australasian Document Computing Symposium. Sydney, Australia.
99. Stakeholders and their respective costs-benefits in IR evaluation
Cecile Paris, Nathalie Colineau, Paul Thomas, and Ross Wilkinson
in Proceedings of the SIGIR workshop on the Future of IR Evaluation. Boston, USA.
98. New methods for creating testfiles: Tuning enterprise search with C-TEST
David Hawking, Paul Thomas, Tom Gedeon, Tim Jones, and Tom Rowlands
in Proceedings of the SIGIR workshop on the Future of IR Evaluation. Boston, USA.
97. C-TEST: Supporting novelty and diversity in testfiles for search evaluation
David Hawking, Tom Rowlands, and Paul Thomas
in Proceedings of the SIGIR workshop on redundancy, diversity and interdependent document relevance. Boston, USA.
96. Quality-oriented search for depression portals
Thanh Tang, Ramesh Sankaranarayana, David Hawking, Kathleen Griffiths, and Nick Craswell
in Proceedings of ECIR 2009. Toulouse, France, pp. 637-644.
95. Nullification test collections for web spam and SEO
Timothy Jones, Ramesh Sankaranarayana, David Hawking, and Nick Craswell
in AIRWeb '09: Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web. Madrid, Spain, pp. 53-60.
94. SUSHI: Scoring scaled samples for server selection
Paul Thomas and Milad Shokouhi
in Proceedings of ACM SIGIR. Boston, USA.
93. Effective query expansion for federated search
Milad Shokouhi, Leif Azzopardi, and Paul Thomas
in Proceedings of ACM SIGIR. Boston, USA.
92. Server selection methods in personal metasearch: a comparative empirical study
Paul Thomas and David Hawking
Information Retrieval, 2009, pp. 581-604.
91. Web indexing on a diet: template removal with the sandwich algorithm
Stephen Wan, Tom Rowlands, and Paul Thomas
Tech Report: 09120, CSIRO ICT Centre.
90. Quality of language models for distributed information retrieval
Paul Thomas
Tech Report: 08119, CSIRO ICT Centre.
2008
89. Server Characterisation and Selection for Personal Metasearch
Paul Thomas
SIGIR Forum 42(2), pp. 108-109.
88. Anonymous folksonomies for small enterprise webs: a case study
Tom Rowlands, David Hawking, and Ramesh Sankaranarayana
in Proceedings of ADCS '08.
87. Proceedings of the Thirteenth Australasian Document Computing Symposium (http://es.csiro.au/adcs2008/)
Rob McArthur, Paul Thomas, Andrew Turpin, and Mingfang Wu (Eds.)
86. Overview of the TREC 2008 Enterprise Track
Krisztian Balog, Ian Soboroff, Paul Thomas, Peter Bailey, Nick Craswell, and Arjen P. de Vries
in The Seventeenth Text Retrieval Conference (TREC) Proceedings.
85. Experiences evaluating personal metasearch
Paul Thomas and David Hawking
in Proceedings of IIiX.
84. Implementation of PIS
Paul Thomas
Tech Report: 200802, Department of Computer Science, Australian National University.
83. Server characterisation and selection for personal metasearch
Paul Thomas
82. Generalising Multiple Capture-Recapture to Non-Uniform Sample Sizes
Paul Thomas
in Proceedings of ACM SIGIR. Singapore.
81. Relevance Assessment: Are Judges Exchangeable and Does it Matter?
Peter Bailey, Nick Craswell, Ian Soboroff, Paul Thomas, Arjen de Vries, and Emine Yilmaz
in Proceedings of ACM SIGIR. Singapore.
2007
80. A framework for measuring the impact of Web spam
Timothy Jones, David Hawking, and Ramesh Sankaranarayana
in Proceedings of ADCS '07.
79. The CSIRO enterprise search test collection
Peter Bailey, Nick Craswell, Ian Soboroff, and Arjen P. de Vries
SIGIR Forum 41(2), pp. 42-45.
78. Does Brandname Influence Perceived Search Result Quality? Yahoo!,
Peter Bailey, Paul Thomas, and David Hawking
in Proceedings of ADCS 2007. Melbourne, Australia.
77. TREC 2007 Enterprise Track at CSIRO
Peter Bailey, Deepak Agrawal, and Anuj Kumar
in The Sixteenth Text Retrieval Conference (TREC 2007) Proceedings. Gaithersburg, USA.
76. Overview of the TREC Enterprise 2007 Track
Peter Bailey, Nick Craswell, Ian Soboroff, and Arjen P. de Vries
in The Sixteenth Text Retrieval Conference (TREC 2007) Proceedings. Gaithersburg, USA.
75. Towards higher quality health search results: Automated quality rating of depression websites
David Hawking, Thanh Tang, Ramesh Sankaranarayana, Kathleen Griffiths, Nick Craswell, and Peter Bailey
in Proceedings of Medinfo 2007 Workshop on "Models of trust for health websites". Brisbane, Australia.
74. Characteristics of .au Websites: An Analysis of Large-Scale Web Crawl Data from 2005
Robert Ackland, Amanda Spink, and Peter Bailey
in Proceedings of the Thirteenth Australasian World Wide Web Conference (AusWeb07). Coffs Harbour, Australia.
73. Understanding the Relationship of Information Need Specificity to Search Query Length
Nina Phan, Peter Bailey, and Ross Wilkinson
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 709-710.
72. Evaluating sampling methods for uncooperative collections
Paul Thomas and David Hawking
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 503-510.
71. Fast Generation of Result Snippets in Web Search
Andrew Turpin, Yohannes Tsegay, David Hawking, and Hugh Williams
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 127-134.
70. Workload sampling for enterprise search evaluation
Tom Rowlands, David Hawking, and Ramesh Sankaranarayana
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 887-888.
69. Estimating the value of automatic disambiguation
Paul Thomas and Tom Rowlands
in Proceedings of ACM SIGIR 2007. Amsterdam, Netherlands, pp. 719-720.
68. CSIRO
Alexander Krumpholz and David Hawking
in Advances in XML Information Retrieval and Evaluation: INEX 2006. Springer-Verlag, pp. 73-81.
67. Does Topic Metadata Help with Web Search?
David Hawking and Justin Zobel
JASIST 58(5), pp. 613-628.
2006
66. Improving rankings in small-scale web search using click-implied descriptions
David Hawking, Tom Rowlands, and Matt Adcock
Australian Journal of Intelligent Information Processing Systems. ADCS 2006 special issue. 9(2), pp. 17-24.
65. InexBib - Retrieving XML Elements Based on External Evidence
Alexander Krumpholz and David Hawking
Australian Journal of Intelligent Information Processing Systems. ADCS 2006 special issue. 9(2), pp. 72-79.
64. My Instant Expert
George Ferizis and Peter Bailey
in Proceedings of ADCS 2006. Brisbane, Australia, pp. 25-32.
63. Possible Approaches to Evaluating Adaptive Question Answering for Mobile Environments
Peter Bailey and George Ferizis
in Poster Proceedings of the First International Workshop on Adaptive Information Retrieval (AIR). Glasgow, UK.
62. Secure Search in Enterprise Webs: Tradeoffs in Efficient Implementation for Document Level Security
Peter Bailey, David Hawking, and Brett Matson
in Proceedings of CIKM 2006. Arlington, VA.
61. Evaluation by Comparing Result Sets in Context
Paul Thomas and David Hawking
in Proceedings of CIKM 2006. Arlington, VA.
60. How things work: Web Search Engines (Part 1)
David Hawking
IEEE Computer June, 2006, pp. 86-88.
59. How things work: Web Search Engines (Part 2)
David Hawking
IEEE Computer August, 2006, pp. 88-90.
58. Toward meaningful test collections for information integration benchmarking
Peter Bailey, David Hawking, and Alexander Krumpholz
in Proceedings of IIWeb 2006 (WWW Workshop). Edinburgh, UK.
57. Towards Practical Genre Classification of Web Documents (Poster)
George Ferizis and Peter Bailey
in Proceedings of WWW 2006.
56. Quality and Relevance of Domain-specific Search: A Case Study in Mental Health
Thanh Tang, Nick Craswell, David Hawking, Kathleen Griffiths, and Helen Christensen
Information Retrieval 9(2), pp. 207-225.
55. A Perspective on Web Information Retrieval (Introduction to the Web IR Special Issue)
Massimo Melucci and David Hawking
Information Retrieval 9(2), pp. 207-225.
2005
54. TREC 14 Enterprise Track at CSIRO and ANU
Mingfang Wu, Paul Thomas, and David Hawking
in Proceedings of TREC-2005. Gaithersburg, MD.
53. Recommended Reading for IR
Alistair Moffat, Justin Zobel, and David Hawking (Eds.)
SIGIR Forum 39(2), pp. 3-14.
52. Automated Assessment of the Quality of Depression Websites
Kathleen Griffiths, Thanh Tang, David Hawking, and Helen Christensen
Journal of Medical Internet Research, 2005.
51. Server Selection Methods in Hybrid Portal Search
David Hawking and Paul Thomas
in Proceedings of ACM SIGIR 2005. Salvador, Brazil, pp. 75-82.
50. Context in Enterprise Search and Delivery
David Hawking and Cé
in Proceedings of ACM SIGIR 2005 Workshop on Information Retrieval in Context (IRiX). Salvador, Brazil, pp. 14-16.
49. Focused crawling for both relevance and quality of medical information
Thanh Tang, David Hawking, Nick Craswell, and Kathleen Griffiths
in Proceedings of CIKM'2005. Bremen, Germany, pp. 147-154.
48. The Very Large Collection and Web Tracks
David Hawking and Nick Craswell
in Ellen Voorhees and Donna Harman (Eds.), TREC: Experiment. MIT Press.
2004
47. Focused Crawling in Depression Portal Search: A Feasibility Study
Thanh Tang, David Hawking, Nick Craswell, and Ramesh S. Sankaranarayana
in Proceedings of the Australasian Document Computing Symposium ADCS 2004. Melbourne, Australia, pp. 2-9.
46. Overview of the TREC-2004 Web Track
Nick Craswell and David Hawking
in Proceedings of TREC-2004. Gaithersburg, MD.
45. Performance and Cost Tradeoffs in Web Search
Nick Craswell, Francis Crimmins, David Hawking, and Alistair Moffat
in Proceedings of the Australasian Database Conference ADC2004. Dunedin, New Zealand, pp. 161-170.
44. How Valuable is External Link Evidence when Searching Enterprise Webs?
David Hawking, Nick Craswell, Francis Crimmins, and Trystan Upstill
in Proceedings of the Australasian Database Conference ADC2004. Dunedin, New Zealand, pp. 77-84.
43. Challenges in Enterprise Search
David Hawking
in Proceedings of the Australasian Database Conference ADC2004. Dunedin, New Zealand, pp. 15-26.
42. Towards Better Weighting of Anchors (Poster)
David Hawking, Trystan Upstill, and Nick Craswell
in Proceedings of SIGIR'2004. Sheffield, UK, pp. 512-513.
2003
41. TREC12 Web Track
Nick Craswell, David Hawking, Trystan Upstill, Alistair McLean, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC-2003. Gaithersburg, MD.
40. Overview of the TREC-2003
Nick Craswell, David Hawking, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC-2003. Gaithersburg, MD.
39. Summary of the SIGIR 2003
Ian Soboroff, Ellen Voorhees, and Nick Craswell
SIGIR Forum 37(2), pp. 55-58.
38. Predicting Fame and Fortune: PageRank or Indegree?
Trystan Upstill, Nick Craswell, and David Hawking
in Proceedings of the Australasian Document Computing Symposium, ADCS2003. Canberra, Australia, pp. 31-40.
37. Query-independent evidence in home page finding
Trystan Upstill, Nick Craswell, and David Hawking
ACM Transactions on Information Systems (TOIS) 21(3), pp. 286-313.
36. Result Merging Strategies for a Current News MetaSearcher
Yves Rasolofo, David Hawking, and Jacques Savoy
Information Processing and Management, 2003, pp. 581-609.
35. On Collection Size and Retrieval Effectiveness
David Hawking and Stephen Robertson
Information Retrieval 6(1), pp. 99-150.
34. Engineering a multi-purpose test collection for Web retrieval experiments
Peter Bailey, Nick Craswell, and David Hawking
Information Processing and Management 39(6), pp. 853-871.
33. Very Large Scale Information Retrieval
David Hawking
in Gregory Grefenstette and Steve Rennals (Eds.), Text and Speech Triggered Information Access. Springer.
32. Automated Discovery of Search Interfaces on the Web
Jared Cope, Nick Craswell, and David Hawking
in The Fourteenth Australasian Database Conference. Adelaide, Australia.
31. A Task Oriented Approach to Delivery in Mobile Environments
Francois Paradis, Francis Crimmins, and Nadine Ozkan
in 4th International Conference on Mobile Data Management. Melbourne, Australia.
2002
30. TREC11 Web and Interactive Tracks at CSIRO
Nick Craswell, David Hawking, James Thom, Trystan Upstill, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC-2002. Gaithersburg, MD.
29. Overview of the TREC-2002 Web Track
Nick Craswell and David Hawking
in Proceedings of TREC-2002. Gaithersburg, MD.
28. CSIRO INEX experiments: XML search using PADRE
Anne-Marie Vercoustre, James A. Thom, Alexander Krumpholz, Ian Mathieson, Peter Wilkins, Mingfang Wu, Nick Craswell, and David Hawking
in INEX 2002 Workshop. Dagstuhl, Germany.
27. Buying bestsellers online: A case study in Search and Searchability
Trystan Upstill, Nick Craswell, and David Hawking
in 7th Australasian Document Computing Symposium. Sydney, Australia.
26. XML Document Retrieval with PADRE
Nick Craswell, David Hawking, Alexander Krumpholz, Ian Mathieson, James A. Thom, Anne-Marie Vercoustre, Peter Wilkins, and Mingfang Wu
in 7th Australasian Document Computing Symposium. Sydney, Australia.
25. Enterprise search: What works and what doesn't
David Hawking, Nick Craswell, Francis Crimmins, and Trystan Upstill
in Proceedings of the Infonortics Search Engines Meeting. San Francisco, CA.
2001
24. Measuring search engine quality
David Hawking, Nick Craswell, Peter Bailey, and Kathleen Griffiths
Information Retrieval 4(1), pp. 33-59.
23. Effective site finding using link anchor information
Nick Craswell, David Hawking, and Stephen Robertson
in Proceedings of ACM SIGIR 2001. New Orleans, LA, pp. 250-257.
22. Which search engine is best at finding airline site home pages?
Nick Craswell, David Hawking, and Kathleen Griffiths
Tech Report: 01, CSIRO Mathematical and Information Sciences.
21. Which search engine is best at finding online services?
David Hawking, Nick Craswell, and Kathleen Griffiths
in Poster Proceedings of WWW10. Hong Kong.
20. Overview of the TREC-2001
David Hawking and Nick Craswell
in Proceedings of TREC. Gaithersburg, MD.
19. TREC10 Web and Interactive Tracks at CSIRO
Nick Craswell, David Hawking, Ross Wilkinson, and Mingfang Wu
in Proceedings of TREC. Gaithersburg, MD.
18. Panoptic Expert: Searching for experts not just for documents
Nick Craswell, David Hawking, Anne-Marie Vercoustre, and Peter Wilkins
in Ausweb Poster Proceedings. Coffs Harbour, Australia.
17. Visual Clustering of Image Search Results
Trystan Upstill, Raj Nagappan, and Nick Craswell
in SPIE Visual Data Exploration and Analysis VIII. San Jose, CA.
2000
16. Server Selection on the World Wide Web
Nick Craswell, Peter Bailey, and David Hawking
in Proceedings of the ACM Digital Libraries Conference. San Antonio, TX, pp. 37-46.
15. Overview of TREC-9 Web Track
David Hawking
in Proceedings of TREC. Gaithersburg, MD, pp. 131-150.
14. ACSys/CSIRO TREC-9
David Hawking
in Proceedings of TREC. Gaithersburg, MD.
13. Methods for Distributed Information Retrieval
Nick Craswell
12. Dark matter on the Web
Peter Bailey, Nick Craswell, and David Hawking
in WWW-9 Poster Proceedings. Amsterdam, The Netherlands.
11. An intranet reality check for TREC
David Hawking, Peter Bailey, and Nick Craswell
Tech Report, CSIRO Mathematical and Information Sciences.
10. Chart of darkness: Mapping a large intranet
Peter Bailey, Nick Craswell, and David Hawking
Tech Report, CSIRO Mathematical and Information Sciences.
9. Efficient and flexible search using text and metadata
David Hawking, Peter Bailey, and Nick Craswell
Tech Report, CSIRO Mathematical and Information Sciences.
1999
8. Is it fair to evaluate Web systems using TREC
Nick Craswell, Peter Bailey, and David Hawking
in ACM SIGIR '99 Workshop on Web Retrieval. Berkeley, CA.
7. Merging Results from Isolated Search Engines
Nick Craswell, David Hawking, and Paul Thistlewaite
in Proceedings of the 10th Australasian Database Conference. Auckland, New Zealand, pp. 189-200.
6. Overview of TREC-8 Web track
David Hawking, Ellen Voorhees, Nick Craswell, and Peter Bailey
in Proceedings of TREC-8. Gaithersburg, MD, pp. 131-150.
5. ACSys TREC-8 experiments
David Hawking, Peter Bailey, and Nick Craswell
in Proceedings of TREC-8. Gaithersburg, MD.
4. Results and challenges in Web search evaluation
David Hawking, Nick Craswell, Paul Thistlewaite, and Donna Harman
in Proceedings of WWW8. Toronto, Canada, pp. 1321-1330.
3. Methods for Information Server Selection
David Hawking and Paul Thistlewaite
ACM Transactions on Information Systems. 17(1), pp. 40-76.
2. Scaling Up the TREC (http://dx.doi.org/10.1023/a:1009938405269)
David Hawking, Paul Thistlewaite, and Donna Harman
Information Retrieval 1(1), pp. 115-137.
1. Plans for the TREC-9 Web Track
David Hawking
SIGIR Forum 33(2), pp. 17-18.

