Contact

Marijn Koolen
Archives & Information Studies
Room 2.13b
Turfdraagsterpad 9
1012XT Amsterdam
The Netherlands

Phone: +31 20 525 2295
E-mail: m.h.a.koolen@uva.nl

Marijn Koolen

About me

I am an assistant professor in the Archives & Information Studies group at the Faculty of Humanities of the University of Amsterdam. I also hold a postdoc position on the README project. I work mainly in the field of information retrieval.

As a Ph.D. student, I worked on the MuSeUM project, a collaboration between the UvA and the Haags Gemeentemuseum. My thesis supervisor was Jaap Kamps.

My Ph.D. thesis is about using link information for information retrieval. A digital copy of my thesis can be found here: The Meaning of Structure: the Value of Link Evidence for Information Retrieval.

Education

I hold an M.Sc. degree (2005) in Artificial Intelligence from the University of Amsterdam. My master's thesis is on Constructing Language Resources for Historic Document Retrieval.

Research Interests

My research interests lie mainly in the fields of information retrieval and information science:

  • Web & Wikipedia retrieval, including
    • link structure analysis, result diversity, entity ranking, search log analysis, clickthrough analysis
  • Book search, literature search & cultural heritage IR
    • Social book search, book navigation, access methods, metadata, free-text and controlled vocabulary
  • Structured document retrieval and XML IR
  • Focused retrieval
  • Evaluation, measures & test-collection building
  • Web 2.0

Activities

Organiser of the workshop Something on Search and Searchers.

Co-organiser of the INEX Book Track of 2009-2012.

Programme committee member for

  • CIKM 2011
  • DIR 2011-2012
  • ECIR 2012
  • INEX 2008-2011
  • SIGIR 2011-2012
  • WSDM 2010-2012

Student mentor for ECIR 2011.

Research internship with Microsoft Research Cambridge in 2008.

I have been an active participant in several INEX Tracks since 2006, the TREC 2007 Legal Track and the TREC Web Track of 2009-2010.

Publications

2011

[51] Frans Adriaans, Marijn Koolen, and Jaap Kamps. University of Amsterdam at INEX 2011: Book and data centric tracks. In Shlomo Geva, Jaap Kamps, and Ralf Schenkel, editors, INEX 2011 Workshop Pre-proceedings, pages 36-48, 2011. [ bib | .pdf ]
[50] Gabriella Kazai, Marijn Koolen, Jaap Kamps, Antoine Doucet, and Monica Landoni. Overview of the INEX 2011 book track. In Shlomo Geva, Jaap Kamps, and Ralf Schenkel, editors, INEX 2011 Workshop Pre-proceedings, pages 11-35, 2011. [ bib | .pdf ]
[49] Marijn Koolen and Jaap Kamps. University of Amsterdam at the TREC 2011 web track. In The Twentieth Text REtrieval Conference (TREC 2011) Notebook. National Institute of Standards and Technology, 2011. [ bib | .pdf ]
[48] Gabriella Kazai, Jaap Kamps, Marijn Koolen, and Natasa Milic-Frayling. Crowdsourcing for book search evaluation: Impact of quality on comparative system ranking. In Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, New York NY, 2011. [ bib ]
[47] Marijn Koolen. The Meaning of Structure: the Value of Link Evidence for Information Retrieval. SIGIR Forum, 45(1), June 2011. [ bib | .pdf ]
[46] David Alexander, Paavo Arvola, Thomas Beckers, Patrice Bellot, Timothy Chappell, Christopher M. De Vries, Antoine Doucet, Norbert Fuhr, Shlomo Geva, Jaap Kamps, Gabriella Kazai, Marijn Koolen, Sangeetha Kutty, Monica Landoni, Véronique Moriceau, Richi Nayak, Ragnar Nordlie, Nils Pharo, Eric SanJuan, Ralf Schenkel, Andrea Tagarelli, Xavier Tannier, James A. Thom, Andrew Trotman, Johanna Vainio, Qiuyue Wang, and Chen Wu. Report on INEX 2010. SIGIR Forum, 45(1), June 2011. [ bib | .pdf ]
[45] Marijn Koolen. The Meaning of Structure: the Value of Link Evidence for Information Retrieval. Ph.D. thesis, University of Amsterdam, The Netherlands, 2011. 224 Pages. IR Publications, Amsterdam. ISBN 978-90-814485-5-0. [ bib | .pdf ]
[44] Jaap Kamps, Rianne Kaptein, and Marijn Koolen. Using anchor text, spam filtering and Wikipedia for web search and entity ranking. In Ellen M. Voorhees and Lori P. Buckland, editors, The Nineteenth Text REtrieval Conference Proceedings (TREC 2010). National Institute of Standards and Technology, 2011. [ bib | .pdf ]
[43] Marijn Koolen and Jaap Kamps. Are semantically related links effective for retrieval? In Advances in Information Retrieval: 33rd European Conference on IR Research (ECIR 2011), 2011. [ bib ]
[42] Marijn Koolen and Jaap Kamps. What is the importance of anchor text for ad hoc search? In Proceedings of the 11th Dutch-Belgian Information Retrieval Workshop (DIR 2011), pages 56-57. University of Amsterdam, 2011. [ bib | .pdf ]

2010

[41] Gabriella Kazai, Marijn Koolen, Antoine Doucet, and Monica Landoni. Overview of the INEX 2010 Book Track: At the Mercy of Crowdsourcing. In Shlomo Geva, Jaap Kamps, Ralf Schenkel, and Andrew Trotman, editors, INEX 2010 Workshop Pre-proceedings, pages 89-99, 2010. [ bib | .pdf ]
[40] Marijn Koolen and Jaap Kamps. University of Amsterdam at INEX 2010: Ad hoc and book tracks. In Shlomo Geva, Jaap Kamps, Ralf Schenkel, and Andrew Trotman, editors, INEX 2010 Workshop Pre-proceedings, pages 107-115, 2010. [ bib | .pdf ]
[39] Jaap Kamps, Rianne Kaptein, and Marijn Koolen. Using anchor text, spam filtering and Wikipedia for web search and entity ranking. In The Nineteenth Text REtrieval Conference (TREC 2010) Notebook. National Institute of Standards and Technology, 2010. [ bib | .pdf ]
[38] Marijn Koolen and Jaap Kamps. The importance of anchor text for ad hoc search revisited. In Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, New York NY, USA, 2010. [ bib | .pdf ]
[37] Marijn Koolen and Jaap Kamps. The impact of collection size on relevance and diversity. In Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, New York NY, USA, 2010. [ bib | .pdf ]
[36] Thomas Beckers, Patrice Bellot, Gianluca Demartini, Ludovic Denoyer, Christopher M. De Vries, Antoine Doucet, Khairun Nisa Fachry, Norbert Fuhr, Patrick Gallinari, Shlomo Geva, Wei-Che Huang, Tereza Iofciu, Jaap Kamps, Gabriella Kazai, Marijn Koolen, Sangeetha Kutty, Monica Landoni, Miro Lehtonen, Véronique Moriceau, Richi Nayak, Ragnar Nordlie, Nils Pharo, Eric SanJuan, Ralf Schenkel, Xavier Tannier, Martin Theobald, James A. Thom, Andrew Trotman, and Arjen P. de Vries. Report on INEX 2009. SIGIR Forum, 44(1), June 2010. [ bib | .pdf ]
[35] Gabriella Kazai, Antoine Doucet, Marijn Koolen, and Monica Landoni. Overview of the INEX 2009 Book Track. In Shlomo Geva, Jaap Kamps, and Andrew Trotman, editors, Focused Retrieval and Evaluation: 8th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2009), LNCS, 2010. [ bib | .pdf ]
[34] Marijn Koolen, Rianne Kaptein, and Jaap Kamps. Focused search in books and Wikipedia: Categories, links and relevance feedback. In Shlomo Geva, Jaap Kamps, and Andrew Trotman, editors, Focused Retrieval and Evaluation: 8th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2009), LNCS, 2010. [ bib | .pdf ]
[33] Marijn Koolen and Jaap Kamps. Searching cultural heritage data: Does structure help expert searchers? In Proceedings of RIAO 2010: Adaption, personalization and fusion of heterogeneous information, 2010. [ bib | .pdf ]
[32] Rianne Kaptein, Marijn Koolen, and Jaap Kamps. Result diversity and entity ranking experiments: Text, anchors, links, and wikipedia. In Ellen M. Voorhees and Lori P. Buckland, editors, The Eighteenth Text REtrieval Conference Proceedings (TREC 2009). National Institute of Standards and Technology. NIST Special Publication, 2010. [ bib | .pdf ]
[31] Jaap Kamps and Marijn Koolen. How different are Wikipedia and Web link structure? In Maarten van der Heijden, Max Hinne, Wessel Kraaij, Maria van Kuppeveld, Suzan Verberne, and Theo van der Weide, editors, Proceedings of the 10th Dutch-Belgian Information Retrieval Workshop (DIR 2010), pages 78-79. Radboud Universiteit Nijmegen, 2010. [ bib | .pdf ]

2009

[30] Marijn Koolen, Rianne Kaptein, and Jaap Kamps. University of Amsterdam at INEX 2009: Ad hoc, book, and entity ranking tracks. In Shlomo Geva, Jaap Kamps, and Andrew Trotman, editors, INEX 2009 Workshop Pre-proceedings, pages 260-272, 2009. [ bib | .pdf ]
[29] Gabriella Kazai, Antoine Doucet, Marijn Koolen, and Monica Landoni. Overview of the INEX 2009 Book Track. In Shlomo Geva, Jaap Kamps, and Andrew Trotman, editors, INEX 2009 Workshop Pre-proceedings, pages 120-132, 2009. [ bib | .pdf ]
[28] Rianne Kaptein, Marijn Koolen, and Jaap Kamps. Experiments with result diversity and entity ranking: Text, anchors, links, and wikipedia. In The Eighteenth Text REtrieval Conference (TREC 2009) Notebook. National Institute of Standards and Technology, 2009. [ bib | .pdf ]
[27] Marijn Koolen and Jaap Kamps. What's in a link? From document importance to topical relevance. In Leif Azzopardi, Gabriella Kazai, Stephen Robertson, Stefan Rüger, Milad Shokouhi, Dawei Song, and Emine Yilmaz, editors, Proceedings of the 2nd International Conference on the Theory of Information Retrieval (ICTIR 2009), volume 5766 of LNCS, pages 313-321. Springer Verlag, Berlin, Heidelberg, 2009. [ bib | .pdf ]
[26] Rianne Kaptein, Marijn Koolen, and Jaap Kamps. Using Wikipedia categories for ad hoc search. In James Allan, Javed A. Aslam, Mark Sanderson, ChengXiang Zhai, and Justin Zobel, editors, Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 824-825. ACM Press, New York NY, USA, 2009. [ bib | .pdf ]
[25] Jaap Kamps and Marijn Koolen. The impact of document level ranking on focused retrieval. In Shlomo Geva, Jaap Kamps, and Andrew Trotman, editors, Advances in Focused Retrieval: 7th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2008), volume 5631 of LNCS, pages 140-151. Springer Verlag, Berlin, Heidelberg, 2009. [ bib | .pdf ]
[24] Jaap Kamps, Shlomo Geva, Andrew Trotman, Alan Woodley, and Marijn Koolen. Overview of the INEX 2008 ad hoc track. In Shlomo Geva, Jaap Kamps, and Andrew Trotman, editors, Advances in Focused Retrieval: 7th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2008), volume 5631 of LNCS, pages 1-28. Springer Verlag, Berlin, Heidelberg, 2009. [ bib | .pdf ]
[23] Gianluca Demartini, Ludovic Denoyer, Antoine Doucet, Khairun Nisa Fachry, Patrick Gallinari, Shlomo Geva, Wei-Che Huang, Tereza Iofciu, Jaap Kamps, Gabriella Kazai, Marijn Koolen, Monica Landoni, Ragnar Nordlie, Nils Pharo, Ralf Schenkel, Martin Theobald, Andrew Trotman, Arjen P. de Vries, Alan Woodley, and Jianhan Zhu. Report on INEX 2008. SIGIR Forum, 43(1):17-36, June 2009. [ bib | .pdf ]
[22] Marijn Koolen, Jaap Kamps, and Vincent de Keijzer. Information retrieval in cultural heritage. Interdisciplinary Science Reviews, 34:268-284, 2009. [ bib | .pdf ]
[21] Jaap Kamps, Marijn Koolen, and Andrew Trotman. Comparative analysis of clicks and judgments for IR evaluation. In Proceedings of the Workshop on Web Search Click Data (WSCD 2009), pages 80-87. ACM Press, New York NY, USA, 2009. [ bib | .pdf ]
[20] Marijn Koolen, Gabriella Kazai, and Nick Craswell. Wikipedia Pages as Entry Points for Book Search. In Proceedings of the Second ACM International Conference on Web Search and Data Mining (WSDM 2009), pages 44-53. ACM Press, New York NY, USA, 2009. [ bib | .pdf ]
[19] Jaap Kamps and Marijn Koolen. Is Wikipedia link structure different? In Proceedings of the Second ACM International Conference on Web Search and Data Mining (WSDM 2009), pages 232-241. ACM Press, New York NY, USA, 2009. [ bib | .pdf ]

2008

[18] Khairun Nisa Fachry, Jaap Kamps, Rianne Kaptein, Marijn Koolen, and Junte Zhang. The University of Amsterdam at INEX 2008: Ad hoc, book, entity ranking, interactive, link the wiki, and XML mining tracks. In INEX 2008 Workshop Pre-proceedings, pages 66-92, 2008. [ bib | .pdf ]
[17] Jaap Kamps, Shlomo Geva, Andrew Trotman, Alan Woodley, and Marijn Koolen. Overview of the INEX 2008 ad hoc track. In INEX 2008 Workshop Pre-proceedings, pages 1-28, 2008. [ bib | .pdf ]
[16] Jaap Kamps and Marijn Koolen. The importance of link evidence in Wikipedia (extended abstract). In Anton Nijholt, Maja Pantic, Mannes Poel, and Hendri Hondorp, editors, Proceedings of BNAIC 2008, the twentieth Belgian-Dutch Conference on Artificial Intelligence, pages 319-320. Universiteit Twente, Enschede, 2008. [ bib | .pdf ]
[15] Jaap Kamps, Marijn Koolen, and Mounia Lalmas. Locating relevant text within XML documents. In Sung-Hyon Myaeng, Douglas W. Oard, Fabrizio Sebastiani, Tat-Seng Chua, and Mun-Kew Leong, editors, Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 847-849. ACM Press, New York NY, USA, 2008. [ bib | .pdf ]
[14] Khairun Nisa Fachry, Jaap Kamps, Marijn Koolen, and Junte Zhang. Using and detecting links in Wikipedia. In Norbert Fuhr, Mounia Lalmas, Andrew Trotman, and Jaap Kamps, editors, Focused access to XML documents: 6th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2007), volume 4862 of Lecture Notes in Computer Science, pages 388-403. Springer Verlag, Heidelberg, 2008. [ bib | .pdf ]
[13] Avi Arampatzis, Jaap Kamps, Marijn Koolen, and Nir Nussbaum. Access to legal documents: Exact match, best match, and combinations. In Ellen M. Voorhees and Lori P. Buckland, editors, The Sixteenth Text REtrieval Conference Proceedings (TREC 2007). National Institute of Standards and Technology. NIST Special Publication 500-274, 2008. [ bib | .pdf ]
[12] Jaap Kamps and Marijn Koolen. The importance of link evidence in Wikipedia. In Craig Macdonald, Iadh Ounis, Vassilis Plachouras, Ian Ruthven, and Ryen W. White, editors, Advances in Information Retrieval: 30th European Conference on IR Research (ECIR 2008), volume 4956 of Lecture Notes in Computer Science, pages 270-282. Springer Verlag, Heidelberg, 2008. [ bib | .pdf ]

2007

[11] Khairun Nisa Fachry, Jaap Kamps, Marijn Koolen, and Junte Zhang. The University of Amsterdam at INEX 2007. In Norbert Fuhr, Mounia Lalmas, and Andrew Trotman, editors, Pre-Proceedings of INEX 2007, pages 388-402, 2007. [ bib | .pdf ]
[10] Avi Arampatzis, Jaap Kamps, Marijn Koolen, and Nir Nussbaum. University of Amsterdam at the TREC 2007 legal track. In The Sixteenth Text REtrieval Conference (TREC 2007) Notebook, pages 623-625. National Institute of Standards and Technology, 2007. [ bib | .pdf ]
[9] Jaap Kamps and Marijn Koolen. On the relation between relevant passages and XML document structure. In Andrew Trotman, Shlomo Geva, and Jaap Kamps, editors, SIGIR 2007 Workshop on Focused Retrieval, pages 28-32. University of Otago, Dunedin New Zealand, 2007. [ bib | .pdf ]
[8] Avi Arampatzis, Jaap Kamps, Marijn Koolen, and Nir Nussbaum. Deriving a domain specific test collection from a query log. In Antal van den Bosch, Claire Grover, and Caroline Sporleder, editors, Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007), pages 73-80. Association for Computational Linguistics, 2007. [ bib | .pdf ]
[7] Jaap Kamps, Marijn Koolen, and Mounia Lalmas. Where to start reading a textual XML document? In Charles L. A. Clarke, Norbert Fuhr, Noriko Kando, Wessel Kraaij, and Arjen P. de Vries, editors, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 749-750. ACM Press, New York NY, USA, 2007. [ bib | .pdf ]
[6] Marijn Koolen, Avi Arampatzis, Jaap Kamps, Vincent de Keijzer, and Nir Nussbaum. Unified access to heterogeneous data in cultural heritage. In Proceedings of RIAO 2007: Large-Scale Semantic Access to Content (Text, Image, Video and Sound), 2007. [ bib | .pdf ]
[5] Jaap Kamps, Marijn Koolen, and Börkur Sigurbjörnsson. Filtering and clustering XML retrieval results. In Norbert Fuhr, Mounia Lalmas, and Andrew Trotman, editors, Comparative Evaluation of XML Information Retrieval Systems: Fifth Workshop of the INitiative for the Evaluation of XML Retrieval (INEX 2006), volume 4518 of Lecture Notes in Computer Science, pages 121-136. Springer Verlag, Heidelberg, 2007. [ bib | .pdf ]
[4] Avi Arampatzis, Jaap Kamps, Marijn Koolen, and Nir Nussbaum. MuSeUM: Unified access to the state of the art. In Proceedings of the Seventh Dutch-Belgian Workshop on Information Retrieval (DIR 2007), 2007. [ bib | .pdf ]

2006

[3] Jaap Kamps, Marijn Koolen, and Börkur Sigurbjörnsson. The University of Amsterdam at INEX 2006. In Norbert Fuhr, Mounia Lalmas, and Andrew Trotman, editors, INEX 2006 Workshop Pre-Proceedings, pages 88-99, 2006. [ bib | .pdf ]
[2] Marijn Koolen, Frans Adriaans, Jaap Kamps, and Maarten de Rijke. A cross-language approach to historic document retrieval. In Joost Broekens, Tim Cocx, and Walter A. Kosters, editors, Proceedings of the 18th Belgian-Dutch Conference on Artificial Intelligence (BNAIC 2006), 2006. [ bib | .pdf ]
[1] Marijn Koolen, Frans Adriaans, Jaap Kamps, and Maarten de Rijke. A cross-language approach to historic document retrieval. In Mounia Lalmas, Stefan M. Rüger, Theodora Tsikrika, and Alexei Yavlinsky, editors, Advances in Information Retrieval: 28th European Conference on IR Research (ECIR 2006), volume 3936 of Lecture Notes in Computer Science, pages 407-419. Springer Verlag, Heidelberg, 2006. [ bib | .pdf ]