|Current Affiliation:||Leibniz University, Hannover.|
|L3S Research Center, Hannover|
|Address:||L3S Research Center, Appelstraße 4, Hannover.|
|Phone:||+49 0511 762 17795|
|Fax:||+49 0511 762 17779|
|Email:||anand (at) l3s . de|
My research broadly falls in the intersection of information retrieval and text mining. Specifically, I worked on designing algorithms to improve search and enrichment on temporal collections.
Recently, I am also interested in interpretability of retrieval models. That is, how can we better understand the rationale behind predictions of a black-box retrieval model ?
A list of my research interests are as below:
Wikipedia entity pages are a valuable source of information for direct consumption and for knowledge-base construction, update and maintenance. Facts in these entity pages are typically supported by references. Our studies show that as much as 20% of the references are from online news sources. However, many entity pages are incomplete even if relevant information is already available in existing news articles. Even for the already present references, there is often a delay between the news article publication time and the reference time. In this work, we look at Wikipedia through the lens of news and investigate approaches to enrich Wikipedia with potentially missing facts, citations, etc.
HistDiv - Historical Search
Longitudinal corpora like newspaper archives are of immense value to historical research, and time as an important factor for historians strongly influences their search behaviour in these archives. While searching for articles published over time, a key preference is to retrieve documents which cover the important aspects from important points in time which is different from standard search behavior. We are developing HistDiv which is a search and exploration system for searching historical news collections.
Tempas - Tag-based Archive Exploration
Tempas is a search engine incorporating social tags in order to enable richer search capabilities on archived Web sites than currently available. Tempas is based on tags posted on Delicious, which describe a website at a very specific time. This temporal information allows to search an archive for the desired version of a website in a given time period and is percieved as an improvement over accessing an archive by providing the exact URL and time of a website's version, like most Web archives only allow today.
Leibniz University, Hannover.
L3S Research Center, Hannover.
Department of Databases and Information Systems, Max Planck Institute for Informatics, Saarbruecken.
Saarland University and Max Planck Institute for Informatics, Saarbruecken.
Microsoft, India Development Center.
Indian Institute of Information Technology, Allahabad.