Post-doc Researcher @ L3S Research Center
|Current Affiliation:||L3S Research Center.|
|Leibniz University, Hannover|
|Address:||L3S Research Center, Appelstraße 9a, Hannover.|
|Phone:||+49 0511 762 17795|
|Fax:||+49 0511 762 17779|
|Email:||anand (at) l3s . de|
My research focusses on retrieval, mining, analysis and data management aspects of temporal Web collections like Web archives, Wikipedia and news collections. You can find my publications here . I am broadly interested in :
Wikipedia entity pages are a valuable source of information for direct consumption and for knowledge-base construction, update and maintenance. Facts in these entity pages are typically supported by references. Our studies show that as much as 20% of the references are from online news sources. However, many entity pages are incomplete even if relevant information is already available in existing news articles. Even for the already present references, there is often a delay between the news article publication time and the reference time. In this work, we look at Wikipedia through the lens of news and investigate approaches to enrich Wikipedia with potentially missing facts, citations, etc.
HistDiv - Historical Search
Longitudinal corpora like newspaper archives are of immense value to historical research, and time as an important factor for historians strongly influences their search behaviour in these archives. While searching for articles published over time, a key preference is to retrieve documents which cover the important aspects from important points in time which is different from standard search behavior. We are developing HistDiv which is a search and exploration system for searching historical news collections.
Tempas - Tag-based Archive Exploration
Tempas is a search engine incorporating social tags in order to enable richer search capabilities on archived Web sites than currently available. Tempas is based on tags posted on Delicious, which describe a website at a very specific time. This temporal information allows to search an archive for the desired version of a website in a given time period and is percieved as an improvement over accessing an archive by providing the exact URL and time of a website's version, like most Web archives only allow today.
Temporal Indexing and Query Processing
For realizing efficient access to longitudinal collections, keyword queries are extended by temporal predicates giving temporal queries of the form -- "alexandria @ [2008-2011]". This line of work explores indexing and query processing methods for efficient temporal querying, index maintenance and approximate query processing.
L3S Research Center, Hannover.
Department of Databases and Information Systems, Max Planck Institute for Informatics, Saarbruecken.
Saarland University and Max Planck Institute for Informatics, Saarbruecken.
Microsoft, India Development Center.
Indian Institute of Information Technology, Allahabad.