18th International Conference on Application of Natural Language to Information Systems (NLDB2013)

19-21 June 2013, University of Salford, MediaCityUK

Conference Programme

Wednesday 19 June 2013


Welcome Message from the Conference Co-Chairs


Official opening by the University of Salford Pro Vice-Chancellor for Research and Innovation, Professor Jim Yip


Keynote Speaker: To be or not be engaged: What is the question (to ask)? Prof Mounia Lalmas, Scientist at Yahoo! Labs Barcelona

Abstract: In the online world, user engagement refers to the quality of the user experience that emphasizes the phenomena associated with wanting to use a web application longer and more frequently. User engagement is a multifaceted, complex phenomenon, giving rise to a number of approaches for its measurement: self-reporting (e.g., questionnaires); observational methods (e.g., facial expression analysis, desktop actions); and web analytics using online behavior metrics. These methods represent various trade-offs between the scale of the data analyzed and the depth of understanding. For instance, surveys are hardly scalable but offer rich, qualitative insights, whereas click data can be collected on a large scale but are more difficult to analyze. Still, the core research questions each type of measurement is able to answer are unclear. This talk will present various efforts aiming at combining approaches to measure engagement and seeking to provide insights into what questions to ask when measuring engagement. (This is work done in collaboration with Janette Lehmann, David Warnock, Ioannis Arapakis, Georges Dupret and many others at Yahoo! Labs.)

Biography: Mounia Lalmas is a scientist at Yahoo! Labs Barcelona, which she joined in January 2011. Prior to this, she held a Microsoft Research/RAEng Research Chair at the School of Computing Science, University of Glasgow. Before that, she was Professor of Information Retrieval at the Department of Computer Science at Queen Mary, University of London, which she joined in 1999 as a lecturer (aka assistant professor). From 2002 until 2007, she co-led the Evaluation Initiative for XML Retrieval (INEX), a large-scale project with over 80 participating organizations worldwide, which was responsible for defining the nature of XML retrieval, and how it should be evaluated. Her current research focuses on three main areas: user engagement, social media and aggregated search. 


Session 1 (Long Papers): Semantic Web

SQUALL: a Controlled Natural Language as Expressive as SPARQL 1.1, Sébastien Ferré

Cross-Lingual Natural Language Querying over the Web of Data, Nitish Aggarwal, Tamara Polajnar, Paul Buitelaar

A Broadly Applicable and Flexible Conceptual Metagrammar as a Basic Tool for Developing a Multilingual Semantic Web, Vladimir Fomichov







Session 2 (Short Papers): NLP Applications

Using Grammar-Profiles to Intrinsically Expose Plagiarism in Text Documents, Michael Tschuggnall and Günther Specht

Experiments in Producing Playful “Explanations” for Given Names (Anthroponyms) in Hebrew and English, Y. HaCohen-Kerner et al.

Ranking Search Intents Underlying a Query, Yunqing Xi et al.

MOSAIC: A Cohesive Method for Orchestrating Discrete Analytics in a Distributed Model, Ransom Winder, J. Jubinski, John Prange, Nathan Giles

Session 3 (Short Papers): Semantic Resources

An RDF-Based Semantic Index, F. Amato et al.

Collaborative enrichment of electronic dictionaries standardized-LMF, A. Khemakhem, B. Gargouri and A. Ben Hamadou

Entangled Semantics, Diana Tanase and Epaminondas Kapetanios

Towards the Refinement of the Arabic Soundex, Nedjma Djouhra Ousidhoum, Nacéra Bensaou



Session 4 (Short Papers): Text Classification

Text Classification of Technical Papers Based on Text Segmentation, Thien Hai Nguyen and Kiyoaki Shirai

GenDesc: A Partial Generalization of Linguistic Features for Text Classification, Guillaume Tisserant, Violaine Prince, Mathieu Roche

Session 5 (Short Papers): Knowledge Discovery

Feature Selection Methods in Persian Sentiment Analysis, Mohamad Saraee, Ayoub Bagheri

Linguistic Sentiment Features for Newspaper Opinion Mining, Thomas Scholz and Stefan Conrad

Enhancing machine learning results for semantic relation extraction, Ines Boujelben, Salma Jamoussi, Abdelmajid Ben Hamadou


Thursday 20 June 2013


Invited Speaker: Alex Poulovassilis: Flexible Querying of Heterogeneous Graph Data

Abstract: Large volumes of graph-structured data are becoming increasingly prevalent in a broad range of application domains. Graph-structured data differs from other data in its greater focus on the relationships between entities, regarding these relationships as being as important as the entities themselves. Query formulation and evaluation techniques are needed that assist users in querying complex, dynamic or irregular graph data, where users may lack full knowledge of its structure.

This talk will discuss the use of query approximation and query relaxation techniques within one integrated framework that aims to support flexible querying of heterogeneous graph data, e.g. RDF linked data. Our framework incorporates standard notions of query approximation based on edit distance and of query relaxation based on RDFS inference rules. The query language we adopt comprises conjunctions of regular path queries, thus covering extensions proposed for SPARQL to allow for querying paths using regular expressions. The talk will discuss theoretical foundations, algorithms, prototype implementations, results from performance studies, and ongoing work.

Biography: Alexandra Poulovassilis is Professor of Computer Science at Birkbeck and Co-Director of the London Knowledge Lab, a multidisciplinary research lab of Birkbeck and the Institute of Education that aims to explore the ways in which digital technologies and new media are shaping the future of knowledge and learning. Her research is motivated by the increasing volumes and varieties of digital information available, and the need to support users in finding the right information for their purposes and in combining and reconciling information from different sources. Her current research is in techniques for querying, integrating, visualising and personalising information, particularly as arising in learning environments, science and social science.  


Session 6 (Long Papers): Opinion Mining

Extraction of Statements in News for a Media Response Analysis, Thomas Scholz and Stefan Conrad

Sentiment-based Ranking of Blog Posts using Rhetorical Structure Theory, Jose M. Chenlo, A. Hogenboom and David E. Losada

An Unsupervised Aspect Detection Model for Sentiment Analysis of Reviews, Ayoub Bagheri, Mohamad Saraee, Franciska de Jong

Bayesian Model Averaging and model selection for polarity classification, Federico A. Pozzi, E. Fersini, and Enza Messina

Session 7 (Long Papers): Arabic NLP

An Approach for Extracting and Disambiguating Arabic Persons' Names using Clustered Dictionaries and Scored Patterns, Omnia Zayed, Samhaa El-Beltagy, and Osama Haggag

Person Name Recognition Using the Hybrid Approach, Mai Oudah and Khaled Shaalan

A Hybrid Approach for Arabic Diacritization, Ahmed Said, Mohamed El-Sharqwi, Achraf Chalabi, and Eslam Kamal






Session 8 (Long Papers): Text Summarization

Evaluating Syntactic Sentence Compression for Text Summarisation, Prasad Perera and Leila Kosseim

Exploring domain-sensitive features for extractive summarization in the medical domain, Dat Tien Nguyen and Johannes Leveling


NLDB 2013 Poster Presentations


Friday 21 June 2013



Session 9 (Long Papers): Dialog Systems

An OpenCCG-based Approach to Question Generation from Concepts, Markus M. Berg, Amy Isard, and Johanna D. Moore

EDU-Based Similarity for Paraphrase Identification, Ngo Xuan Bach, Nguyen Le Minh, Akira Shimazu

Exploiting Query Logs and Field-Based Models to Address Term Mismatch in an HIV/AIDS FAQ Retrieval System, Edwin Thuma, Simon Rogers, and Iadh Ounis


Session 10 (Long Papers): Natural Language Processing and Applications

A corpus-based approach for the induction of ontology lexica, Sebastian Walter, Christina Unger, and Philipp Cimiano

Unsupervised Medical Subject Heading Assignment Using Output Label Co-Occurrence Statistics and Semantic Predications, Ramakanth Kavuluru and Zhenghao He





