Thematic Sessions for the 1st IJCNLP


One-paragraph descriptions of the accepted Thematic Sessions of The 1st International Joint Conference of Natural Language Processing appear below.


1. Natural Language Learning using Both Labeled and Unlabeled Data

Organizer : Hang Li (Microsoft Research Asia, Beijing)

Recently, a new trend has arisen in the field of Natural Language Processing (NLP): the development of machine learning technologies that use both labeled and unlabeled data for training. Methods that have been proposed under this paradigm include co-training, EM learning, transductive learning, and other semi-supervised learning techniques. For many NLP tasks, existing data are by their nature unlabeled and manually labeling them is prohibitively expensive. Effective utilization of both unlabeled and labeled data in learning is also a challenging but important issue. The goal of this thematic session is to bring together researchers working on this issue from different perspectives, in order to share their latest research results and to discuss future directions. We think that this session will advance research not only in exploiting unlabeled data but also in other natural language learning issues.


2. Natural Language Technology in the Text Processing User Interface

Organizers: Michael Kuehn (Universitaet Koblenz-Landau, Koblenz)
and Kumiko Tanaka-Ishii (University of Tokyo, Tokyo)

The emergence of applications like mobile text processing, communication aids and authoring support require sophisticated methods of text processing under challenging conditions. We invite researchers to discuss language technologies such as (but not restricted to) language modeling, analysis, summarization and disambiguation, in order to assist the user at the text processing front-end.


3. Mobile Information Retrieval

Organizer: Mun-Kew Leong (Institute for Infocomm Research, Singapore)

One of the strongest impacts in recent information technology is the way mobility has changed computer applications. The rapid rate of handphone adoption, the ubiquitous PDA, and the low cost of wireless adoption has created new problems, new challenges, and new opportunities to researchers in many disciplines. One common thread through all these applications is the necessity for information retrieval in one form or another. Another characteristic is the limited screen size of mobile devices and the consequent ramifications on input and output. The use of NLP plays an integral part in creating better user interfaces, better analysis of results for precise display, and greater understanding in the iterative interaction (dialogue) between user and mobile device. We propose this workshop to explore user oriented and theoretical limits and characteristics of NLP and IR within the context of mobile devices.


4. Text mining in Biomedicine

Organizers: Sophia Ananiadou (Salford University, Manchester)
Jong C. Park (Kaist, Daejeon)

With biomedical literature expanding so rapidly, there is an urgent need to discover and organise knowledge extracted from texts. Although factual databases contain crucial information the overwhelming amount of new knowledge remains in textual form (e.g. MEDLINE). In addition, new terms are constantly coined as the relationships linking new genes, drugs, proteins etc. As the size of biomedical literature is expanding, more systems are applying a variety of methods to automate the process of knowledge acquisition and management. These include a variety of techniques such as statistics, machine learning, SVMs, deep or shallow linguistic or domain knowledge etc. Some NLP related topics are challenging in biomedicine such as: dynamic terminology management, named-entity recognition , integration with non-textual resources, discovery of named relationships, populating and updating existing ontologies / taxonomies. The aim of this thematic session is to examine issu es and challenges in the area of biomedical text mining.