Current projects
- The Ergonomics of Electronic Patient Records: an interdisciplinary
development of methodologies for understanding and exploiting free
text to enhance the utility of primary care electronic patient records
(Wellcome Trust).
- Ranking Word Senses for Disambiguation: Models and Applications is concerned with
developing ways of estimating the frequency distributions of senses
of words from raw (unannotated) text (EPSRC).
- Part of the DELPH-IN
(Deep Linguistic Processing with HPSG) collaboration, and affiliated
with the LOGON machine translation
project in Norway.
Past projects
- COGENT: Controlled Generation of Text is investigating
wide-coverage generation and developing reflective
techniques for controlling it effectively. As well as furthering the
understanding of wide-coverage generation, the project will deliver a
substantial and novel resource to support future research in this area,
and practical implementations of wide-coverage controllable
generators (EPSRC).
- MEANING - Developing Multilingual Web-scale Language
Technologies: collecting and analysing language data from the WWW on
a large scale, building more comprehensive multilingual lexical
knowledge bases to support improved word sense disambiguation (EU 5th
Framework).
- DEEP THOUGHT - Hybrid Deep and Shallow Methods for
Knowledge-Intensive Information Extraction is concerned with
devising methods for combining robust shallow methods for language
analysis with deep semantic processing. The approach will be
demonstrated in business intelligence, automated email processing and
document production support applications (EU 5th
Framework).
- Robust Accurate Statistical Parsing (RASP): integrating and extending several strands of
research on robust statistical parsing and automated grammar and lexicon
induction, to produce a new parsing toolkit (EPSRC).
- PSET: Practical Simplification of English Text: building a computer system
which takes in English newspaper text from the WWW and outputs a
simplified version with broadly similar meaning (with, for example,
uncommon or unusual words replaced with more common or familiar synonyms,
and difficult-to-follow syntactic constructs replaced with simpler ones);
the system will be evaluated with people who have aphasia, which
impairs their comprehension of written English (EPSRC).
- LEXSYS: Analysis of Naturally-occurring English Text with Stochastic
Lexicalized Grammars: developing a robust wide-coverage parsing system for
English text, exploiting a combination of: statistical techniques
involving online corpora; inheritance hierarchies for imposing structure
on NLP data; and lexicalised grammars (EPSRC).
- SPARKLE (Shallow PARsing and Knowledge extraction for Language
Engineering): developing shallow parsing technology in four European
languages together with corpus-based lexical acquisition techniques,
and deploying parsers in multilingual information retrieval and speech
dialogue systems (EU 4th Framework).
- ILD (Integrated Language Database): producing a prototype system for
rapid and efficient development of multilingual language dictionaries from
corpus data (DTI/EPSRC under the SALT programme, at Cambridge University).
Workshops
- Cross-Framework and Cross-Domain Parser Evaluation: Workshop at COLING 2008, Manchester, UK.
- Grammar Engineering and Evaluation: Workshop at COLING 2002, Taipei, Taiwan.
- Beyond PARSEVAL - Towards Improved Evaluation Measures for Parsing Systems: Workshop
at the 3rd International Conference on Language Resources and
Evaluation, Las Palmas, Canary Islands, 2002.
Proceedings available online.
- Les dictionnaires électroniques: un élément central dans le traitement des langues
(Electronic Dictionaries: a Central Component in NLP): ATALA Workshop,
Paris, 2002.
- Efficiency in Large-scale Parsing Systems: Workshop at COLING 2000, Luxembourg,
August 2000. Proceedings now available in printed form and online
as a technical report. An outcome of the workshop is a methodology
for comparing parser efficiency.
- The Sixth International Workshop on Parsing Technologies, Trento, Italy,
February 2000. Some papers are available online.
- The Evaluation of Parsing Systems: Workshop at the
1st International Conference on Language Resources and Evaluation,
Granada, Spain, May 1998.
Proceedings now available (in printed form only) as a technical report.
- Robust Parsing Workshop at ESSLLI'96.
Proceedings now available (in printed form only) as a technical report.