Robust Accurate Statistical Parsing (RASP)


Participants


Project Description

On this project we are integrating and extending several strands of research on robust statistical parsing and automated grammar and lexicon induction, in order to develop and distribute a new parsing toolkit that significantly extends the current state-of-the-art. The system forms the focus of further research on:

The project started 1 July 2001 (Sussex) / 1 October 2001 (Cambridge), is funded by the UK EPSRC, and is of 3 years duration.

The toolkit is now publicly available, via the RASP System webpage.


Selected Project Publications

2004

Korhonen, A. and E. Briscoe (2004) Extended lexical-semantic classification of English verbs. In Proceedings of the HLT/NAACL'04 Workshop on Computational Lexical Semantics, Boston, MA. 38-45.

McCarthy, D., R. Koeling, J. Weeds and J. Carroll (2004) Finding predominant senses in untagged text. In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain. 280-287. (Received the Best Paper Award).

McCarthy, D., R. Koeling, J. Weeds and J. Carroll (2004) Using automatically acquired predominant senses for word sense disambiguation. In Proceedings of the SENSEVAL-3 Workshop at ACL'04, Barcelona, Spain. 151-154.

McCarthy, D., R. Koeling, J. Weeds and J. Carroll (2004) Automatic identification of infrequent word senses. In Proceedings of the 20th International Conference on Computational Linguistics (COLING), Geneva, Switzerland. 1220-1226.

McLauchlan, M. (2004) Thesauruses for prepositional phrase attachment. In Proceedings of the Eighth Conference on Natural Language Learning (CoNLL-2004), Boston, MA. 73-80.

Preiss, J. (2004) Probabilistic word sense disambiguation. Journal of Computer Speech and Language, 18(3):319-337.

Preiss, J. and A. Korhonen (2004) WSD for subcategorization acquisition task description. In Proceedings of the SENSEVAL-3 Workshop at ACL'04, Barcelona, Spain. 33-36.

Weeds, J., D. Weir and D. McCarthy (2004) Characterising measures of lexical distributional similarity. In Proceedings of the 20th International Conference on Computational Linguistics (COLING), Geneva, Switzerland. 1015-1021.

2003

Korhonen, A., Y. Krymolowski and Z. Marx (2003) Clustering polysemic subcategorization frame distributions semantically. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, Japan. 64-71.

Korhonen, A. and J. Preiss (2003) Improving subcategorization acquisition using word sense disambiguation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, Japan. 48-55.

McCarthy, D. and J. Carroll (2003) Disambiguating nouns, verbs and adjectives using automatically acquired selectional preferences. Computational Linguistics, 29(4). 639-654.

McCarthy, D., B. Keller and J. Carroll (2003) Detecting a continuum of compositionality in phrasal verbs. In Proceedings of the ACL-SIGLEX Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, Sapporo, Japan. 73-80.

Preiss, J. (2003) Using grammatical relations to compare parsers. In Proceedings of the Tenth Conference of the European Chapter of the ACL, Budapest, Hungary. 291-298.

Preiss, J. and E. Briscoe (2003) Intermediate parsing for anaphora resolution? Implementing the Lappin and Leass non-coreference filters. In Proceedings of the EACL'03 Workshop on Computational Treatment of Anaphora, Budapest, Hungary. 1-6.

Watson, R., J. Preiss and E. Briscoe (2003) The contribution of domain-independent robust pronominal anaphora resolution to open-domain question-answering. In Proceedings of the Symposium on Reference Resolution and its Applications to Question Answering and Summarization. 75-82.

2002

Briscoe, E. and J. Carroll (2002) Robust accurate statistical annotation of general text. In Proceedings of the Third International Conference on Language Resources and Evaluation, Las Palmas, Gran Canaria. 1499-1504.

Briscoe, E., J. Carroll, J. Graham and A. Copestake (2002) Relational evaluation schemes. In Proceedings of the Beyond PARSEVAL Workshop at the Third International Conference on Language Resources and Evaluation, Las Palmas, Gran Canaria. 4-8.

Carroll, J. and E. Briscoe (2002) High precision extraction of grammatical relations. In Proceedings of the 19th International Conference on Computational Linguistics (COLING), Taipei, Taiwan. 134-140.

Korhonen, A. (2002) Assigning verbs to semantic classes via WordNet. In Proceedings of the COLING'02 Workshop on Building and Using Semantic Networks, Taipei, Taiwan.

Korhonen, A. (2002) Semantically motivated subcategorization acquisition. In Proceedings of the ACL'02 Workshop on Unsupervised Lexical Acquisition, Philadelphia, USA. 51-58.

Korhonen, A. and Y. Krymolowski (2002) On the robustness of entropy-based similarity measures in evaluation of subcategorization acquisition systems. In Proceedings of the Sixth Conference on Natural Language Learning (CoNLL-2002), Taipei, Taiwan. 91-97.

Preiss, J. and A. Korhonen (2002) Improving subcategorization acquisition with WSD. In Proceedings of the ACL'02 Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, USA. 102-108.

Preiss, J., A. Korhonen and E. Briscoe (2002) Subcategorization acquisition as an evaluation method for WSD. In Proceedings of the Third International Conference on Language Resources and Evaluation, Las Palmas, Gran Canaria. 1551-1556.

2001

Briscoe, E. (2001) From dictionary to corpus to self-organizing dictionary: learning valency associations in the face of variation and change. In Proceedings of Corpus Linguistics 2001, Lancaster University, UK. 79-89.

McCarthy, D., J. Carroll and J. Preiss (2001) Disambiguating noun and verb senses using automatically acquired selectional preferences. In Proceedings of the SENSEVAL-2 Workshop at ACL'01, Toulouse, France. 119-122.

2000

Korhonen, A. (2000) Using semantically motivated estimates to help subcategorization acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Hong Kong. 216-223.

Korhonen, A., G. Gorrell and D. McCarthy (2000) Statistical filtering and subcategorization frame acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Hong Kong. 199-205.