Carroll, J. and A. Fang (2004) `The automatic acquisition of verb
subcategorisations and their impact on the performance of an HPSG
parser'. In Proceedings of the 1st International Joint Conference on
Natural Language Processing (IJCNLP), Sanya City,
China. 107-114.
Also in K-Y. Su, J. Tsujii, J-H. Lee, et al. (eds.) Natural
Language Processing - IJCNLP 2004: First International Joint
Conference, Hainan Island, China, March 22-24, 2004, Revised Selected
Papers. Springer Lecture Notes in Computer Science, Volume 3248,
2005. 646.
We describe the automatic acquisition of a lexicon of verb subcategorisations from a domain-specific corpus, and an evaluation of the impact this lexicon has on the performance of a "deep" parser of English. We conducted two experiments to determine whether the empirically extracted verb stems would enhance the lexical coverage of the grammar and to see whether the automatically extracted verb subcategorisations would result in enhanced parser coverage. In our experiments, the empirically extracted verbs enhance lexical coverage by 8.5%. The automatically extracted verb subcategorisations enhance the parse success rate by 15% in theoretical terms and by 4.5% in practice. This is a promising approach for improving the robustness of deep parsing.
Download pdf version.