Improving Noun Phrase Coreference Resolution by Matching Strings

Yang, Xiaofeng; Zhou, Guodong; Su, Jian; Tan, Chew Lim

doi:10.1007/978-3-540-30211-7_3

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3248)

Included in the following conference series:

International Conference on Natural Language Processing

Abstract

In this paper we present a noun phrase coreference resolution system that aims to improve the identification of coreference realized by string matching. To this end, we make two extensions to the standard learning-based resolution framework. First, to improve recall, we introduce an additional set of features that capture the different matching patterns between noun phrases. Second, to improve precision, we modify the instance selection strategy so that non-anaphors are included during training instance generation. Evaluation on the MEDLINE data set shows that the combination of the two extensions provides significant gains in F-measure.
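To illustrate what features capturing "matching patterns between noun phrases" might look like, the following is a minimal sketch only. The abstract does not list the paper's actual features, so the feature names (full_match, head_match, anaphor_contained, overlap_ratio), the tokenizer, and the choice of the last token as the head word are all assumptions made here for illustration.

```python
import re


def tokens(np_text):
    """Lowercase a noun phrase and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", np_text.lower())


def match_features(anaphor, antecedent):
    """Return hypothetical string-matching features for a pair of noun phrases.

    These features are illustrative assumptions, not the feature set
    used in the paper.
    """
    a, c = tokens(anaphor), tokens(antecedent)
    a_set, c_set = set(a), set(c)
    union = a_set | c_set
    return {
        # Identical token sequences after normalization.
        "full_match": a == c,
        # Crude head match: treat the last token as the head word (assumption).
        "head_match": bool(a) and bool(c) and a[-1] == c[-1],
        # Every token of the anaphor also occurs in the candidate antecedent.
        "anaphor_contained": bool(a_set) and a_set <= c_set,
        # Jaccard overlap between the two token sets.
        "overlap_ratio": len(a_set & c_set) / len(union) if union else 0.0,
    }


features = match_features("the protein", "the IL-2 protein")
```

In a learning-based framework, a dictionary like this would be computed for each candidate pair of noun phrases and passed to the classifier alongside the other features; graded features such as the overlap ratio let the learner accept partial matches that an exact string comparison would miss.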

Author information

Authors and Affiliations

Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Xiaofeng Yang, Guodong Zhou & Jian Su
Department of Computer Science, National University of Singapore, 117543, Singapore
Xiaofeng Yang & Chew Lim Tan

Editor information

Editors and Affiliations

Behavior Design Corporation, IV Science-Based Industrial Park Hsinchu, 2F, No.5, Industry E. Rd, Taiwan
Keh-Yih Su
University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033; JST CREST, Honcho 4-1-8, Kawaguchi-shi, Saitama 332-0012, Japan
Jun’ichi Tsujii
Pohang University of Science and Technology (POSTECH), AITrc, Republic of Korea
Jong-Hyeok Lee
Language Information Sciences Research Centre, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Oi Yee Kwong

Cite this paper

Yang, X., Zhou, G., Su, J., Tan, C.L. (2005). Improving Noun Phrase Coreference Resolution by Matching Strings. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. Lecture Notes in Computer Science, vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_3

DOI: https://doi.org/10.1007/978-3-540-30211-7_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer Science, Computer Science (R0)
