DOI: 10.3115/991250.991268

Word sense disambiguation and text segmentation based on lexical cohesion

Published: 05 August 1994

Abstract

In this paper, we describe how word sense ambiguity can be resolved with the aid of lexical cohesion. By checking lexical cohesion between the current word and the existing lexical chains in order of their salience, in tandem with the generation of lexical chains, we realize incremental word sense disambiguation based on the contextual information that lexical chains reveal. Next, we describe how the segment boundaries of a text can be determined with the aid of lexical cohesion. We measure the plausibility of each point in the text as a segment boundary by computing the degree of agreement between the start and end points of lexical chains.
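
The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; it rests on several assumptions that the abstract does not state: the thesaurus is a hypothetical toy dictionary (TOY_THESAURUS), salience is approximated by how recently a chain was extended, an ambiguous word that matches no chain is left unresolved, and the boundary score for a gap is simply the number of chains ending just before it plus the number starting just after it. The function names disambiguate and boundary_scores are illustrative only.

```python
from collections import Counter

# Hypothetical toy thesaurus (an assumption): word -> candidate sense categories.
TOY_THESAURUS = {
    "bank": ["finance", "river"],
    "loan": ["finance"],
    "interest": ["finance", "emotion"],
    "water": ["river"],
}


def disambiguate(words, thesaurus):
    """Pick a sense per word by checking existing chains in salience order."""
    chains = {}  # sense category -> positions of the words in that chain
    senses = []
    for pos, word in enumerate(words):
        candidates = thesaurus.get(word, [])
        # Assumption: the most recently extended chain is the most salient.
        by_salience = sorted(chains, key=lambda c: chains[c][-1], reverse=True)
        chosen = next((c for c in by_salience if c in candidates), None)
        if chosen is None and len(candidates) == 1:
            chosen = candidates[0]  # an unambiguous word starts a new chain
        if chosen is not None:
            chains.setdefault(chosen, []).append(pos)
        senses.append(chosen)  # None means the word stays unresolved
    return senses, chains


def boundary_scores(chain_spans, n_units):
    """Score the gap after unit i: chains ending at i plus chains starting at i + 1."""
    starts = Counter(start for start, _ in chain_spans)
    ends = Counter(end for _, end in chain_spans)
    return [ends[i] + starts[i + 1] for i in range(n_units - 1)]


if __name__ == "__main__":
    senses, chains = disambiguate(["loan", "bank", "water", "interest"], TOY_THESAURUS)
    print(senses)
    # Chains given as (first unit, last unit) spans over six text units.
    print(boundary_scores([(0, 2), (1, 2), (3, 5), (3, 4)], 6))
```

In this toy run, "bank" joins the finance chain opened by "loan", and the highest boundary score falls on the gap where two chains end and two others begin. A real system would take its senses from a thesaurus and would need a policy for ambiguous words that match no chain yet.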


Published In

COLING '94: Proceedings of the 15th conference on Computational linguistics - Volume 2
August 1994
661 pages

Publisher

Association for Computational Linguistics
United States


