DOI: 10.3115/1119250.1119279

Systran's Chinese word segmentation

Published: 11 July 2003

Abstract

SYSTRAN's Chinese word segmentation module is an important component of its Chinese-English machine translation system. The module takes a rule-based approach, drawing on a large dictionary and fine-grained linguistic rules. It works on general-purpose texts from different Chinese-speaking regions, with comparable performance. SYSTRAN participated in the four open tracks of the First International Chinese Word Segmentation Bakeoff. This paper gives a general description of the segmentation module, together with the results and an analysis of its performance in the Bakeoff.

References

[1] Liu, Y., Tan, Q., & Shen, X. 1993. Segmentation Standard for Modern Chinese Information Processing and Automatic Segmentation Methodology.
[2] Sproat, R., & Emerson, T. 2003. The First International Chinese Word Segmentation Bakeoff. In Proceedings of the Second SIGHAN Workshop on Chinese Language Processing. ACL03.

Cited By

  • (2008) Tighter integration of rule-based and statistical MT in serial system combination. Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1, 913-919. DOI: 10.5555/1599081.1599196. Online publication date: 18-Aug-2008.

Published In

SIGHAN '03: Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17
July 2003
193 pages

Publisher

Association for Computational Linguistics

United States

Qualifiers

  • Article
