8000 GitHub - yuanfeiz/Ruby-NER: Project for CS332(The Internet-based Information Extraction Technologies)
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
This repository was 10000 archived by the owner on Feb 11, 2025. It is now read-only.

yuanfeiz/Ruby-NER

Repository files navigation

Directory Tree:

  Ruby-NER/
  	lib/
	  pipeline.rb
    stanford-segmenter-2012-11-11/
      test.simp.utf8
      segment.sh
    stanford-postagger-full-2012-11-11/
      models/
      stanford-postagger.sh

Usage

  $ cd Ruby-NER
  $ bundle --without development
  $ lib/pipeline.rb test INPUT_FILE OUTPUT_FILE MODEL_FILE


  # Utils - 用法参见lib/pipeline.rb help [COMMAND]
  $ lib/pipeline.rb check_column_size # 检查每行feature数量是否一致。
  $ lib/pipeline.rb sub_label # 替换标签
  $ lib/pipeline.rb extract_prefix_and_surfix

Note

我对stanford-postagger.sh做了小修改,请用修改后的版本。(主要是修正了-classpath

About

Project for CS332(The Internet-based Information Extraction Technologies)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0