8000 GitHub - cchongXD/franc: Detect the language of text
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

cchongXD/franc

 
 

Repository files navigation

franc

Build Status Coverage Status Code Climate

Detect the language of text.

What 8096 s so cool about franc?

  1. franc supports more languages(†) than any other library, or Google;
  2. franc is easily forked to support 300+ languages;
  3. franc is just as fast as the competition.

† - If humans write in the language, on the web, and the language has more than one million speakers, franc detects it.

Installation

npm:

$ npm install franc

Component:

$ component install wooorm/franc

Bower:

$ bower install franc

Usage

var franc = require('franc');

franc('Alle menslike wesens word vry'); // "afr"
franc('এটি একটি ভাষা একক IBM স্ক্রিপ্ট'); // "ben"
franc('Alle mennesker er født frie og'); // "nno"
franc(''); // "und"

franc.all('O Brasil caiu 26 posições em');
/*
 * [
 *   [ 'por', 5507 ],
 *   [ 'glg', 6270 ],
 *   [ 'src', 6292 ],
 *   [ 'lav', 6391 ],
 *   [ 'cat', 6432 ],
 *   [ 'spa', 6481 ],
 *   [ 'bos', 6509 ],
 *   [ 'tpi', 6526 ],
 *   [ 'hrv', 6532 ],
 *   [ 'snn', 6556 ],
 *   [ 'bam', 6693 ],
 *   [ 'sco', 6695 ],
 *   ...
 * ]
 */

franc.all(''); // [ [ 'und', 1 ] ]

Note!: franc returns the "und" language code for an undetermined language. This happens when the input value is too short to give a significant answer.

Supported languages

franc supports 168 languages. For a complete list, check out Supported-Languages.md.

Other Language detection libraries

Benchmark

On a MacBook Air, it runs 169 paragraphs 2 times per second (total: 338 op/s).

         benchmarks * 169 paragraphs in different languages
  2 op/s » franc -- this module
  2 op/s » guesslanguage
  2 op/s » languagedetect
  2 op/s » vac

(I’ll work on a better benchmark soon)

License

LGPL © Titus Wormer

About

Detect the language of text

Resources

License

Stars

Watchers

Forks

Packages

No packages published
0