8000 GitHub - udondan/CommonSubstrings: $ npm install common-substrings
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

udondan/CommonSubstrings

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

92 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

common-substrings

npm npm version license

A method written in Typescript, used for finding all common strings for Javascript and node.js, particularly quick for large string samples. It works in both web and node environment and it has no dependencies.

Usage

Quickstart

The easiest way to start is:

import substrings from 'common-substrings';
const result = substrings(stringArray, {
  minOccurrence: 3,
  minLength: 5,
});

Result is listed as an Object array, each element in the array include :

  • source : the index of the labels which contain this fragment,
  • name : the name of the fragment,
  • weight : the product of the fragment length and the fragment occurrence

Example Result

If we have the array ['java', 'javascript','pythonscript'], using the default options, we will get result array:

  [
    {name : 'java', source : [0,1], weight : 8},
    {name : 'script', source : [1,2], weight : 10}
  ]

The default options are:

  • minLength : 3
  • minOccurrence : 2

Result is fetched from leaf to node of the trie, so it is not sorted, but it will be quite easy with lodash sortBy function , for example:

    const resultSortByWeight = _.sortBy(result, ['weight']);
    const resultSortByLength = _.sortBy(result, substring => substring.name.length);

Algorithm

Explanation here

Implementation in Other Languages

License

The algorithm code is under The MIT License

About

$ npm install common-substrings

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • JavaScript 62.3%
  • TypeScript 37.7%
0