10000 GitHub - Gaglia88/ruler: Scalable record-level matching rules
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Gaglia88/ruler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RulER

RulER is a tool for Apache Spark that uses a novel technique that allows to find similar records by applying complex joining rules on one or more attributes.


If use this library, please cite:

  • Gagliardelli, L., Simonini, G., & Bergamaschi, S. (2020). RulER: Scaling Up Record-level Matching Rules. In EDBT 2020: 23nd International Conference on Extending Database Technology.

A brief presentation about RulER is available by clicking on the image below

Contacts

For any questions about RulER write us at name.surname@unimore.it

  • Luca Gagliardelli
  • Giovanni Simonini

About

Scalable record-level matching rules

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0