8000 GitHub - lulurun/web-data-miner: Nodejs library automatically extract data from html
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

lulurun/web-data-miner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

The web-data-minier is aiming to build a general web scraper.

It can detect if a web page contains listing data (like google search result) or a single data (like a wikipedia article) and exrtact data automatically.

It is written in pure javascript with no dependencies, and the algorithm can be easily implemented in any language.

To be continued ...

About

Nodejs library automatically extract data from html

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0