8000 GitHub - alexLCL/py_baidu: 用python开发的爬虫,抓取百度百科的数据,练手用的
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

用python开发的爬虫,抓取百度百科的数据,练手用的

Notifications You must be signed in to change notification settings

alexLCL/py_baidu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

py_baidu

百度百科的url如果有变化的话,请记得修改root_url的值 和 html_parser 里的 _get_new_urls的正则 为了节省时间,这个示例我只爬了5条数据,如果想测试更多条数据,请自行修改spider_main.py文件里的count值

About

用python开发的爬虫,抓取百度百科的数据,练手用的

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0