8000 GitHub - jesalnmistry/Extract_Links_1: python code to extract links from a webpage using SGMLParser, urllib
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

jesalnmistry/Extract_Links_1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

used sgmllib - simple SGML(Standard Generalized Mark-up Language) Parser

input link used as an example is "http://in.yahoo.com/?p=us"

links pertaining to all hrefs is saved in the file "url.txt"

also the links are sorted

Commands to run->

python UseURLLister.py

output is file "url.txt"

About

python code to extract links from a webpage using SGMLParser, urllib

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0