8000 Releases · goose3/goose3 · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Releases: goose3/goose3

Version 3.0.9

12 Jan 01:20
82c242d
Compare
Choose a tag to compare
  • Add soup as a parser option to use lxml.html.soupparser #27
  • Fix an issue with passing the requests session object to the crawler
  • Documentation on readthedocs (https://readthedocs.org/projects/goose3/)
  • Code cleanup

v3.0.8

09 Dec 18:59
2bbea91
Compare
Choose a tag to compare

Version 3.0.8

  • Fix using different requests session for each url fetched
    • Added close method to the Goose object
  • Allow the Goose object to be a context manager
from goose3 import Goose
with Goose() as g:
    g.extract(url='some-url-here')

NOTE: No need to change code as it will attempt to automatically close
the connection on garbage collection

  • Configuration object changes
    • Better handling of the known_context_patterns configuration
    • Added http_headers configuration option to be passed to requests
    • Added http_proxies configuration option to be passed to requests
    • Added http_auth configuration option to be passed to requests
  • Fix base64 image parsing #7

v3.0.7

23 Nov 03:35
980e2a1
Compare
Choose a tag to compare
  • Fix installation issue
    • Removed unused/broken regex
    • Include all necessary files
    • Fix failed tests (most)
  • Resolved relative URL issue #21
  • Resolved temporary files not being properly removed #18
  • Removed unused dependencies and code to support python 2 #16
  • Fix error when using the configuration object to configure goose #14

v3.0.3

28 Mar 19:55
Compare
Choose a tag to compare
update version

v3.0.1

24 Mar 16:31
Compare
Choose a tag to compare

This is the first working version of Goose on python3

0