Releases: goose3/goose3
Releases · goose3/goose3
Version 3.1.19
- Clean
small
tags by removing them; see issue #190 - Update stopword files with additional languages; see issue #194
Version 3.1.18
Version 3.1.17
- Fix issue with multiple site names in the metadata; see issue #179
- Fix key error in author extraction; PR #177 and PR #183; Thanks anchitshrivastavaand erikvullings
Version 3.1.16
- Fix issue with author parsing; see issue #175
Version 3.1.15
- Add
article.schema
support for@graph
data; PR #168; Thanks julianofnascimento - Retrieve
article.authors
from linked data; see issue #170 - Fix
article.title
extraction whenopengraph
returns a list instead of a single string; see issue #172
Version 3.1.14
- Fix
article.schema
failing when returns a list instead of a dictionary; PR #160; Thanks @catdingding - Add
article.top_node_raw_html
which keeps the base html of the top node; see issue #112 and PR #162; Thanks @markowanga - Handle opengraph arrays if multiple tags are present; see issue #91
Version 3.1.13
- Deprecation Notice:
camelCase
functions have been deprecated; see issue #151
- Fix for building Goose3 into an executable; see issue #144
- Fix
schema.org
check; see PR #154; Thanks @mromanelli9 - Fix
TypeError
issues in publisher and title extractions; see PR #157 and PR #156; Thanks @Amaimersion - Fix
Configuration.strict
not always being enforced; see issue #158 - Update build system to be PEP 517 compliant
- Code Standardization
Version 3.1.12
- Fix for Korean stop words see PR #138; Thanks galaxytemple
- Allow for extra dependencies see issue #141
- Fix leading and trailing charset characters see issue #139; Thanks @nnick14
- Added basic logging and typing
Version 3.1.11
- Replace
md5
with a pure pythonfnv_1a
non-cryptographic hash see issue #133; Thanks @openbrian
Version 3.1.10
- Fix for float based timezones see issue #128 Thanks @Vasniktel!
- Add
langdetect
dependency to help resolve some edge cases when missing language information causes text to not be pulled. see issue #106