-
Scrapinghub
- Vigo, Spain
- https://gitlab.com/gallaecio
-
scrapyrt Public
Forked from scrapinghub/scrapyrtHTTP API for Scrapy spiders
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 26, 2025 -
zyte-common-items Public
Forked from zytedata/zyte-common-itemsContains the common item definitions used in Zyte.
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 26, 2025 -
dateparser Public
Forked from scrapinghub/dateparserpython parser for human readable dates
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 26, 2025 -
vscode Public
Forked from microsoft/vscodeVisual Studio Code
TypeScript MIT License UpdatedJun 23, 2025 -
flake8-scrapy Public
Forked from scrapy/flake8-scrapyA Flake8 plugin to catch common issues on Scrapy spiders
Python MIT License UpdatedJun 18, 2025 -
scrapy Public
Forked from scrapy/scrapyScrapy, a fast high-level web crawling & scraping framework for Python.
-
ruff Public
Forked from astral-sh/ruffAn extremely fast Python linter and code formatter, written in Rust.
Rust MIT License UpdatedJun 6, 2025 -
scrapy-frontera Public
Forked from scrapinghub/scrapy-fronteraMore flexible and featured Frontera scheduler for Scrapy
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 6, 2025 -
scrapy-zyte-api Public
Forked from scrapy-plugins/scrapy-zyte-apiZyte Data API integration for Scrapy
Python UpdatedFeb 18, 2025 -
zyte-spider-templates Public
Forked from zytedata/zyte-spider-templatesSpider templates for automatic crawlers.
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 18, 2025 -
python-zyte-api Public
Forked from zytedata/python-zyte-apiPython client for Zyte Data API
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 16, 2025 -
andi Public
Forked from scrapinghub/andiLibrary for annotation-based dependency injection
10000 Python BSD 3-Clause "New" or "Revised" License UpdatedJan 30, 2025 -
paramiko-sftp-example Public
Paramiko-based minimal SFTP client and server examples
Python UpdatedJan 29, 2025 -
frontera Public
Forked from scrapinghub/fronteraA scalable frontier for web crawlers
-
scrapy-splash Public
Forked from scrapy-plugins/scrapy-splashScrapy+Splash for JavaScript integration
Python BSD 3-Clause "New" or "Revised" License UpdatedJan 27, 2025 -
scrapy-poet Public
Forked from scrapinghub/scrapy-poetPage Object pattern for Scrapy
Python BSD 3-Clause "New" or "Revised" License UpdatedJan 23, 2025 -
web-poet Public
Forked from scrapinghub/web-poetWeb scraping Page Objects core library
Python BSD 3-Clause "New" or "Revised" License UpdatedJan 22, 2025 -
itemadapter Public
Forked from scrapy/itemadapterCommon interface for data container classes
Python BSD 3-Clause "New" or "Revised" License UpdatedJan 8, 2025 -
zyte-spider-templates-project Public
Forked from zytedata/zyte-spider-templates-project -
duplicate-url-discarder Public
Forked from zytedata/duplicate-url-discarderPython MIT License UpdatedDec 30, 2024 -
scrapy-crawlera Public
Forked from scrapy-plugins/scrapy-zyte-smartproxyCrawlera middleware for Scrapy
Python UpdatedDec 30, 2024 -
form2request Public
Forked from scrapy/form2requestAI-powered Python 3.8+ library to build HTTP requests out of HTML forms.
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 24, 2024 -
web-scraping-tutorial-project Public
Forked from zytedata/web-scraping-tutorial-projecthttps://docs.zyte.com/web-scraping/tutorial/index.html
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 20, 2024 -
python-scrapinghub Public
Forked from scrapinghub/python-scrapinghubA client interface for Scrapinghub's API
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 16, 2024 -
scrapinghub-entrypoint-scrapy Public
Forked from scrapinghub/scrapinghub-entrypoint-scrapyScrapy entrypoint for Scrapinghub job runner
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 27, 2024 -
protego Public
Forked from scrapy/protegoA pure-Python robots.txt parser with support for modern conventions.
DIGITAL Command Language BSD 3-Clause "New" or "Revised" License UpdatedNov 15, 2024 -
extruct Public
Forked from scrapinghub/extructExtract embedded metadata from HTML markup
-
Formasaurus Public
Forked from TeamHG-Memex/FormasaurusFormasaurus tells you the type of an HTML form and its fields using machine learning
HTML UpdatedNov 7, 2024 -
-
extract-summit-contest-solutions Public
Forked from zytedata/extract-summit-contest-solutionsExample solutions for the practice and contest websites of the code contest of Web Data Extraction Summit.
Python MIT License UpdatedOct 18, 2024