Skip to content
@scrapinghub

Scrapinghub

Turn web content into useful data

Pinned Loading

  1. splash splash Public

    Lightweight, scriptable browser as a service with an HTTP API

    Python 4.2k 516

  2. dateparser dateparser Public

    python parser for human readable dates

    Python 2.7k 481

  3. python-scrapinghub python-scrapinghub Public

    A client interface for Scrapinghub's API

    Python 208 61

  4. extruct extruct Public

    Extract embedded metadata from HTML markup

    Python 922 118

  5. spidermon spidermon Public

    Scrapy Extension for monitoring spiders execution.

    Python 544 101

  6. python-crfsuite python-crfsuite Public

    A python binding for crfsuite

    Python 774 222

Repositories

Showing 10 of 183 repositories
  • dateparser Public

    python parser for human readable dates

    scrapinghub/dateparser’s past year of commit activity
    Python 2,686 BSD-3-Clause 481 297 (6 issues need help) 53 Updated Jun 26, 2025
  • scrapinghub-stack-scrapy Public

    Software stack with latest Scrapy and updated deps

    scrapinghub/scrapinghub-stack-scrapy’s past year of commit activity
    Dockerfile 63 BSD-3-Clause 20 2 2 Updated Jun 20, 2025
  • shub-workflow Public
    scrapinghub/shub-workflow’s past year of commit activity
    Python 15 BSD-3-Clause 15 2 2 Updated Jun 13, 2025
  • scrapy-frontera Public

    More flexible and featured Frontera scheduler for Scrapy

    scrapinghub/scrapy-frontera’s past year of commit activity
    Python 37 BSD-3-Clause 5 2 1 Updated Jun 6, 2025
  • hcf-backend Public

    Crawl Frontier HCF backend

    scrapinghub/hcf-backend’s past year of commit activity
    Python 8 BSD-3-Clause 5 2 1 Updated Jun 6, 2025
  • frontera Public

    A scalable frontier for web crawlers

    scrapinghub/frontera’s past year of commit activity
    Python 1,312 BSD-3-Clause 217 78 (8 issues need help) 17 Updated Jun 6, 2025
  • web-poet Public

    Web scraping Page Objects core library

    scrapinghub/web-poet’s past year of commit activity
    Python 102 BSD-3-Clause 15 15 (1 issue needs help) 13 Updated Jun 6, 2025
  • scrapinghub-entrypoint-scrapy Public

    Scrapy entrypoint for Scrapinghub job runner

    scrapinghub/scrapinghub-entrypoint-scrapy’s past year of commit activity
    Python 26 BSD-3-Clause 16 7 1 Updated Jun 5, 2025
  • scrapy-poet Public

    Page Object pattern for Scrapy

    scrapinghub/scrapy-poet’s past year of commit activity
    Python 123 BSD-3-Clause 28 11 (1 issue needs help) 4 Updated May 26, 2025
  • andi Public

    Library for annotation-based dependency injection

    scrapinghub/andi’s past year of commit activity
    Python 22 BSD-3-Clause 6 4 1 Updated May 14, 2025