Skip to content

Releases: hhursev/recipe-scrapers

15.2.1

04 Oct 14:30
Compare
Choose a tag to compare

This is essentially a re-release of 15.2.0 -- but that previous version was mistakenly uploaded from a locally-created git tag prepared before some changes from this origin repository were pulled (and hence ended up off-branch after the changes were added).

15.2.0

04 Oct 14:15
Compare
Choose a tag to compare

Adds support for:

  • donalskehan.com (#1237)
  • eatthismuch.com (#1237)
  • glutenfreeonashoestring.com (#1220)
  • miljuschka.nl (#1254)
  • theglutenfreeaustrian.com (#1220)
  • theloopywhisk.com (#1220)

Bugfixes:

  • Fix ingredient grouping retrieval for arla (#1236)
  • Remove workaround from kitchenstories (#1240)
  • Update/fix scraper for gesund+aktiv (#1238)
  • Update/fix scraper for realsimple (#1242)
  • Update/fix scraper for sallysblog (#1243)
  • Update/fix scraper for tofoo (#1244)
  • Update/fix scraper for wikicookbook (#1239)

Misc:

  • Drops support for Python 3.8 (#1265)
  • Modernize some type hints given that Python 3.9 has become baseline (#1267)
  • Adds an enforced order for nutritional expectations in JSON test data (#1203)
  • Refactor the generate.py scraper-generator script (#1232)
  • Raise a newly-added RecipeSchemaNotFound exception when the schema.org-fill plugin fails to find metadata on the page (#1230)
  • Require only ingredients_list, not also ingredients, in the test JSON data (#1211)
  • Cleanups: remove redundant methods and test expectation data (#1197, #1228, #1255)

Thank you, @jaspervzwi @jknndy! πŸŽ‰

14.60.0

04 Oct 14:22
Compare
Choose a tag to compare

Adds support for:

  • donalskehan.com (#1237)
  • eatthismuch.com (#1237)
  • glutenfreeonashoestring.com (#1220)
  • miljuschka.nl (#1254)
  • theglutenfreeaustrian.com (#1220)
  • theloopywhisk.com (#1220)

Bugfixes:

  • Fix ingredient grouping retrieval for arla (#1236)
  • Remove workaround from kitchenstories (#1240)
  • Update/fix scraper for gesund+aktiv (#1238)
  • Update/fix scraper for realsimple (#1242)
  • Update/fix scraper for sallysblog (#1243)
  • Update/fix scraper for tofoo (#1244)
  • Update/fix scraper for wikicookbook (#1239)

Misc:

  • Drops support for Python 3.8 (#1265)
  • Modernize some type hints given that Python 3.9 has become baseline (#1267)
  • Adds an enforced order for nutritional expectations in JSON test data (#1203)
  • Refactor the generate.py scraper-generator script (#1232)
  • Raise a newly-added RecipeSchemaNotFound exception when the schema.org-fill plugin fails to find metadata on the page (#1230)
  • Require only ingredients_list, not also ingredients, in the test JSON data (#1211)
  • Cleanups: remove redundant methods and test expectation data (#1197, #1228, #1255)

Thank you, @jaspervzwi @jknndy! πŸŽ‰

15.1.0

26 Aug 15:36
Compare
Choose a tag to compare

Adds support for:

  • books.ottolenghi.co.uk (#1227)
  • celebratingsweets.com (#1196)
  • ingoodflavor.com (#1196)
  • kalejunkie.com (#1204)
  • kitchendreaming.com (#1196)
  • leitesculinaria.com (#1196)
  • myvegetarianroots.com (#1196)
  • peelwithzeal.com (#1196)
  • sandwichtribunal.com (#1196)

Improvements:

  • streetkitchen.hu: additional field retrievals (#1183)
  • whatsgabycooking.com: ingredient group support (#1210)

Bugfixes:

  • streetkitchen.hu: field retrieval fixups (#1183)

Thank you, @bartaldavid @jknndy @tomtjes!

15.0.1

26 Aug 15:01
Compare
Choose a tag to compare

Bugfixes:

  • mob.co.uk: rating-rounding and site name retrieval (#1198)
  • reishunger.de: HTML and scraping refresh (#1202)
  • weightwatchers.*: dynamic category retrieval, language correction (#1213, #1216)
  • abstract schema.org: fixup for retrieval of graph lists (#1199)

Misc:

  • Documentation fixups (#1208, b798870)
  • Allow package description retrieval from metadata on Py3.9 (#1218)
  • A clearer, more transparent identifying user-agent has been added (#1221)
  • Enable continuous integration on the v14 branch (#1222)

Thank you, @jknndy @nmummau! πŸŽ‰

14.59.0

26 Aug 15:35
Compare
Choose a tag to compare

Adds support for:

  • books.ottolenghi.co.uk (#1227)
  • celebratingsweets.com (#1196)
  • ingoodflavor.com (#1196)
  • kalejunkie.com (#1204)
  • kitchendreaming.com (#1196)
  • leitesculinaria.com (#1196)
  • myvegetarianroots.com (#1196)
  • peelwithzeal.com (#1196)
  • sandwichtribunal.com (#1196)

Improvements:

  • streetkitchen.hu: additional field retrievals (#1183)
  • whatsgabycooking.com: ingredient group support (#1210)

Bugfixes:

  • streetkitchen.hu: field retrieval fixups (#1183)

Thank you, @bartaldavid @jknndy @tomtjes!

14.58.3

26 Aug 15:06
Compare
Choose a tag to compare

Bugfixes:

  • mob.co.uk: rating-rounding and site name retrieval (#1198)
  • reishunger.de: HTML and scraping refresh (#1202)
  • weightwatchers.*: dynamic category retrieval, language correction (#1213, #1216)
  • abstract schema.org: fixup for retrieval of graph lists (#1199)

Misc:

  • Documentation fixups (#1208, b798870)
  • Allow package description retrieval from metadata on Py3.9 (#1218)
  • A clearer, more transparent identifying user-agent has been added (#1221)
  • Enable continuous integration on the v14 branch (#1222)

Thank you, @jknndy @nmummau! πŸŽ‰

15.0.0

28 Jul 22:59
Compare
Choose a tag to compare

πŸ— 🍚 πŸ₯˜ 15.0.0 🍏 πŸ₯• πŸ…

Welcome to version 15.0.0 of recipe-scrapers, a simple scraping tool for recipe webpages.

This version is a major version upgrade, and contains some breaking changes from the 14.x series.

In particular:

  • The scrape_me function to the library has been removed from the 15.x series - please migrate to use the recommended scrape_html function instead.
    • This includes removal of the generic options arguments that were previously passed to requests. Please configure the options for your HTTP client (requests, httpx, urllib.request or other) directly.
    • If you need more time to migrate: don't worry, the 14.x series will continue to be maintained and updated until we're comfortable that the upgraded version is in regular use.

To get started and for usage examples, see the README file.

Features

  • Support for America's Test Kitchen, Cook's Country, and Cook's Illustrated have been added (#1060) - thank you, @smilerz!

Scrapers removed

Unfortunately the updated scraper interface is more restrictive, by-design, than 14.x - each scraper accepts an individual HTML page as input, and should not directly make additional HTTP requests. During adaptation to this interface, we dropped support for the following scrapers:

  • coop.se
  • gousto.co.uk
  • kptncook.com
  • madewithlau.com
  • marleyspoon.com
  • monsieur-cuisine.com
  • woolworths.com.au

This makes scrapers simpler, and that means they're easier to review and support. But it is a trade-off against the coverage of recipe sites that we can achieve.

Dependencies removed

  • requests is no longer a hard dependency of the library, but is available as an optional dependency.
  • responses is no longer required to run unit tests for the library.

Edit: add missing (and important!) scrapers-removed section.
Edit: section-heading nesting consistency.

14.58.2

28 Jul 22:18
Compare
Choose a tag to compare

This release only affects the unit test suite, and as a result is functionally identical to the 14.58.1 release. This release should be the nearest to the upcoming 15.0.0 release temporally, and relatively similar in terms of feature support.

Misc:

  • Unit tests: consider 'ingredient_groups' field optional instead of mandatory (#1184).

Normally I'd thank our contributors in this release-note footer - but unusually, the only change in this release was one of mine, but I can thank @jknndy for code reviewing πŸŽ‰

14.58.1

26 Jul 13:49
Compare
Choose a tag to compare

Bugfixes:

  • The nosalty.hu scraper has been added to the SCRAPERS supported-hosts dictionary (#1188).
  • Misleading deprecation notices about options had been emitted when using the wild_mode keyword argument; these no longer occur (#1191).
  • wild_mode was incorrectly enabled when scrape_html was called on recipe URLs for unsupported sites (#1186).

Misc:

  • Consistent ordering of JSON keys in test data, to support code review (#1151).
  • A deprecation notice for the scrape_me function has been added (#1192).
  • Updates to the README based on user support feedback (#1182).

WIth thanks to @jknndy for the consistent JSON key ordering improvements! πŸŽ‰