🚨 Hotfix release for CCNewsCrawler
🚨
With the newly added xpath_search
in version 0.4.5
some parsers generated unpickable extractions, crashing the CCNewsCrawler
when piping back to the main thread and thus rendering the crawler unusable. This issue is now fixed with #655
Updated parsers
- Fix
paragraph
andsubheadline
selectors forMDR
by @MaxDall in #648 - Fix
BoersenZeitung
by @addie9800 in #647 - Fix
Merkur
by @addie9800 in #654 - Fix
Frankfurter Rundschau
by @addie9800 in #652 - Update
Stern
parser by @MaxDall in #658 - Add RSSFeed to
LeFigaro
by @addie9800 in #657
Bug fixes
- Fix a bug with attribute defaults and add
default_factory
parameter by @MaxDall in #649 - Fix pickling problem in
LinkedDataMapping
by @addie9800 in #655
QoL
- Add additional space characters to
normalize_whitespace
by @MaxDall in #646 - Improve encoding detection by @MaxDall in #650
Full Changelog: v0.4.5...v0.4.6