mirror of
https://github.com/thewesker/twitter_ebooks.git
synced 2025-12-21 21:01:16 -05:00
Ran into `Encoding::CompatibilityError` issue trying to consume my corpus (tweets.csv) on Windows 7, but this likely affects other environments as well. Fix: force reading corpus file contents as utf-8. Also a quick clean-up of the CSV flow to only parse the content once instead of double-dipping.