mirror of
https://github.com/thewesker/twitter_ebooks.git
synced 2025-12-20 04:11:08 -05:00
256 B
Executable File
256 B
Executable File
- Files in text/ are preprocessed by
rake consumeand serialized - e.g. text/foo.tweets becomes consumed/foo.corpus
rake consumelooks at hashes to know which it needs to update- Preprocessed corpus files are loaded at runtime by Corpus.load('foo')