__init__(fname) |
docbyoffset(offset) | Return the document at the given file offset (in bytes).
load(fname[, mmap]) | Load a previously saved object from file (also see save).
save(*args, **kwargs) |
save_corpus(fname, corpus[, id2word, ...]) | Save a corpus in the Matrix Market format to disk.
serialize(serializer, fname, corpus[, ...]) | Iterate through the document stream corpus, saving the documents to fname and recording the byte offset of each document.
skip_headers(input_file) | Skip file headers that appear before the first document.
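
The relationship between serialize and docbyoffset in the summary above can be illustrated with a minimal, standard-library-only sketch. It is not the library's implementation: the function names `serialize_with_offsets` and `doc_by_offset`, the toy one-document-per-line JSON layout, and the `%` header line are all assumptions made for illustration, and the real on-disk format is Matrix Market rather than JSON lines. The point is only that recording the byte offset of each document while writing makes it possible to jump directly to any document later.

```python
import json
import os
import tempfile


def serialize_with_offsets(fname, corpus):
    """Write one document per line and return the byte offset of each one.

    Hypothetical helper: a toy stand-in for a serializer that records offsets.
    """
    offsets = []
    with open(fname, "wb") as fout:
        # Toy header line that precedes the first document (assumed layout).
        fout.write(b"% toy header line\n")
        for doc in corpus:
            offsets.append(fout.tell())  # byte position where this document starts
            fout.write(json.dumps(doc).encode("utf-8") + b"\n")
    return offsets


def doc_by_offset(fname, offset):
    """Seek to a recorded byte offset and read exactly one document."""
    with open(fname, "rb") as fin:
        fin.seek(offset)
        line = fin.readline().decode("utf-8")
    # JSON stores pairs as lists; convert back to (term_id, weight) tuples.
    return [tuple(pair) for pair in json.loads(line)]


# Each document is a sparse vector of (term_id, weight) pairs.
corpus = [[(0, 1.0), (2, 3.0)], [(1, 2.0)], [(0, 0.5), (1, 1.5)]]
path = os.path.join(tempfile.mkdtemp(), "toy_corpus.txt")

offsets = serialize_with_offsets(path, corpus)
second_doc = doc_by_offset(path, offsets[1])  # random access, no full scan
```

Because the first recorded offset already points past the header line, a reader that seeks by offset never has to skip headers itself; only a sequential scan from the start of the file needs that step.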