gensim.models.HdpModel
¶
-
class
gensim.models.
HdpModel
(corpus, id2word, max_chunks=None, max_time=None, chunksize=256, kappa=1.0, tau=64.0, K=15, T=150, alpha=1, gamma=1, eta=0.01, scale=1.0, var_converge=0.0001, outputdir=None)[source]¶ The constructor estimates Hierachical Dirichlet Process model parameters based on a training corpus:
>>> hdp = HdpModel(corpus, id2word) >>> hdp.print_topics(show_topics=20, num_words=10)
Inference on new documents is based on the approximately LDA-equivalent topics.
Model persistency is achieved through its load/save methods.
Methods¶
__init__ (corpus, id2word[, max_chunks, ...]) |
gamma: first level concentration |
doc_e_step (doc, ss, Elogsticks_1st, ...) |
e step for a single doc |
evaluate_test_corpus (corpus) |
|
hdp_to_lda () |
Compute the LDA almost equivalent HDP. |
inference (chunk) |
|
load (fname[, mmap]) |
Load a previously saved object from file (also see save). |
optimal_ordering () |
ordering the topics |
print_topics ([num_topics, num_words]) |
Alias for show_topics() that prints the num_words most probable words for topics number of topics to log. |
save (fname_or_handle[, separately, ...]) |
Save the object to file (also see load). |
save_options () |
legacy method; use self.save() instead |
save_topics ([doc_count]) |
legacy method; use self.save() instead |
show_topics ([num_topics, num_words, log, ...]) |
Print the num_words most probable words for topics number of topics. |
update (corpus) |
|
update_chunk (chunk[, update, opt_o]) |
|
update_expectations () |
Since we’re doing lazy updates on lambda, at any given moment the current state of lambda may not be accurate. |
update_finished (start_time, ...) |
|
update_lambda (sstats, word_list, opt_o) |