gensim.models.LsiModel.add_documents

LsiModel.add_documents(corpus, chunksize=None, decay=None)

Update the singular value decomposition to take a new corpus of documents into account.

Training proceeds in chunks of chunksize documents at a time. The choice of chunksize is a tradeoff between speed (a larger chunksize is faster) and memory footprint (a smaller chunksize uses less memory). If distributed mode is on, each chunk is sent to a different worker/computer.
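
As a rough illustration of chunked processing, the sketch below splits a document stream into lists of at most chunksize items using only the standard library. The helper name `iter_chunks` and the toy `docs` list are hypothetical and are not part of gensim; the library's own chunking code may differ.

```python
from itertools import islice


def iter_chunks(stream, chunksize):
    """Yield successive lists of at most `chunksize` items from `stream`.

    Hypothetical helper for illustration only; not a gensim function.
    """
    it = iter(stream)
    while True:
        chunk = list(islice(it, chunksize))
        if not chunk:
            return
        yield chunk


# Toy bag-of-words documents: lists of (term_id, weight) pairs.
docs = [
    [(0, 1.0)],
    [(1, 2.0)],
    [(0, 1.0), (2, 1.0)],
    [(2, 3.0)],
    [(1, 1.0)],
]

# Only one chunk is held in memory at a time, so a smaller chunksize
# means a smaller peak footprint but more update steps.
chunk_sizes = [len(chunk) for chunk in iter_chunks(docs, chunksize=2)]
print(chunk_sizes)  # [2, 2, 1]
```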

Setting decay < 1.0 causes re-orientation towards new data trends in the input document stream, by giving less emphasis to old observations. This allows LSA to gradually “forget” old observations (documents) and give more preference to new ones.