simserver.SimServer.find_similar

SimServer.find_similar(doc, min_score=0.0, max_results=100)[source]

Find at most max_results most similar articles in the index, each having similarity score of at least min_score.

doc is either a string (document id, previously indexed) or a dict containing a ‘tokens’ key. These tokens are processed to produce a vector, which is then used as a query.

The similar documents are returned in decreasing similarity order, as (doc_id, doc_score) pairs.