simserver.SimServer.find_similar
SimServer.find_similar(doc, min_score=0.0, max_results=100)
Find at most max_results of the most similar articles in the index, each with a similarity score of at least min_score.
doc is either a string (the id of a previously indexed document) or a dict containing a 'tokens' key. In the dict case, the tokens are converted to a vector, which is then used as the query.
The similar documents are returned in decreasing similarity order, as (doc_id, doc_score) pairs.
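The result contract described above (a score threshold, a cap on the number of results, and decreasing order) can be illustrated with a small pure-Python sketch. It does not call simserver and is not its implementation: select_similar and the scores dict are hypothetical names introduced only for this illustration. The commented-out calls at the end show the two documented forms of doc, assuming server is a SimServer instance with a built index.

```python
from typing import Dict, List, Tuple


def select_similar(
    scores: Dict[str, float],
    min_score: float = 0.0,
    max_results: int = 100,
) -> List[Tuple[str, float]]:
    """Hypothetical stand-in that mimics the documented result contract.

    `scores` maps document ids to similarity scores that are assumed to be
    already computed; simserver derives them from its index instead.
    """
    # Keep only documents that reach the minimum similarity score.
    kept = [(doc_id, score) for doc_id, score in scores.items() if score >= min_score]
    # Order by decreasing similarity, as the documentation specifies.
    kept.sort(key=lambda pair: pair[1], reverse=True)
    # Return at most max_results (doc_id, doc_score) pairs.
    return kept[:max_results]


# Made-up scores, for illustration only.
scores = {"doc_a": 0.91, "doc_b": 0.40, "doc_c": 0.75, "doc_d": 0.10}
top = select_similar(scores, min_score=0.3, max_results=2)
print(top)  # [('doc_a', 0.91), ('doc_c', 0.75)]

# The two documented forms of `doc`, assuming `server` is a SimServer
# instance with an existing index (not executed here):
#   server.find_similar("doc_a", min_score=0.3, max_results=2)
#   server.find_similar({"tokens": ["graph", "minors", "survey"]})
```

Raising min_score trims weakly related documents, while max_results only bounds the length of the returned list, so the two parameters can be combined freely.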