in Uncategorized

Cosine Similarity Search for new documents using Scikit-Learn

Some time ago I was working in a project of similarity search (i.e. bring similar items based on text) using Scikit-Learn and one topic that it’s not covered in the documentation is: What if I got new data? How can I calculate the similarity of this new text?

Some lovely souls of Stack Overlow gave me the tip and I’m sharing it:

Write a Comment


This site uses Akismet to reduce spam. Learn how your comment data is processed.