-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
Closed
Labels
difficulty mediumMedium issue: required good gensim understanding & python skillsMedium issue: required good gensim understanding & python skillsfeatureIssue described a new featureIssue described a new featurewishlistFeature requestFeature request
Description
https://en.wikipedia.org/wiki/SMART_Information_Retrieval_System
The current TFIDF model uses natural TF and IDF for computing TFIDF. The idea is to try various transformation like logarithmic, augmented,boolean etc. before computing the vectors.
More about this - http://www.cs.odu.edu/~jbollen/IR04/readings/article1-29-03.pdf and https://nlp.stanford.edu/IR-book/pdf/06vect.pdf
Will send a PR tomorrow.
Metadata
Metadata
Assignees
Labels
difficulty mediumMedium issue: required good gensim understanding & python skillsMedium issue: required good gensim understanding & python skillsfeatureIssue described a new featureIssue described a new featurewishlistFeature requestFeature request