The TF-IDF Statistical Method
TF-IDF (term frequency–inverse document frequency) is a weighting technique commonly used in information retrieval and data mining.
- Term frequency: the more often the search terms appear in a document, the higher that document's score.
- Inverse document frequency: a term carries more weight if it is uncommon across the other documents in the collection.
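
The two factors above can be combined into a single per-term weight. The sketch below is one common textbook formulation, not a formula given in this note: it assumes term frequency is the raw count divided by document length, and inverse document frequency is `log(N / df)` with no smoothing. The function name `tf_idf` and the sample documents are made up for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed formulation (not from the original note):
      tf  = occurrences of the term in the document / document length
      idf = log(number of documents / documents containing the term)
    """
    n_docs = len(docs)
    # Document frequency: how many documents contain each term at least once.
    df = Counter(term for doc in docs for term in set(doc))

    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy corpus, already split into tokens.
docs = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
    "the cat chased the dog".split(),
]
weights = tf_idf(docs)
```

In this toy corpus "the" occurs in every document, so its inverse document frequency is `log(3/3) = 0` and its weight vanishes, while "mat" occurs in only one document and therefore outweighs "cat", which occurs in two. Production libraries usually add smoothing and normalization on top of this basic scheme, so their numbers will differ.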