Note that the denominator is simply the total number of terms in document d (counting each occurrence of the same term separately). There are various other ways to define term frequency:[5]: 128

An idf is constant per corpus, and accounts for the ratio of documents that include the word "this". In this case, we have a corpus of t