题目
题目

Business Analysis with Unstructured Data - DAT-7471 - BMBAN2 In-class knowledge check #2 (Remotely Proctored)

多项选择题

When analyzing articles, the tf-idf-tf_idf framework is used to:

选项
A.identify tokens or terms that are most frequent to each article
B.identify tokens or terms that are most important/specific to each article
C.identify tokens or terms that are both, most frequent and most important/specific to each article
查看解析

查看解析

标准答案
Please login to view
思路分析
To analyze how tf-idf-tf_idf works, we need to consider what the metric is designed to do across a collection of documents. Option 1: 'identify tokens or terms that are most frequent to each article' – While term frequency within a document can be high for some words, tf-idf specifically downplays terms that are merely frequent across many documents and emphasizes terms that are distinctive for that documen......Login to view full explanation

登录即可查看完整答案

我们收录了全球超50000道考试原题与详细解析,现在登录,立即获得答案。

更多留学生实用工具

加入我们,立即解锁 海量真题独家解析,让复习快人一步!