The naive Bayer classifier is trained over 1600 movie reviews and then tested over 400 reviews.
Here is the resulting confusion matrix:
190 (TP) 10(FN)
80 (FP) 120(TN)
What are the precision, recall, and the F1-score values?
Which Hadoop Files System shell command copies data from a local file system into HDFS?
You are analyzing written transcripts of focus groups conducted on product X. You approach is to use TF-IDF for your analysis.
What combination of TF-IDF scores should you examine to ensure you only report on the most important terms?
A hotel chain runs a simul-ation on room pricing. They want to estimate revenue, per hotel, within +/- $10 with 95% confidence (Za/2=1.96). The estimated revenue standard deviation is $5000 based on previous booking data.
What is the optimal number of simulation trials to run?