


Question

There are 3 tables of exclaim_mess.recode:

- the first table is recoded into 5 values, see table below:

- the second table is recoded into 10 values, see below:

- the third table is recoded into 3 values, see below:

   [0,1)  [1,2)  [2,Inf)
0   1219    650     1685
1    216     83       68

Rows: 0 = non-spam, 1 = spam
Columns: exclaim_mess.recode

Question: How would your summary of the relationship between spam and exclaim_mess.recode change if you recoded it into 5 values? 10 values? 3 values? Which regrouping is most reasonable, and why?

   [0,1)  [1,2)  [2,3)  [3,4)  [4,Inf)
0   1219    650    482    116     1087
1    216     83     25     12       31

Explanation / Answer

Here we are given frequency tables of exclaim_mess.recode cross-tabulated with the spam indicator, which is coded 0 (non-spam) and 1 (spam).

Our main aim is to decide which table is best and to explain why.

A frequency table should be constructed so that every cell contains a reasonably large count, with no very small values such as 0, 2, or 3.
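The cell-count rule can be checked directly on the two tables whose counts appear in the question. This is a minimal Python sketch: the threshold `MIN_COUNT = 5` is an assumed rule of thumb, not something stated in the question, and the 10-value table is left out because its counts are not shown.

```python
# Counts copied from the question; each table maps row label -> counts per bin.
# Rows: 0 = non-spam, 1 = spam.
TABLES = {
    "3 values": {0: [1219, 650, 1685], 1: [216, 83, 68]},
    "5 values": {0: [1219, 650, 482, 116, 1087], 1: [216, 83, 25, 12, 31]},
}

MIN_COUNT = 5  # assumed threshold for "not too small"; not from the question


def smallest_cell(table):
    """Return the smallest count found in any cell of the table."""
    return min(count for row in table.values() for count in row)


for name, table in TABLES.items():
    low = smallest_cell(table)
    verdict = "ok" if low >= MIN_COUNT else "too sparse"
    print(f"{name}: smallest cell = {low} ({verdict})")
```

Under this assumed threshold both tables pass: the smallest counts are 68 for the 3-value table and 12 for the 5-value table.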

In this case the most appropriate table is table 1, which uses 5 values: every one of its cells has a sufficient frequency (the smallest count is 12).
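To see how the summary of the spam relationship shifts with the recoding, one can compare the share of spam in each bin. The sketch below is in Python with hypothetical helper names (`spam_share`, `collapse_tail`). It uses only the counts given in the question, and it also confirms that merging the last three bins of the 5-value table reproduces the 3-value table.

```python
# Counts from the 5-value table in the question.
BINS_5 = ["[0,1)", "[1,2)", "[2,3)", "[3,4)", "[4,Inf)"]
NON_SPAM_5 = [1219, 650, 482, 116, 1087]
SPAM_5 = [216, 83, 25, 12, 31]


def spam_share(spam, non_spam):
    """Fraction of messages in each bin that are spam."""
    return [s / (s + n) for s, n in zip(spam, non_spam)]


def collapse_tail(counts, keep):
    """Keep the first `keep` bins and merge the rest into one open-ended bin."""
    return counts[:keep] + [sum(counts[keep:])]


# Merging [2,3), [3,4) and [4,Inf) gives the 3-value table's [2,Inf) column.
NON_SPAM_3 = collapse_tail(NON_SPAM_5, 2)
SPAM_3 = collapse_tail(SPAM_5, 2)

for label, share in zip(BINS_5, spam_share(SPAM_5, NON_SPAM_5)):
    print(f"{label:>8}: {share:.3f}")
print("[2,Inf) merged:", round(spam_share(SPAM_3, NON_SPAM_3)[-1], 3))
```

With 5 values, the spam share in [3,4) (12 of 128 messages) sits above both of its neighbours. The single [2,Inf) bin of the 3-value table averages that detail away.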