Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

The data set infmort from the faraway package contains information on the mortal

ID: 3309373 • Letter: T

Question

The data set infmort from the faraway package contains information on the mortality of infants for 105 nations. The variable mortality gives the number of deaths per 1000 live births, while income is the per capita income in US dollars and region indicates the geographic area of the nation. Consider the model E(log(mortality)|income, region) = 0 + 1 log(income) + 2region + 12region log(income). (b) Explain the practical meaning of the hypothesis H0 : 12 = 2 = 0 in the context of the above model. (c) Perform a test for the hypothesis in part b) and summarize your results.

Explanation / Answer

(b)

The practical meaning of the hypothesis H0 : 12 = 2 = 0 is that for given income and region, only log(income) is significant in determining log(mortality). In other words, log(mortality) is independent of region and interaction between regio and log(income) for given income and region.

(c)

The unrestricted model is

E(log(mortality)|income, region) = 0 + 1 log(income) + 2region + 12region log(income)

R Code to run the above model is -

model.ur = lm(log(mortality) ~ log(income) + region + region*log(income), data = infmort)

The restricted model is

E(log(mortality)|income, region) = 0 + 1 log(income)

R Code to run the above model is -

model.r = lm(log(mortality) ~ log(income), data = infmort)

R-Squared for both the models are

> summary(model.r)$r.squared
[1] 0.5021236
> summary(model.ur)$r.squared
[1] 0.6464492

F -Statisitic is given as,

F = [(R2UR - R2R)/q] / [(1-R2UR) / (n-k)]

where R2UR and R2R are R-squared values for unrestricted and restricted model.

q is number of restrivtions. Here q = 2

n is number of observations. n = 105

k is number of coefficients (including intercept) in the unrestricted model . Here k = 4

F = [(0.6464492 - 0.5021236)/2] / [(1-0.6464492) / (105-4)] = 20.61498

Degree of freedom = q , n-k = 2, 101

Critical value of F for significance level of 0.05 and df = 2, 101 is 3.086

As, observed value of F is greater than the criticacl value, we reject the null hypothesis.

(c)

As, we reject the null hypothesis, we conclude that at 5% significance level, 12 and 2 are not 0 and region and interaction between region and log(income) are significant variable in determining the (log(mortality) for given income and region.

Hire Me For All Your Tutoring Needs
Integrity-first tutoring: clear explanations, guidance, and feedback.
Drop an Email at
drjack9650@gmail.com
Chat Now And Get Quote