Handling Dirty Data
Question
Handling Dirty Data. Provide your solution for this situation: You are the Registrar for a university in Ohio. Your university began in 1900 and has operated continuously and is operating today. The state regulation body asks you for a report. One of the pieces of information asked is the average grade for students from the beginning of the college (1900) to today. A fire destroyed the student grades for years 1942 to 1944. A flood helped destroy grades for January 1949 to June 1949. You must provide a report with the average grade for students since the beginning of the college. How do you handle this dirty data situation, considering the things that have been covered in this course (and of course this chapter)?
Explanation / Answer
Data lost: 1942-1944 and January-June 1949.
We have to find the mean of the grades from 1900-2017.
For that we need to estimate the grades for 1942, 1943 and 1944.
We fit a time series model to the grades from 1900-1941 (the years before the gap) and use it to estimate the grades for 1942, 1943 and 1944.
Which model should we use? One can go for an autoregressive (AR) model of suitable order. A plot of the data will give an idea of the appropriate order.
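As a rough illustration of the autoregressive idea, here is a minimal sketch of an AR(1) fit by ordinary least squares, followed by a three-step forecast. The order 1, the function names `fit_ar1` and `forecast_ar1`, and the yearly mean values are all assumptions made up for this example; in practice the order would be chosen from a plot of the real data, and a statistics package would normally do the fitting.

```python
from statistics import mean


def fit_ar1(series):
    """Least-squares fit of y[t] = c + phi * y[t-1]; returns (c, phi)."""
    x = series[:-1]  # lagged values y[t-1]
    y = series[1:]   # current values y[t]
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("series is constant; AR(1) slope is undefined")
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    phi = sxy / sxx
    c = y_bar - phi * x_bar
    return c, phi


def forecast_ar1(series, steps):
    """Forecast `steps` values past the end of `series`, feeding each
    prediction back in as the next lagged value."""
    c, phi = fit_ar1(series)
    last = series[-1]
    predictions = []
    for _ in range(steps):
        last = c + phi * last
        predictions.append(last)
    return predictions


# Hypothetical yearly mean grades (4.0 scale) for 1932-1941.
# These numbers are invented for illustration only.
pre_gap_means = [2.71, 2.74, 2.70, 2.76, 2.79, 2.77, 2.81, 2.80, 2.84, 2.83]

# Estimated yearly means for the three years lost in the fire.
estimates = dict(zip((1942, 1943, 1944), forecast_ar1(pre_gap_means, 3)))
print(estimates)
```

An AR(1) forecast drifts toward the long-run mean of the fitted series, so the three estimates will sit close to the recent pre-gap level.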
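For January-June 1949 we again fit a time series model, this time using the January-June grades of the years 1900-1948 (with 1942-1944 either left out or filled in with their estimated values).
We restrict the fit to the same six months of each year in order to capture the seasonality factor. With the help of this model we estimate the grades for the six missing months.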
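The seasonality idea can be sketched with something simpler than a fitted time series model: an additive seasonal offset. The sketch below assumes that the July-December 1949 grades survived the flood, and that earlier years have separate January-June and July-December means. The function name `estimate_first_half` and every number are hypothetical.

```python
from statistics import mean


def estimate_first_half(history, known_second_half):
    """Estimate a missing January-June mean from the known July-December
    mean of the same year.

    history maps year -> (jan_jun_mean, jul_dec_mean) for complete years.
    The seasonal offset is the average first-half minus second-half gap.
    """
    offsets = [first - second for first, second in history.values()]
    return known_second_half + mean(offsets)


# Hypothetical half-year means for complete years before the flood.
history = {
    1945: (2.80, 2.86),
    1946: (2.82, 2.88),
    1947: (2.79, 2.85),
    1948: (2.84, 2.90),
}

# Hypothetical surviving July-December 1949 mean.
estimate_1949_h1 = estimate_first_half(history, known_second_half=2.91)
print(round(estimate_1949_h1, 2))
```

This swaps the time series fit described above for a plain average of past seasonal gaps, which is easier to explain in a report but ignores any trend in the gap itself.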
With the missing periods filled in by these estimates, we have grades for every year and can find the average of the grades from 1900-2017.
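The final averaging step might look like the sketch below. It assumes the data has been reduced to one mean grade per year, and it offers an optional weighting by number of students, since a plain mean of yearly means treats every year as equally large. The function name `overall_average`, the `weights` parameter and all values are hypothetical.

```python
def overall_average(yearly_means, weights=None):
    """Average of yearly mean grades.

    yearly_means maps year -> mean grade (observed or estimated).
    weights, if given, maps year -> number of students that year.
    """
    years = sorted(yearly_means)
    if weights is None:
        # Unweighted: assumes every year counts equally.
        return sum(yearly_means[y] for y in years) / len(years)
    total = sum(weights[y] for y in years)
    return sum(yearly_means[y] * weights[y] for y in years) / total


# Hypothetical observed and estimated yearly means (a small slice only).
observed = {1940: 2.84, 1941: 2.83, 1945: 2.83, 1946: 2.85}
estimated = {1942: 2.83, 1943: 2.82, 1944: 2.82}

# Merge both sources into one complete series before averaging.
combined = {**observed, **estimated}
print(overall_average(combined))
```

Keeping the estimated years in a separate dictionary until the last step makes it easy to state in the report which values are estimates rather than recorded grades.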
Thanks