Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

Question1: In the references bellow two research papers for data quality in big

ID: 3849592 • Letter: Q

Question

Question1: In the references bellow two research papers for data quality in big data are presented. Write a text of a few paragraphs to report on a comparison between the two papers:

1- Big Data Preprocessing: A Quality Framework.

2- Evaluation the Quality of Social Media Data in Big Data Architucture.

You need to clearly identify the criteria you use for this comparison. You need also to identify the advantages and limitations, if any of each paper. Discuss how can you use such frameworks in Data and Information Quality Management.

References Question1:

[1] TALEB, Ikbal, DSSOULI, Rachida, et SERHANI, Mohamed Adel. Big data pre-processing: A quality framework. In : Big Data (BigData Congress), 2015 IEEE International Congress on. IEEE, 2015. p. 191-198.

[2] IMMONEN, Anne, PÄÄKKÖNEN, Pekka, et OVASKA, Eila. Evaluating the quality of social media data in big data architecture. IEEE Access, 2015, vol. 3, p. 2028-2043.

Explanation / Answer

Evaluating the Quality of Social Media Data in Big Data Architecture has implemented to adopt metadata and metadata standard  the metadata have standards this will help in the quality

Navigational metadata: it will consist of keywords and tags to the keywords attached

Process metadata :the orgin of the data base could be found

Descriptive metadata: this gives metadata descriptions where technical metadata gives technical information how to process the information

Quality metadata: the time and accuracy will be the key important of the value

Administrative metadata: this data gives the authority of the data which provider can give the access of the data how we can accept the data can used for processsing

advantages

they study each data detailed and classfiy them

they have huge type of parammeters for classfication

more errors could be detected in this process

limitations

data given is not quality most of them fail to meet the standards

data is not given from verified data provided

Big data pre-processing: A quality framework. In : Big Data has be implemented through Data Quality Dimensions  they are of two types intrinsic and contextual, they both depend on time and accuracy

they follow Data quality Profile optimization where data with high qualityactivities is selected for example  EEG file are collected from the reputed hospitals only to have quality and accurate data

advantages

they only take data from legit users

less accuracy failure

data provider is used to give high quality data

limitation

no details manner classfication

less no of attributed are used for classfication

Hire Me For All Your Tutoring Needs
Integrity-first tutoring: clear explanations, guidance, and feedback.
Drop an Email at
drjack9650@gmail.com
Chat Now And Get Quote