Question1: In the references bellow two research papers for data quality in big
ID: 3849592 • Letter: Q
Question
Question1: In the references bellow two research papers for data quality in big data are presented. Write a text of a few paragraphs to report on a comparison between the two papers:
1- Big Data Preprocessing: A Quality Framework.
2- Evaluation the Quality of Social Media Data in Big Data Architucture.
You need to clearly identify the criteria you use for this comparison. You need also to identify the advantages and limitations, if any of each paper. Discuss how can you use such frameworks in Data and Information Quality Management.
References Question1:
[1] TALEB, Ikbal, DSSOULI, Rachida, et SERHANI, Mohamed Adel. Big data pre-processing: A quality framework. In : Big Data (BigData Congress), 2015 IEEE International Congress on. IEEE, 2015. p. 191-198.
[2] IMMONEN, Anne, PÄÄKKÖNEN, Pekka, et OVASKA, Eila. Evaluating the quality of social media data in big data architecture. IEEE Access, 2015, vol. 3, p. 2028-2043.
Explanation / Answer
Evaluating the Quality of Social Media Data in Big Data Architecture has implemented to adopt metadata and metadata standard the metadata have standards this will help in the quality
Navigational metadata: it will consist of keywords and tags to the keywords attached
Process metadata :the orgin of the data base could be found
Descriptive metadata: this gives metadata descriptions where technical metadata gives technical information how to process the information
Quality metadata: the time and accuracy will be the key important of the value
Administrative metadata: this data gives the authority of the data which provider can give the access of the data how we can accept the data can used for processsing
advantages
they study each data detailed and classfiy them
they have huge type of parammeters for classfication
more errors could be detected in this process
limitations
data given is not quality most of them fail to meet the standards
data is not given from verified data provided
Big data pre-processing: A quality framework. In : Big Data has be implemented through Data Quality Dimensions they are of two types intrinsic and contextual, they both depend on time and accuracy
they follow Data quality Profile optimization where data with high qualityactivities is selected for example EEG file are collected from the reputed hospitals only to have quality and accurate data
advantages
they only take data from legit users
less accuracy failure
data provider is used to give high quality data
limitation
no details manner classfication
less no of attributed are used for classfication
Related Questions
drjack9650@gmail.com
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.