Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

DAT 220: Exercise Guidelines and Rubric Overview As organizations become more an

ID: 3913035 • Letter: D

Question

DAT 220:

Exercise Guidelines and Rubric

Overview

As organizations become more and more dependent on sophisticated data mining techniques to uncover new value streams, data management practices have been forced to respond to the often unique requirements presented by data mining professionals. It is self-evident that the organizations that best understand their data assets are also best positioned to adapt to the emerging data management practices required by data mining initiatives. In this exercise, you are asked to provide an overview of a prototypical company’s data environment and explain how it is situated for use in customer data mining activities.

Prompt

Assume you have been asked to present the current state of an organization’s analytic data assets.

Your Assignment

Prepare a depiction of an analytics data environment typical to an online retailer. Include a data warehouse repository that depicts various sources of available data. Also include at least one data mart that is sourced at least in part from the data warehouse.

The assignment will be graded based on the following critical elements:

a)Source Data Systems: Identify at least two source data systems that are typical to an online retailer and that might be useful to a data mining initiative to better understand the retailer’s customers.

b)Data Warehouse: Describe the contents of a data warehouse typical to an online retailer, emphasizing sources (transactional system, supply chain management system, etc.) and data subject areas(sales, customer, supply, etc.).

c)Data Mart: Identify the benefits and limitations of a data mart that is sourced from the warehouse to support customer analytics for a typical online retailer.

d)External Data: Identify a source of external data a typical online retailer might wish to include in a customer analytics data mart. What benefit is gained by the addition of this external data? What challenges are presented by the integration of this external data source?

Explanation / Answer

Nowadays Online retail (E-Retail) shops like Amazon, Flipkart, and many more have dramatically increased the use of Retail Analytics to enhance their business in terms of increasing the business revenues and improving their performance. This actually helps business people to get the information about statistical facts and figures of their business process. This Analysis support the business managers to design strategy and take vital decisions based on the analytical reports and results.

a)Sources of data can be used in this Analytic Solution :-

b) Data warehouse: A Data Source is a location from which data is being Extracted and used to create the contents of Datawarehouse. This Data is primarily loaded from operational Database tables like Customer, Sales, Products etc. This data is loaded from different servers of the Distributed online retails DBMS system. The data from various sources can be in any form like Database tables, CSV files, Bills in the form of text Documents/ PDF.

Such an aggregated data can be of multiple subject areas like Products Sales, Inventory System, Vendor Management, Logistics, Offers and Promotional Marketing System.

Data marts are the collection of data that is developed to perform analysis on a specific subject area, unlike a data warehouse which is a central data repository of the entire online market organization. Hence we can say data mart is a special usable subset of data warehouse.     

c)Data Marts from such a data warehouse:

Customer purchase Analysis: The Transactions of all the past customers is put into an analysis and insights from such a data is made by determining the frequently purchased group sets of products also called as Market Basket Analysis. Customer Buying habits and their preferences can be found out with deep analysis of past Transactions and Bills.

Advantages: This Data mart design from a big data warehouse can be more fast and efficient in terms of Processing and is less complex and expensive to maintain.

Disadvantages of Such a data mart can be

d)External Sources of Data in online retail Organisation data warehouse: This is sometimes in the unstructured and in an unpredictable form. Cleaning such a data and incorporating it can be helpful for business analysis and decision making.