Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

As part the work for the paper you are expected to develop a small piece of soft

ID: 3568120 • Letter: A

Question

As part the work for the paper you are expected to develop a small piece of software that extracts Microdata from a webpages (you can create your own pages if you wish). The software requirements for both solution and testing are shown in section B below. The software should be used to demonstrate the argument you are putting forward through the paper.

B Software

B.1 Basic Requirement

B.2 Advanced

Appropriate feature or features that add (all features of the basic requirements must still be present) to the system. Examples only (other appropriate features are acceptable)

Explanation / Answer

Extracting schema.org microdata using Scrapy selectors and XPath

Web pages are full of data, that is what web scraping is mostly about. But often you want more than data, you want meaning. Microdata markup embedded in HTML source helps machines understand what the pages are about: contact information, product reviews, events etc.

Web authors have several ways to add metadata to their web pages: HTML