Capture page-level metadata
  • 24 May 2021
  • 1 Minute to read
  • Contributors
  • Dark
    Light
  • PDF

Capture page-level metadata

  • Dark
    Light
  • PDF

Article Summary

Page-level metadata are data points about the current page you are scraping. This includes such things as the webpage’s URL, the title or the whole HTML code.

In the Agent Builder:

  1. Enter a URL
  2. Right-click the page header.

Capture Page-Level and Meta Data_Image1(1)

  1. Select Properties > PAGE LEVEL CAPTURE ACTIONS.
  2. Specify the information you want to capture.

Capture Page-Level and Meta Data_Image2(1)

ItemDescription
Page URLThe URL of the current web page.
Page TitleThe name of the page that displays in search engine results and tab names in your web browser.
Page HTMLThe full HTML code of the web page.
Meta DescriptionAn HTML element containing a summary of the web page used in SEO.
Meta KeywordsAn HTML element containing keywords that specify the topic of the web page used in SEO.
  1. Select SAVE.

The field name and the page level action will be included in your data set.

Capture Page-Level and Meta Data_Image3(1)


Was this article helpful?