An HTML spider is any program that collects data from domains on the internet. “HTML” implies that it targets data based on it’s location within the pages source code. “Spider” usually implies that the software is designed to traverse many domain names looking at one or more common data types. This is different from Mozenda, which allows the user to collect any data type,
including context data.
Use Mozenda’s customizable HTML spider to crawl any website
The Mozenda Advantage
Unlike other parsing or wrapper based web data extraction software, Mozenda uses browser rendering technology which allows the Mozenda application to look and behave like a web browser, but act like a web scraper. There are multiple benefits to this approach:
- Mozenda loads pages and navigates pages just like a browser.
- Mozenda can click and activate any item on a web page and wait for it to load.
- Mozenda can easily navigate through sub-pages.
- With Mozenda you only have to set up the navigation and capture action once and it will replicate across pages and categories.
Mozenda provides users with tools that allow them to set up “Agents”. These Web Agents are like spiders, but they are smarter. They allow you to choose specific items on pages you would like to be extracted, then they automate the process of extracting the information. Of the many companies providing such technology, only Mozenda provides a simple point and click interface for setting up these agents to get the exact data you need.
Mozenda Training Videos
|Input Text Into a Form||0:44|
|Click the “Next” Button to Load the Next Page of Results||1:58|
|Schedule an Agent to Run Regularly||1:08|
|Combine the Contents of Two Fields||1:16|