AJAX
Today many websites load their content using AJAX (Asynchronous JavaScript and XML). This often greatly improves user experience but also may become a stumbling block for some web scrapers.
At the same time a good web scraper should be able to parse all major data formats that are used in AJAX technology: HTML, XML and JSON.
You may use this test to check scraper's ability to:
- Receive HTML via AJAX and parse it
- Receive XML via AJAX and parse it
- Receive JSON via AJAX and parse it
How does it work:
- The browser receives three lists of three names through AJAX in three different formats: HTML, XML and JSON
- HTML data is received automatically as the page is loaded
- To receive XML and JSON data you need to click to a corresponding link
- The scraper should be able to extract all nine names