webscraper.io
robots.txt

Robots Exclusion Standard data for webscraper.io

Resource Scan

Scan Details

Site Domain webscraper.io
Base Domain webscraper.io
Scan Status Ok
Last Scan2024-09-11T19:03:10+00:00
Next Scan 2024-09-25T19:03:10+00:00

Last Scan

Scanned2024-09-11T19:03:10+00:00
URL https://webscraper.io/robots.txt
Domain IPs 108.156.133.108, 108.156.133.113, 108.156.133.119, 108.156.133.6
Response IP 108.156.133.113
Found Yes
Hash 145c0634c5af63315abb54bd9a98573fb9efc6d6940ef84ae05c84d096b167c9
SimHash 69005846ad93

Groups

*

Rule Path
Disallow
Disallow /test-sites/e-commerce/
Disallow /test-sites/tables

Other Records

Field Value
sitemap https://webscraper.io/sitemap.xml