waystone.com
robots.txt
Robots Exclusion Standard data for waystone.com
Resource Scan
Scan Details
Site Domain | waystone.com |
Base Domain | waystone.com |
Scan Status | Ok |
Last Scan | 2025-06-03T05:26:02+00:00 |
Next Scan | 2025-07-03T05:26:02+00:00 |
Last Scan
Scanned | 2025-06-03T05:26:02+00:00 |
URL | https://waystone.com/robots.txt |
Redirect | https://www.waystone.com/robots.txt |
Redirect Domain | www.waystone.com |
Redirect Base | waystone.com |
Domain IPs | 104.26.6.249, 104.26.7.249, 172.67.75.104, 2606:4700:20::681a:6f9, 2606:4700:20::681a:7f9, 2606:4700:20::ac43:4b68 |
Redirect IPs | 104.26.6.249, 104.26.7.249, 172.67.75.104, 2606:4700:20::681a:6f9, 2606:4700:20::681a:7f9, 2606:4700:20::ac43:4b68 |
Response IP | 104.26.7.249 |
Found | Yes |
Hash | 1b9c6c4539a95dd5a3f9b8e734f60bd736a58814ec0997c89231d4c62100659c |
SimHash | 49089b12e793 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /docs/*.pdf$ |
Disallow | /wp-content/uploads/2023/09/*24-Waystone-*-*.pdf$ |
Disallow | /sfdr/ |
Disallow | /kiid/ |
Disallow | /waystonefs/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.waystone.com/sitemap.xml |
sitemap | https://www.waystone.com/pdf-sitemap.xml |