website.com
robots.txt
Robots Exclusion Standard data for website.com
Resource Scan
Scan Details
| Site Domain | website.com |
| Base Domain | website.com |
| Scan Status | Ok |
| Last Scan | 2025-12-09T13:02:47+00:00 |
| Next Scan | 2025-12-16T13:02:47+00:00 |
Last Scan
| Scanned | 2025-12-09T13:02:47+00:00 |
| URL | https://website.com/robots.txt |
| Redirect | https://www.website.com/robots.txt?source=SC |
| Redirect Domain | www.website.com |
| Redirect Base | website.com |
| Domain IPs | 104.20.26.208, 172.66.170.25, 2606:4700:10::6814:1ad0, 2606:4700:10::ac42:aa19 |
| Redirect IPs | 104.20.26.208, 172.66.170.25, 2606:4700:10::6814:1ad0, 2606:4700:10::ac42:aa19 |
| Response IP | 104.20.26.208 |
| Found | Yes |
| Hash | bf20d0023b75fe9f46f3dbbce2f602105a79fff1db57df65b7640436b7b484b6 |
| SimHash | 210cccd1a482 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /_api/* |
| Disallow | /_api/ |
| Disallow | /verify-email/ |
| Disallow | /cdn-cgi/ |
| Disallow | /ad/* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.website.com/sitemap.xml |