pagethink.com
robots.txt
Robots Exclusion Standard data for pagethink.com
Resource Scan
Scan Details
Site Domain | pagethink.com |
Base Domain | pagethink.com |
Scan Status | Ok |
Last Scan | 2025-09-21T08:37:15+00:00 |
Next Scan | 2025-10-21T08:37:15+00:00 |
Last Scan
Scanned | 2025-09-21T08:37:15+00:00 |
URL | https://pagethink.com/robots.txt |
Redirect | https://www.pagethink.com/robots.txt |
Redirect Domain | www.pagethink.com |
Redirect Base | pagethink.com |
Domain IPs | 104.26.12.190, 104.26.13.190, 172.67.70.158, 2606:4700:20::681a:cbe, 2606:4700:20::681a:dbe, 2606:4700:20::ac43:469e |
Redirect IPs | 104.26.12.190, 104.26.13.190, 172.67.70.158, 2606:4700:20::681a:cbe, 2606:4700:20::681a:dbe, 2606:4700:20::ac43:469e |
Response IP | 104.26.12.190 |
Found | Yes |
Hash | e76cb6e95448e0ca51fbdc1931d7847466b12291517f8bea9a47772d419ed79f |
SimHash | aa685c062192 |
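The report does not say which algorithm produced the Hash value. Its 64 hexadecimal digits are consistent with a SHA-256 digest of the fetched robots.txt body, so the sketch below assumes that. The helper names `body_digest` and `matches_reported` are hypothetical, and the result may differ if the scanner normalises the body before hashing.

```python
import hashlib

# Hash value as listed in the scan record above.
REPORTED_HASH = "e76cb6e95448e0ca51fbdc1931d7847466b12291517f8bea9a47772d419ed79f"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw response body.

    Assumption: the scanner hashes the unmodified bytes of the response.
    """
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a freshly computed digest against the reported one."""
    return body_digest(body) == reported.lower()
```

Passing the bytes of a newly fetched robots.txt to `matches_reported` would indicate whether the file has changed since the scan, provided the SHA-256 assumption holds.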
Groups
*
Rule | Path |
---|---|
Disallow | /cpresources/ |
Disallow | /vendor/ |
Disallow | /.env |
Disallow | /cache/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.pagethink.com/sitemap.xml |
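The group and sitemap records above can be checked against sample paths with Python's standard `urllib.robotparser`. The report lists only the parsed rules, so the `ROBOTS_TXT` text below is a reconstruction from the tables, not the file as served, and `is_allowed` is a hypothetical helper. No network request is made.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables (an assumption:
# the real file may contain comments or a different line order).
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/

Sitemap: https://www.pagethink.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, user_agent: str = "*") -> bool:
    """Report whether the reconstructed rules permit fetching `path`."""
    return parser.can_fetch(user_agent, "https://www.pagethink.com" + path)


if __name__ == "__main__":
    for sample in ("/about", "/vendor/autoload.php", "/.env", "/cache/page.html"):
        print(sample, is_allowed(sample))
    print(parser.site_maps())
```

The standard-library parser matches rules by path prefix, so any path beginning with one of the four disallowed prefixes is treated as blocked for every user agent.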