pageglance.com
robots.txt
Robots Exclusion Standard data for pageglance.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pageglance.com |
Base Domain | pageglance.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2025-08-25T17:24:06+00:00 |
Next Scan | 2025-11-23T17:24:06+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2025-04-28T13:47:14+00:00 |
URL | https://pageglance.com/robots.txt |
Redirect | https://www.pageglimpse.org/robots.txt |
Redirect Domain | www.pageglimpse.org |
Redirect Base | pageglimpse.org |
Domain IPs | 66.160.134.61 |
Redirect IPs | 66.160.134.61 |
Response IP | 66.160.134.61 |
Found | Yes |
Hash | 7036e89f7eb3db72482932ca4304dc1792156afff206398ec910380164fd6761 |
SimHash | f3975223c9e7 |
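The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file. The scan does not state which algorithm it uses, so the following is only a minimal sketch under that assumption; the function names `content_hash` and `has_changed` are hypothetical.

```python
import hashlib

# Hash recorded in the last successful scan (copied from the table above).
RECORDED_HASH = "7036e89f7eb3db72482932ca4304dc1792156afff206398ec910380164fd6761"


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scan hashes the raw response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Report whether a freshly fetched body differs from the recorded hash."""
    return content_hash(body) != recorded
```

If the assumption holds, comparing digests like this is a cheap way to tell whether the file changed between scans without storing its full text.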
Groups
*
Rule | Path |
---|---|
Disallow | /censored/ |
Disallow | /add-review/ |
Disallow | /add-coupon/ |
Disallow | /external/ |
Disallow | /thin/ |
Disallow | /scrept/ |
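To see how these rules apply to a given path, the group can be rebuilt as robots.txt lines and checked with Python's standard `urllib.robotparser`. This is a sketch only: the line list below is reconstructed from the table, not copied from the original file (which also contained lines the scan marked invalid), and `is_allowed` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in the table above; the original file's
# exact text, ordering, and invalid lines are not known.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /censored/",
    "Disallow: /add-review/",
    "Disallow: /add-coupon/",
    "Disallow: /external/",
    "Disallow: /thin/",
    "Disallow: /scrept/",
]


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if the reconstructed rules permit `agent` to fetch `path`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)
    return parser.can_fetch(agent, path)
```

Under these rules, a path such as `/thin/page` is disallowed for every crawler, while paths outside the six listed prefixes remain allowed.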
Other Records
Field | Value |
---|---|
sitemap | https://www.pageglimpse.org/sitemap.xml |
Warnings
- 2 invalid lines.
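The scan does not say which lines were invalid or how it decides. As a rough illustration, a line can be treated as valid when it is blank, a `#` comment, or a `field: value` record, and invalid otherwise. The pattern and the name `count_invalid_lines` are assumptions for this sketch, not the scanner's actual logic.

```python
import re

# Assumed shape of a valid record: a field name made of letters and hyphens,
# followed by a colon. Blank lines and "#" comments are also accepted.
RECORD = re.compile(r"^[A-Za-z][A-Za-z-]*\s*:")


def count_invalid_lines(text: str) -> int:
    """Count lines that are neither blank, a comment, nor a field: value record."""
    invalid = 0
    for raw in text.splitlines():
        # Strip a trailing comment, then surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if line and not RECORD.match(line):
            invalid += 1
    return invalid
```

Stray HTML or free text in a robots.txt file would be counted by this check, which is one common way such warnings arise.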