webisjericho.com
robots.txt
Robots Exclusion Standard data for webisjericho.com
Resource Scan
Scan Details
Site Domain | webisjericho.com |
Base Domain | webisjericho.com |
Scan Status | Ok |
Last Scan | 2024-09-27T23:14:48+00:00 |
Next Scan | 2024-10-04T23:14:48+00:00 |
Last Scan
Scanned | 2024-09-27T23:14:48+00:00 |
URL | https://webisjericho.com/robots.txt |
Domain IPs | 104.26.8.108, 104.26.9.108, 172.67.69.147, 2606:4700:20::681a:86c, 2606:4700:20::681a:96c, 2606:4700:20::ac43:4593 |
Response IP | 104.26.9.108 |
Found | Yes |
Hash | 9b46d420e6fc8551338e8c5924ba850c8e21a7f3757d4ab72bb75cbe517ed8de |
SimHash | 62345a86eff1 |
Groups
*
Rule | Path | Comment |
---|---|---|
Disallow | /wp-admin/ | block access to admin section |
Disallow | /wp-login.php | block access to admin section |
Disallow | /search/ | block access to internal search result pages |
Disallow | *?s=* | block access to internal search result pages |
Disallow | *?p=* | block access to pages for which permalinks fails |
Disallow | *%26p%3D* | block access to pages for which permalinks fails |
Disallow | *%26preview%3D* | block access to preview pages |
Disallow | /tag/ | block access to tag pages |
Disallow | /author/ | block access to author pages |
Disallow | /404-error/ | block access to 404 page |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |