thebl.com
robots.txt

Robots Exclusion Standard data for thebl.com

Resource Scan

Scan Details

Site Domain thebl.com
Base Domain thebl.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-23T15:40:36+00:00
Next Scan 2024-12-22T15:40:36+00:00

Last Successful Scan

Scanned2023-02-09T10:45:37+00:00
URL https://thebl.com/robots.txt
Domain IPs 104.21.36.79, 172.67.190.101, 2606:4700:3035::ac43:be65, 2606:4700:3036::6815:244f
Response IP 172.67.190.101
Found Yes
Hash 96a24364b03f3e33e21e72bef60090f447341c93e203ed9d7e028997d6fd65fb
SimHash e9014c40ca93

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /search?q=*
Disallow *?replytocom
Disallow */attachment/*
Disallow /counter/
Disallow /cronjob/
Disallow /data/
Disallow /lib/
Disallow /print/

Other Records

Field Value
sitemap https://thebl.com/sitemap.xml