thebureauconnection.com
robots.txt

Robots Exclusion Standard data for thebureauconnection.com

Resource Scan

Scan Details

Site Domain thebureauconnection.com
Base Domain thebureauconnection.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-08-06T12:16:49+00:00
Next Scan 2024-11-04T12:16:49+00:00

Last Successful Scan

Scanned 2024-04-09T11:36:32+00:00
URL https://thebureauconnection.com/robots.txt
Domain IPs 104.21.83.239, 172.67.183.117, 2606:4700:3032::6815:53ef, 2606:4700:3037::ac43:b775
Response IP 172.67.183.117
Found Yes
Hash 0d6de75c9109956c9350dbe7fe8f9fed093793ef47064955bd01ce56b8d56f1d
SimHash 6e10cee5ea90

Groups

*

Rule Path
Disallow */page*.php$
Disallow /map*.php$
Disallow /sitemap*.php$
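
The three Disallow paths above use the `*` wildcard and the `$` end-of-path anchor. As a rough illustration of how such patterns might be evaluated, here is a minimal Python sketch. It assumes the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path) and ignores Allow rules, rule precedence, and percent-encoding. The function names `rule_to_regex` and `is_disallowed` are hypothetical and do not come from the scan tool.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = ["*/page*.php$", "/map*.php$", "/sitemap*.php$"]


def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex (sketch)."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    # A trailing '$' means the pattern must match through the end of the path.
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/blog/page2.php", "/map-1.php", "/sitemap.xml", "/page2.php?x=1"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/sitemap.xml` stays crawlable because none of the patterns end in `.xml`, and a path with a query string such as `/page2.php?x=1` is not matched because the `$` anchor requires the path to end in `.php`.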

Other Records

Field Value
sitemap https://thebureauconnection.com/sitemap.xml
sitemap https://thebureauconnection.com/c1/sitemap.xml
sitemap https://thebureauconnection.com/c2/sitemap.xml
sitemap https://thebureauconnection.com/c3/sitemap.xml
sitemap https://thebureauconnection.com/c4/sitemap.xml
sitemap https://thebureauconnection.com/c5/sitemap.xml

Warnings

  • `host` is not a known field.
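
A warning like the one above could be produced by comparing each record's field name against a list of recognized fields. The sketch below is only an illustration of that idea. The `KNOWN_FIELDS` set is an assumption, not the scanner's actual list, and the sample input (including the `Host: example.com` line) is hypothetical, since the original file contents are not reproduced in this report.

```python
# Assumed set of recognized field names; the real scanner's list may differ.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_field_warnings(robots_txt: str) -> list[str]:
    """Collect one warning per unrecognized field name, in order of appearance."""
    warnings: list[str] = []
    for raw in robots_txt.splitlines():
        # Drop comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Field names are case-insensitive; split only on the first colon
        # so that URLs in the value are left intact.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            message = f"`{field}` is not a known field."
            if message not in warnings:
                warnings.append(message)
    return warnings


if __name__ == "__main__":
    # Hypothetical sample; the Host value is a placeholder.
    sample = (
        "User-agent: *\n"
        "Disallow: /map*.php$\n"
        "Host: example.com\n"
        "Sitemap: https://thebureauconnection.com/sitemap.xml\n"
    )
    for warning in unknown_field_warnings(sample):
        print(warning)
```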