theflag.org
robots.txt

Robots Exclusion Standard data for theflag.org

Resource Scan

Scan Details

Site Domain theflag.org
Base Domain theflag.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-20T17:58:24+00:00
Next Scan 2024-08-18T17:58:24+00:00

Last Successful Scan

Scanned2023-10-30T10:14:52+00:00
URL https://www.theflag.org/robots.txt
Domain IPs 104.21.87.84, 172.67.142.133, 2606:4700:3031::ac43:8e85, 2606:4700:3033::6815:5754
Response IP 172.67.142.133
Found Yes
Hash 69e48a0b914a424a01fc235ceb3aeb6d96f54feaf78e1aa950fc09edbcaa8a2f
SimHash 6b5ddc52e911

Groups

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Disallow /login

adsbot-google

Rule Path
Disallow /login

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.theflag.org/sitemap.xml