walkingtheboards.com
robots.txt

Robots Exclusion Standard data for walkingtheboards.com

Resource Scan

Scan Details

Site Domain walkingtheboards.com
Base Domain walkingtheboards.com
Scan Status Ok
Last Scan2025-11-22T21:33:54+00:00
Next Scan 2025-12-22T21:33:54+00:00

Last Scan

Scanned2025-11-22T21:33:54+00:00
URL https://www.walkingtheboards.com/robots.txt
Domain IPs 104.18.68.40, 104.18.69.40, 2606:4700::6812:4428, 2606:4700::6812:4528
Response IP 104.18.69.40
Found Yes
Hash 880eaa2cfdeda11ea288acd83208c07bd843d17252840c76b606554d1ce1d9d4
SimHash 6f1dd860eb11

Groups

amazonbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Disallow /login

adsbot-google

Rule Path
Disallow /login

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.walkingtheboards.com/sitemap.xml