goatformat.com
robots.txt

Robots Exclusion Standard data for goatformat.com

Resource Scan

Scan Details

Site Domain goatformat.com
Base Domain goatformat.com
Scan Status Ok
Last Scan 2024-09-21T18:20:49+00:00
Next Scan 2024-09-28T18:20:49+00:00

Last Scan

Scanned 2024-09-21T18:20:49+00:00
URL https://goatformat.com/robots.txt
Redirect https://www.goatformat.com/robots.txt
Redirect Domain www.goatformat.com
Redirect Base goatformat.com
Domain IPs 199.34.228.77
Redirect IPs 199.34.228.77
Response IP 199.34.228.77
Found Yes
Hash 531016b0e1a83e0d37ff61e7e31945b33a95755c80b5958b35165fb98be8269b
SimHash 4944dc4c2fdb

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /https%3A//www.goatformat.com/home/category/sjc-indianapolis
Disallow /gr2005.html
Disallow /gr2014.html
Disallow /newhome.html
Disallow /strategy.html
Disallow /gr2017.html
Disallow /historic-premier-events.html
Disallow /decks1.html
Disallow /gr2018.html
Disallow /tournaments.html
Disallow /gr2019.html
Disallow /store--support.html
Disallow /gr2020.html
Disallow /gr2021.html
Disallow /gr2022.html
Disallow /ggpcleveland.html

Other Records

Field Value
sitemap https://www.goatformat.com/sitemap.xml
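
The groups above can be checked against specific URLs with Python's standard urllib.robotparser module. The following is a minimal sketch, not the scanner's own method. It assumes a reconstruction of the file from the records listed above: the real file's ordering and whitespace are not known, only a few of the "*" Disallow paths are reproduced, and the names ROBOTS_TXT, build_parser, BASE, and the user agent "examplebot" are hypothetical. Python's parser matches user agents by substring and may differ in detail from how a given crawler interprets the file.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file from the groups in the report.
# Only a subset of the "*" Disallow paths is included for brevity.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /strategy.html
Disallow: /tournaments.html

Sitemap: https://www.goatformat.com/sitemap.xml
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text from a string, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rp = build_parser()
BASE = "https://www.goatformat.com"

# nerdybot has "Disallow: /", so every path is blocked for it.
print("nerdybot /:", rp.can_fetch("nerdybot", BASE + "/"))

# dotbot has its own group with no Disallow rules, so the "*" rules
# do not apply to it; only the crawl delay does.
print("dotbot /strategy.html:", rp.can_fetch("dotbot", BASE + "/strategy.html"))
print("dotbot crawl delay:", rp.crawl_delay("dotbot"))

# Any other agent (hypothetical "examplebot") falls back to the "*" group.
print("examplebot /strategy.html:", rp.can_fetch("examplebot", BASE + "/strategy.html"))
print("examplebot /index.html:", rp.can_fetch("examplebot", BASE + "/index.html"))

# Sitemap records are independent of any user-agent group (Python 3.8+).
print("sitemaps:", rp.site_maps())
```

Parsing from a string keeps the check reproducible against the scanned snapshot. To test the live file instead, RobotFileParser also provides set_url() and read(), which fetch the file over the network.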