guidemaster.org
robots.txt

Robots Exclusion Standard data for guidemaster.org

Resource Scan

Scan Details

Site Domain guidemaster.org
Base Domain guidemaster.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 2025-04-28T16:39:12+00:00
Next Scan 2025-07-27T16:39:12+00:00
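
The two scan timestamps above can be compared directly. The following is a minimal sketch that parses them as ISO 8601 values and computes the gap between the last scan and the next scheduled one. The variable names are illustrative, and nothing here is taken from the scanner itself; it only does arithmetic on the two values shown.

```python
from datetime import datetime

# Timestamps copied from the Scan Details rows above.
last_scan = datetime.fromisoformat("2025-04-28T16:39:12+00:00")
next_scan = datetime.fromisoformat("2025-07-27T16:39:12+00:00")

# Difference between the scheduled next scan and the last (failed) scan.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the gap is a whole number of days, since the time-of-day component is identical in both.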

Last Successful Scan

Scanned 2023-06-16T15:19:55+00:00
URL https://guidemaster.org/robots.txt
Redirect https://www.guidemaster.org/robots.txt
Redirect Domain www.guidemaster.org
Redirect Base guidemaster.org
Domain IPs 104.21.60.180, 172.67.199.43, 2606:4700:3033::ac43:c72b, 2606:4700:3037::6815:3cb4
Redirect IPs 104.21.60.180, 172.67.199.43, 2606:4700:3033::ac43:c72b, 2606:4700:3037::6815:3cb4
Response IP 104.21.60.180
Found Yes
Hash bf34142a71fdbd7beca9e91d09f3b8b5ac3c8e517f87d1eacf9ca9635c64d21f
SimHash cb9a5c0186d3

Groups

yandexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

etaospider

Rule Path
Disallow /

facebot

Rule Path
Disallow (empty path; nothing is disallowed)

twitterbot

Rule Path
Disallow (empty path; nothing is disallowed)

*

Rule Path
Disallow /user-brand-rank-lists
Disallow /user-goods-rank-lists
Disallow /user-brand-lists
Disallow /user-goods-lists
Disallow /user

Other Records

Field Value
sitemap https://www.guidemaster.org/sitemap.xml

Comments

  • Production Robots.txt file
  • Sitemap
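
The groups and records listed above can be checked with Python's standard `urllib.robotparser`. The sketch below is a reconstruction, not the original file: the `ROBOTS_TXT` text is reassembled from the parsed report, so the user-agent casing, the line order, and the placement of the two comments are assumptions, and the bot name `examplebot` and the test paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report above (assumed layout, lowercase agent names
# as the report shows them; the real file may differ in casing and order).
ROBOTS_TXT = """\
# Production Robots.txt file
User-agent: yandexbot
Crawl-delay: 10

User-agent: etaospider
Disallow: /

User-agent: facebot
Disallow:

User-agent: twitterbot
Disallow:

User-agent: *
Disallow: /user-brand-rank-lists
Disallow: /user-goods-rank-lists
Disallow: /user-brand-lists
Disallow: /user-goods-lists
Disallow: /user

# Sitemap
Sitemap: https://www.guidemaster.org/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.guidemaster.org"

# etaospider is blocked from the whole site.
print(parser.can_fetch("etaospider", base + "/"))
# An empty Disallow line allows everything for that group.
print(parser.can_fetch("twitterbot", base + "/user/1"))
# Any other bot falls through to the "*" group (hypothetical bot name).
print(parser.can_fetch("examplebot", base + "/user-goods-lists"))
print(parser.can_fetch("examplebot", base + "/"))
# yandexbot has its own group with only a crawl delay.
print(parser.crawl_delay("yandexbot"))
# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Because this parser matches rules by path prefix, the `/user` rule in the `*` group already covers the four longer `/user-...` paths; listing them separately does not change the result here. Also note that `yandexbot` matches its own group, so under this parser the `*` group's `Disallow` rules do not apply to it.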