gucitheblog.com
robots.txt

Robots Exclusion Standard data for gucitheblog.com

Resource Scan

Scan Details

Site Domain gucitheblog.com
Base Domain gucitheblog.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-12-27T08:10:18+00:00
Next Scan 2026-03-27T08:10:18+00:00

Last Successful Scan

Scanned2024-09-02T08:15:27+00:00
URL https://gucitheblog.com/robots.txt
Domain IPs 162.241.217.249
Response IP 162.241.217.249
Found Yes
Hash 394bb9cca901032caf1f2788dc7cbb00d740e60511c07fe1a547e4475a33943e
SimHash 105e8d60ef11

Groups

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://gucitheblog.com/?feed=xmlsitemap01
sitemap https://gucitheblog.com//?feed=xmlsitemap4
sitemap https://gucitheblog.com//?feed=xmlsitemap3
sitemap https://gucitheblog.com//?feed=xmlsitemap2
sitemap https://gucitheblog.com//?feed=xmlsitemap1