guyanatimesgy.com
robots.txt

Robots Exclusion Standard data for guyanatimesgy.com

Resource Scan

Scan Details

Site Domain guyanatimesgy.com
Base Domain guyanatimesgy.com
Scan Status Ok
Last Scan2026-01-06T21:54:47+00:00
Next Scan 2026-01-13T21:54:47+00:00

Last Scan

Scanned2026-01-06T21:54:47+00:00
URL https://guyanatimesgy.com/robots.txt
Domain IPs 104.21.75.124, 172.67.175.152, 2606:4700:3031::ac43:af98, 2606:4700:3035::6815:4b7c
Response IP 104.21.75.124
Found Yes
Hash 584cf2e7e53ccea2569e25e48c27a06e1b98f011cbc23c7e25567c1c234690f7
SimHash e818f503d6f1

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /z/j/
Disallow /z/c/
Disallow /stats/
Disallow /dh_
Disallow /about/
Disallow /contact/
Disallow /tag/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /contact
Disallow /manual
Disallow /manual/*
Disallow /phpmanual/
Disallow /category/

Other Records

Field Value
crawl-delay 15

googlebot

Rule Path
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$

Other Records

Field Value
crawl-delay 15

duggmirror

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow
Allow /*

Other Records

Field Value
crawl-delay 15

mediapartners-google*

Rule Path
Disallow
Allow /*

Other Records

Field Value
crawl-delay 15

Comments

  • disallow all files in these directories
  • disallow all files ending with these extensions
  • disable duggmirror
  • allow google image bot to search all images
  • allow adsense bot on entire site