gregweeks.com
robots.txt

Robots Exclusion Standard data for gregweeks.com

Resource Scan

Scan Details

Site Domain gregweeks.com
Base Domain gregweeks.com
Scan Status Ok
Last Scan 2024-10-22T02:52:16+00:00
Next Scan 2024-11-21T02:52:16+00:00

Last Scan

Scanned 2024-10-22T02:52:16+00:00
URL https://gregweeks.com/robots.txt
Redirect https://www.gregweeks.com/robots.txt
Redirect Domain www.gregweeks.com
Redirect Base gregweeks.com
Domain IPs 54.230.71.37, 54.230.71.45, 54.230.71.70, 54.230.71.80
Redirect IPs 108.156.133.126, 108.156.133.52, 108.156.133.7, 108.156.133.84
Response IP 108.156.133.126
Found Yes
Hash 095b86e3a57d146eef6f28e5037c8d7808414b534a6555a02ce70b1298d85eea
SimHash 4cd010b0d6f4

Groups

googlebot
storebot-google
adsbot-google
adsbot-google-mobile

Rule Path
Disallow /*.do*
Disallow /*.ajax*
Disallow /f_*
Disallow /*uri%3D*
Disallow /*blockCacheType%3D*
Disallow /*blockUri%3D*
Disallow /*cs%3Ao*
Disallow /undefined/*
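
The rule paths above rely on the `*` wildcard. As a minimal sketch of how such patterns are commonly interpreted (following RFC 9309, where `*` matches any run of characters, a trailing `$` anchors the end, and matching starts at the first character of the path), the example below translates each rule into a regular expression. The helper names `rule_to_regex`, `rule_matches`, and `is_disallowed` are hypothetical, the sample paths in the usage note are invented for illustration, and real crawlers may differ in details such as percent-encoding normalization.

```python
import re

# Disallow paths listed for this group in the scan above.
DISALLOW_RULES = [
    "/*.do*",
    "/*.ajax*",
    "/f_*",
    "/*uri%3D*",
    "/*blockCacheType%3D*",
    "/*blockUri%3D*",
    "/*cs%3Ao*",
    "/undefined/*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex (sketch)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def rule_matches(rule: str, path: str) -> bool:
    """Return True if the path matches the rule, starting at its first character."""
    return rule_to_regex(rule).match(path) is not None


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule in this group matches the path."""
    return any(rule_matches(rule, path) for rule in DISALLOW_RULES)
```

Under these assumptions, `is_disallowed("/cart.do")` and `is_disallowed("/undefined/page")` are true, while `is_disallowed("/about")` is false because no listed pattern matches it.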

bingbot
adidxbot
bingpreview
microsoftpreview
duckduckbot
applebot
mj12bot
motominerbot
rogerbot
ravencrawler
twitterbot
slurp
semrushbot
siteauditbot
facebookexternalhit/1.1

Rule Path
Disallow /*.do*
Disallow /*.ajax*
Disallow /f_*
Disallow /*uri%3D*
Disallow /*blockCacheType%3D*
Disallow /*blockUri%3D*
Disallow /*cs%3Ao*
Disallow /undefined/*

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.gregweeks.com/sitemap.xml
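
Taken together, the three groups mean that the rules a crawler sees depend on its user-agent token. The sketch below is a simplified illustration, assuming a case-insensitive exact match on the token with a fallback to the `*` group; `select_group` and the dictionary layout are hypothetical, and `crawl-delay` is a non-standard field that not every crawler honors.

```python
# User-agent tokens and rules as listed in the scan above.
FIRST_GROUP = {
    "googlebot", "storebot-google", "adsbot-google", "adsbot-google-mobile",
}
SECOND_GROUP = {
    "bingbot", "adidxbot", "bingpreview", "microsoftpreview", "duckduckbot",
    "applebot", "mj12bot", "motominerbot", "rogerbot", "ravencrawler",
    "twitterbot", "slurp", "semrushbot", "siteauditbot",
    "facebookexternalhit/1.1",
}
SHARED_RULES = [
    "/*.do*", "/*.ajax*", "/f_*", "/*uri%3D*",
    "/*blockCacheType%3D*", "/*blockUri%3D*", "/*cs%3Ao*", "/undefined/*",
]


def select_group(user_agent_token: str) -> dict:
    """Pick the group that applies to a crawler token (simplified sketch)."""
    token = user_agent_token.lower()
    if token in FIRST_GROUP:
        return {"disallow": SHARED_RULES, "crawl_delay": None}
    if token in SECOND_GROUP:
        # Same paths as the first group, plus the crawl-delay record.
        return {"disallow": SHARED_RULES, "crawl_delay": 20}
    # Any crawler not named above falls under "*", which disallows everything.
    return {"disallow": ["/"], "crawl_delay": None}
```

For example, `select_group("Bingbot")` returns the shared rules with a crawl delay of 20, while an unlisted token such as the invented `"examplebot"` receives `Disallow: /`.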