thorpesgmcinc.com
robots.txt

Robots Exclusion Standard data for thorpesgmcinc.com

Resource Scan

Scan Details

Site Domain thorpesgmcinc.com
Base Domain thorpesgmcinc.com
Scan Status Ok
Last Scan2024-09-29T04:06:41+00:00
Next Scan 2024-10-29T04:06:41+00:00

Last Scan

Scanned2024-09-29T04:06:41+00:00
URL https://thorpesgmcinc.com/robots.txt
Redirect https://www.thorpesgmcinc.com/robots.txt
Redirect Domain www.thorpesgmcinc.com
Redirect Base thorpesgmcinc.com
Domain IPs 18.155.68.128, 18.155.68.33, 18.155.68.46, 18.155.68.81
Redirect IPs 18.155.68.128, 18.155.68.33, 18.155.68.46, 18.155.68.81
Response IP 18.155.68.81
Found Yes
Hash 2f577896aedad7ac835937733f4e7994e4f90c1fba1ca8dfb3c381d0a013e264
SimHash 4cd010b0d6f0

Groups

googlebot
storebot-google
adsbot-google
adsbot-google-mobile

Rule Path
Disallow /*.do*
Disallow /*.ajax*
Disallow /f_*
Disallow /*uri%3D*
Disallow /*blockCacheType%3D*
Disallow /*blockUri%3D*
Disallow /*cs%3Ao*
Disallow /undefined/*

bingbot
adidxbot
bingpreview
microsoftpreview
duckduckbot
applebot
mj12bot
motominerbot
rogerbot
ravencrawler
twitterbot
slurp
semrushbot
siteauditbot
facebookexternalhit/1.1

Rule Path
Disallow /*.do*
Disallow /*.ajax*
Disallow /f_*
Disallow /*uri%3D*
Disallow /*blockCacheType%3D*
Disallow /*blockUri%3D*
Disallow /*cs%3Ao*
Disallow /undefined/*

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thorpesgmcinc.com/sitemap.xml