cadillaciv.com
robots.txt

Robots Exclusion Standard data for cadillaciv.com

Resource Scan

Scan Details

Site Domain cadillaciv.com
Base Domain cadillaciv.com
Scan Status Ok
Last Scan 2024-09-19T15:13:24+00:00
Next Scan 2024-10-19T15:13:24+00:00

Last Scan

Scanned 2024-09-19T15:13:24+00:00
URL http://cadillaciv.com/robots.txt
Redirect https://www.ivgm.com/robots.txt
Redirect Domain www.ivgm.com
Redirect Base ivgm.com
Domain IPs 216.241.213.55
Redirect IPs 52.84.229.108, 52.84.229.121, 52.84.229.127, 52.84.229.13
Response IP 52.84.229.121
Found Yes
Hash 205027cbe628ea888c9d7cd3b395fb3db1d58f8e9688c09147c0e509f81c1396
SimHash 6cc010b0c6a4

Groups

googlebot
storebot-google
adsbot-google
adsbot-google-mobile

Rule Path
Disallow /*.do*
Disallow /*.ajax*
Disallow /f_*
Disallow /*uri%3D*
Disallow /*blockCacheType%3D*
Disallow /*blockUri%3D*
Disallow /*cs%3Ao*
Disallow /undefined/*
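
The Disallow paths above use the `*` wildcard. As a rough illustration of how a crawler might test a URL path against them, here is a minimal Python sketch. The names `rule_matches`, `is_disallowed`, and `DISALLOW` are hypothetical, and the sketch compares raw strings without the percent-encoding normalization that a full implementation would apply, so treat it as an approximation rather than any crawler's actual matcher.

```python
import re

# Disallow patterns copied from the group listed above.
DISALLOW = [
    "/*.do*",
    "/*.ajax*",
    "/f_*",
    "/*uri%3D*",
    "/*blockCacheType%3D*",
    "/*blockUri%3D*",
    "/*cs%3Ao*",
    "/undefined/*",
]


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches the start of path.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with a regex wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, which gives prefix matching.
    return re.match(regex, path) is not None


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern in the group matches path."""
    return any(rule_matches(p, path) for p in DISALLOW)
```

For example, under these assumptions `is_disallowed("/search.do?x=1")` is true because of the `/*.do*` rule, while a path such as `/new-inventory/index.htm` matches none of the patterns.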

bingbot
adidxbot
bingpreview
microsoftpreview
duckduckbot
applebot
mj12bot
motominerbot
rogerbot
ravencrawler
twitterbot
slurp
semrushbot
siteauditbot
facebookexternalhit/1.1

Rule Path
Disallow /*.do*
Disallow /*.ajax*
Disallow /f_*
Disallow /*uri%3D*
Disallow /*blockCacheType%3D*
Disallow /*blockUri%3D*
Disallow /*cs%3Ao*
Disallow /undefined/*

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ivgm.com/sitemap.xml
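
Taken together, the scan lists three groups: the Google agents, a second set of named agents with a crawl-delay of 20, and a `*` fallback that disallows everything. The following Python sketch shows one way to look up which group applies to a given agent token. The function name `policy_for`, the group labels, and the exact-match lookup are assumptions made for illustration; real crawlers match product tokens more loosely.

```python
# Agent tokens copied from the groups in the scan; labels are illustrative.
GOOGLE_AGENTS = {
    "googlebot", "storebot-google", "adsbot-google", "adsbot-google-mobile",
}
THROTTLED_AGENTS = {
    "bingbot", "adidxbot", "bingpreview", "microsoftpreview", "duckduckbot",
    "applebot", "mj12bot", "motominerbot", "rogerbot", "ravencrawler",
    "twitterbot", "slurp", "semrushbot", "siteauditbot",
    "facebookexternalhit/1.1",
}


def policy_for(agent: str) -> dict:
    """Return the group that applies to an agent token (case-insensitive).

    'blocked' is True only for the '*' fallback, whose sole rule is
    'Disallow /'. The named groups still carry their own Disallow paths.
    """
    token = agent.strip().lower()
    if token in GOOGLE_AGENTS:
        return {"group": "google", "crawl_delay": None, "blocked": False}
    if token in THROTTLED_AGENTS:
        return {"group": "throttled", "crawl_delay": 20, "blocked": False}
    # Any agent not named above falls through to the '*' group.
    return {"group": "*", "crawl_delay": None, "blocked": True}
```

Under these assumptions, `policy_for("bingbot")` reports a crawl delay of 20, and an unlisted agent falls through to the `*` group and is blocked from the whole site.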