agentiz.ca
robots.txt
Robots Exclusion Standard data for agentiz.ca
Resource Scan
Scan Details
Site Domain | agentiz.ca |
Base Domain | agentiz.ca |
Scan Status | Ok |
Last Scan | 2025-05-16T14:15:13+00:00 |
Next Scan | 2025-05-23T14:15:13+00:00 |
Last Scan
Scanned | 2025-05-16T14:15:13+00:00 |
URL | https://agentiz.ca/robots.txt |
Domain IPs | 104.21.30.22, 172.67.150.107, 2606:4700:3033::6815:1e16, 2606:4700:3036::ac43:966b |
Response IP | 172.67.150.107 |
Found | Yes |
Hash | 71260c743f06beac723372364d5cf2485ee2cd85022fbe76914be316894c36d4 |
SimHash | 681513df41db |
Groups
adsbot-google
adsbot-msn
ahrefsbot
ahrefssiteaudit
agentizbot
applebot
bingbot
bingpreview
chatgptbot
claudebot
cloudflareobservatory
duckassist
duckduckbot
ecosiabot
facebookcatalog
facebookexternalhit
feedfetcher-google
flipboardproxy
google-adwords-express
google-api-java-client
google-imageproxy
google-read-aloud
google-site-verification
googlebot
googlebot-image
googleother
gptbot
ia_archiver
linkedinbot
mediapartners-google
meta-externalfetcher
msnbot
naverbot
oai-searchbot
perplexitybot
pinterestbot
seznambot
slurp
structured-data-testing-tool
telegrambot
twitterbot
uptimerobot
Rule | Path |
---|---|
Disallow | /*-xs.jpg$ |
Disallow | /*-xxs.jpg$ |
Disallow | /*-xxxs.jpg$ |
Disallow | /*-xs.webp$ |
Disallow | /*-xxs.webp$ |
Disallow | /*-xxxs.webp$ |
Disallow | /*/?type=* |
Disallow | /*/assets |
Disallow | /*/policies |
Disallow | /*/redirect |
*
Rule | Path |
---|---|
Disallow | / |
Comments