imt.ie
robots.txt

Robots Exclusion Standard data for imt.ie

Resource Scan

Scan Details

Site Domain imt.ie
Base Domain imt.ie
Scan Status Ok
Last Scan 2024-10-28T14:48:56+00:00
Next Scan 2024-11-04T14:48:56+00:00

Last Scan

Scanned 2024-10-28T14:48:56+00:00
URL https://imt.ie/robots.txt
Redirect https://www.imt.ie/robots.txt
Redirect Domain www.imt.ie
Redirect Base imt.ie
Domain IPs 3.248.53.223, 52.51.72.187
Redirect IPs 3.248.53.223, 52.51.72.187
Response IP 3.248.53.223
Found Yes
Hash 8035fe0f001300290c4a8bf533173056986e38f340e8c4f13c74349aa33ab566
SimHash 5354c170ed91

Groups

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

neticlebot

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

livelapbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin
Disallow /account/
Disallow /?orderby*
Disallow /?s=*
Disallow /news/email-template/*
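
The groups above can be checked programmatically. The following is a minimal sketch, assuming Python's standard-library `urllib.robotparser`. The `ROBOTS_EXCERPT` text is a hypothetical reconstruction of a few of the listed groups, not the actual file served at https://www.imt.ie/robots.txt, and the names `build_parser`, `is_allowed`, and `ExampleBrowser` are illustrative only. The wildcard paths (such as `/?s=*`) are left out of the excerpt because the standard-library parser appears to do plain prefix matching rather than wildcard expansion.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt rebuilt from the groups listed in this report.
# The real file may differ in casing, ordering, and extra directives.
ROBOTS_EXCERPT = """\
User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: *
Disallow: /wp-admin
Disallow: /account/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, url: str) -> bool:
    """Return True if the given user agent may fetch the URL under the parsed rules."""
    return parser.can_fetch(agent, url)


parser = build_parser(ROBOTS_EXCERPT)

if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "ExampleBrowser"):
        for url in ("https://www.imt.ie/news/", "https://www.imt.ie/account/"):
            print(agent, url, is_allowed(parser, agent, url))
```

Agents that have their own group with `Disallow /` are blocked everywhere, while any other agent falls through to the `*` group and is blocked only under the listed path prefixes.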