iriscorporate.com
robots.txt

Robots Exclusion Standard data for iriscorporate.com

Resource Scan

Scan Details

Site Domain iriscorporate.com
Base Domain iriscorporate.com
Scan Status Ok
Last Scan 2025-11-11T21:19:57+00:00
Next Scan 2025-12-11T21:19:57+00:00

Last Scan

Scanned 2025-11-11T21:19:57+00:00
URL https://iriscorporate.com/robots.txt
Domain IPs 104.21.51.189, 172.67.184.37, 2606:4700:3031::ac43:b825, 2606:4700:3037::6815:33bd
Response IP 104.21.51.189
Found Yes
Hash 1c038070370a89963881cd9992d45229ebe40c53f19e91a262c702490a4f7000
SimHash 42005d44eff1

Groups

*

Rule Path
Disallow /wp-login.php
Disallow */trackback
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Allow /*css?*
Allow /*js?*
Allow /*?utm*
Allow /css/?
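
The rules in the `*` group can be evaluated with a small matcher. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the matching conventions described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical. Note that `*/trackback` does not start with `/`; this sketch treats the leading `*` as an ordinary wildcard, which real crawlers may or may not do.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("Disallow", "/wp-login.php"),
    ("Disallow", "*/trackback"),
    ("Disallow", "/*/comments"),
    ("Disallow", "/cgi-bin"),
    ("Disallow", "/*.php$"),
    ("Disallow", "/*.inc$"),
    ("Disallow", "/*.gz"),
    ("Disallow", "/*.cgi"),
    ("Allow", "/*css?*"),
    ("Allow", "/*js?*"),
    ("Allow", "/*?utm*"),
    ("Allow", "/css/?"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end;
    every other character (including '?') is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled."""
    best = None  # (pattern length, directive is Allow)
    for directive, pattern in rules:
        # re.match anchors at the start of the path, as prefix matching requires.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "Allow")
            # Longer pattern wins; on equal length, True (Allow) sorts higher.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


print(is_allowed("/wp-login.php"))            # False
print(is_allowed("/index.php?utm_source=x"))  # True
print(is_allowed("/style.css?ver=1"))         # True
print(is_allowed("/archive.gz"))              # False
```

In the second call, `/*.php$` does not match because the query string follows `.php`, so only `Allow /*?utm*` applies.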

googlebot-image

Rule Path
Allow /*

mediapartners-google*

Rule Path
Allow /*

Other Records

Field Value
sitemap https://irisdatacapture.com/sitemap_index.xml
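
The sitemap record points at irisdatacapture.com, while the scanned site is iriscorporate.com. The scan does not say whether this is intended. A rough sketch of how such a host mismatch could be detected is shown here; the function name `sitemap_host_matches` is hypothetical, and the check is only a simple hostname comparison.

```python
from urllib.parse import urlsplit

# Values taken from the scan record above.
SITE_DOMAIN = "iriscorporate.com"
SITEMAP_URL = "https://irisdatacapture.com/sitemap_index.xml"


def sitemap_host_matches(site_domain, sitemap_url):
    """Return True if the sitemap is hosted on the site domain or a subdomain of it."""
    host = urlsplit(sitemap_url).hostname or ""
    return host == site_domain or host.endswith("." + site_domain)


print(sitemap_host_matches(SITE_DOMAIN, SITEMAP_URL))  # False
```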

Comments

  • URLs I do not want indexed: login, trackbacks, comments
  • URLs allowed for bots: CSS, JS, analytics
  • Allow Google Image
  • Allow Google AdSense