illi-pro.com
robots.txt

Robots Exclusion Standard data for illi-pro.com

Resource Scan

Scan Details

Site Domain illi-pro.com
Base Domain illi-pro.com
Scan Status Ok
Last Scan2025-11-20T06:21:42+00:00
Next Scan 2025-11-27T06:21:42+00:00

Last Scan

Scanned2025-11-20T06:21:42+00:00
URL https://www.illi-pro.com/robots.txt
Domain IPs 104.21.3.25, 172.67.130.18, 2606:4700:3030::ac43:8212, 2606:4700:3032::6815:319
Response IP 172.67.130.18
Found Yes
Hash 1aeded78797831a36a173a4694c70914d250026ff345b64650e163222abbd614
SimHash e8155d9246fe

Groups

googlebot

Rule Path
Disallow /tag/
Disallow /page/*
Disallow /search/
Disallow /categoria/
Disallow /category/
Disallow /pages/
Disallow /rss.xml
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-login.php
Disallow /search
Disallow /?s=
Disallow /*.php$
Disallow /*.inc$
Disallow /*.cgi$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*.php*
Disallow /*?*
Disallow /*?
Disallow /wp-*

*

Rule Path
Disallow /wp-admin/
Disallow /wp-
Disallow /feed/
Disallow /trackback/
Disallow /uploads/

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.illi-pro.com/sitemap.xml

Comments

  • disallow all files ending with these extensions