imperiaua.com
robots.txt

Robots Exclusion Standard data for imperiaua.com

Resource Scan

Scan Details

Site Domain imperiaua.com
Base Domain imperiaua.com
Scan Status Ok
Last Scan2025-03-28T23:28:26+00:00
Next Scan 2025-04-27T23:28:26+00:00

Last Scan

Scanned2025-03-28T23:28:26+00:00
URL https://imperiaua.com/robots.txt
Domain IPs 31.131.18.79
Response IP 31.131.18.79
Found Yes
Hash 01dad979b762d7c58b99a6c28e62fa8fa11b174d7a396bbb5deda9859a9c282a
SimHash 2d0cc64747d3

Groups

*

Rule Path
Disallow /admin/*
Disallow /index.php?route*
Disallow /index.php?route=product%2Fsearch*
Disallow /*?*
Disallow /*?page=$
Disallow /*%26page%3D$
Disallow /*?sort*
Disallow /*?limit*
Disallow /*?order=
Disallow /*%26order%3D
Disallow /*?limit=
Disallow /*%26limit%3D
Disallow /*?filter_name=
Disallow /*%26filter_name%3D
Disallow /*?filter_sub_category=
Disallow /*%26filter_sub_category%3D
Disallow /*?filter_description=
Disallow /*%26filter_description%3D
Allow /image/*
Allow /image/cache/*.png
Allow /image/cache/*.jpg
Allow /image/cache/*.jpeg
Allow /image/cache/*.svg
Allow /image/cache/*.pdf
Allow /catalog/view/javascript/*
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$

googlebot-image

Rule Path
Allow /*

Other Records

Field Value
sitemap https://imperiaua.com/sitemap.xml

Comments

  • Disalow
  • Disalow
  • Allow
  • Disallow indexation of sensitive files
  • Allow Google Image Bot
  • Our sitemap