arice.com
robots.txt

Robots Exclusion Standard data for arice.com

Resource Scan

Scan Details

Site Domain arice.com
Base Domain arice.com
Scan Status Ok
Last Scan2025-08-03T18:38:19+00:00
Next Scan 2025-09-02T18:38:19+00:00

Last Scan

Scanned2025-08-03T18:38:19+00:00
URL https://arice.com/robots.txt
Domain IPs 2001:41d0:301::27, 54.36.91.62
Response IP 54.36.91.62
Found Yes
Hash b5cd5d9f926cf276eb8f121425b8fd3aee213d2a4ce9a0460c08deb39f8e6181
SimHash 61805c200ff2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow */trackback
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Allow /*css?*
Allow /*js?*
Allow /*?utm*
Allow /css/?

Comments

  • URLs à désindexer : Login Trackbacks Commentaires
  • URLs autorisées CSS JS Analytics pour les Bots