alma-da-comporta.com
robots.txt

Robots Exclusion Standard data for alma-da-comporta.com

Resource Scan

Scan Details

Site Domain alma-da-comporta.com
Base Domain alma-da-comporta.com
Scan Status Ok
Last Scan2025-05-05T15:35:21+00:00
Next Scan 2025-06-04T15:35:21+00:00

Last Scan

Scanned2025-05-05T15:35:21+00:00
URL https://alma-da-comporta.com/robots.txt
Redirect https://espiritodacomporta.com/robots.txt
Redirect Domain espiritodacomporta.com
Redirect Base espiritodacomporta.com
Domain IPs 2001:41d0:301::27, 54.36.91.62
Redirect IPs 2001:41d0:301::27, 54.36.91.62
Response IP 54.36.91.62
Found Yes
Hash 77793dfd54b136d7b83db366871d825595398a0bdd68a9f5cd5965b41236cbf3
SimHash 71085c44c3f3

Groups

*

Rule Path
Disallow /wp-login.php
Disallow */trackback
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Disallow /*author/
Allow *.js
Allow /*css?*
Allow /*js?*
Allow /*?utm*
Allow /css/?

googlebot-image

Rule Path
Allow /*

mediapartners-google*

Rule Path
Allow /*

Other Records

Field Value
sitemap https://alma-da-comporta.com/sitemap_index.xml

Comments

  • URLs que je ne veux pas indexer : Login Trackbacks Commentaires
  • URLs autorisees CSS JS Analytics pour les Bots
  • Autoriser Google Image
  • Autoriser Google AdSense

Warnings

  • 1 invalid line.