laraza.com
robots.txt

Robots Exclusion Standard data for laraza.com

Resource Scan

Scan Details

Site Domain laraza.com
Base Domain laraza.com
Scan Status Ok
Last Scan2024-11-10T06:38:35+00:00
Next Scan 2024-11-17T06:38:35+00:00

Last Scan

Scanned2024-11-10T06:38:35+00:00
URL https://laraza.com/robots.txt
Domain IPs 192.0.66.64
Response IP 192.0.66.64
Found Yes
Hash 18aed1bdad46df4139bf747aa97705239359ffefed6b9e959e4eb3a52bbe4434
SimHash c921db084dd0

Groups

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/feed/newstand/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/feed/newstand/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/feed/newstand/$

*

Rule Path
Disallow /amp-helper-frame.html
Disallow /amp-permission-dialog.html

*

Rule Path
Disallow /xmlrpc.php

*

Rule Path
Disallow /*customize_changeset_uuid*
Disallow /*customize_autosaved*

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://laraza.com/news-sitemap.xml
sitemap https://laraza.com/wp-sitemap.xml

Comments

  • Bloqueo de busqueda.
  • Bloqueo de feeds.
  • Bloqueo de HTML de push en AMP.
  • Bloqueo de HTML de XMLRPC.
  • Bloqueo de URL con variables especificas.