gs1ve.org
robots.txt

Robots Exclusion Standard data for gs1ve.org

Resource Scan

Scan Details

Site Domain gs1ve.org
Base Domain gs1ve.org
Scan Status Ok
Last Scan 2025-11-12T21:05:35+00:00
Next Scan 2025-12-12T21:05:35+00:00

Last Scan

Scanned 2025-11-12T21:05:35+00:00
URL https://gs1ve.org/robots.txt
Response IP 148.72.152.131
Found Yes
Hash 211538e845a5acd8d88c5bbf0dcba78aa2f80488f5298f1c3bc07f6e44f3c64f
SimHash c8554d14c950

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /wp-admin/
Disallow /wp-content/uploads/wpo-plugins-tables-list.json
Allow /feed/$
Disallow /feed
Disallow /comments/feed
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Allow /*.js$
Allow /*.css$
Disallow /*.pdf$
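
To see how the rules in this group interact, the following is a minimal sketch of a path matcher. It assumes the RFC 9309 reading of the rules: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data; individual crawlers may resolve conflicts differently.

```python
import re

# The "*" group from the scan above, in file order.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-content/themes/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/uploads/wpo-plugins-tables-list.json"),
    ("allow", "/feed/$"),
    ("disallow", "/feed"),
    ("disallow", "/comments/feed"),
    ("disallow", "/*/feed/$"),
    ("disallow", "/*/feed/rss/$"),
    ("disallow", "/*/trackback/$"),
    ("disallow", "/*/*/feed/$"),
    ("disallow", "/*/*/feed/rss/$"),
    ("disallow", "/*/*/trackback/$"),
    ("disallow", "/*/*/*/feed/$"),
    ("disallow", "/*/*/*/feed/rss/$"),
    ("disallow", "/*/*/*/trackback/$"),
    ("allow", "/*.js$"),
    ("allow", "/*.css$"),
    ("disallow", "/*.pdf$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under longest-match rules."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/feed/", "/feed/rss/", "/blog/post/feed/",
              "/docs/report.pdf", "/about/",
              "/wp-includes/js/jquery/jquery.min.js"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under this reading, `/feed/` stays crawlable because `Allow /feed/$` is longer than `Disallow /feed`, while a script below `/wp-includes/` would still be blocked, since `Disallow /wp-includes/` is a longer pattern than `Allow /*.js$`.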

Other Records

Field Value
sitemap https://gs1ve.org/sitemap_index.xml

Comments

  • Block access to the various feeds the site generates.
  • Block URLs ending in /trackback/, which serve as trackback URLs.
  • Avoid blocking CSS and JS.
  • Block all PDFs.
  • Block parameters.
  • Add a pointer to the sitemap location.

Warnings

  • 1 invalid line.
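
The report does not say which line was flagged or why. As a rough sketch of how such a check could work, the function below flags any line that is neither blank, nor a comment, nor of the form `field: value`. The function name, the regular expression, and the sample text are assumptions for illustration; the scanner's actual validation rules are not shown in the report.

```python
import re

# A record line is assumed to look like "field: value"; the field name is
# letters and hyphens only (e.g. "User-agent", "Disallow", "Sitemap").
FIELD_LINE = re.compile(r"^[A-Za-z][A-Za-z-]*\s*:")


def find_invalid_lines(text):
    """Return (line_number, line) pairs that do not parse as records."""
    invalid = []
    for number, line in enumerate(text.splitlines(), start=1):
        # Drop any trailing comment, then surrounding whitespace.
        content = line.split("#", 1)[0].strip()
        if content and not FIELD_LINE.match(content):
            invalid.append((number, line))
    return invalid


if __name__ == "__main__":
    # Hypothetical sample, not the real gs1ve.org file.
    sample = (
        "User-agent: *\n"
        "# block feeds\n"
        "Disallow /no-colon\n"
        "Allow: /feed/$\n"
    )
    print(find_invalid_lines(sample))
```

Running a check like this against the live file at the scanned URL is one way to locate the line behind the warning.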