lagrannoticia.com
robots.txt

Robots Exclusion Standard data for lagrannoticia.com

Resource Scan

Scan Details

Site Domain lagrannoticia.com
Base Domain lagrannoticia.com
Scan Status Ok
Last Scan 2024-10-28T09:54:39+00:00
Next Scan 2024-11-04T09:54:39+00:00

Last Scan

Scanned 2024-10-28T09:54:39+00:00
URL https://lagrannoticia.com/robots.txt
Domain IPs 104.21.87.14, 172.67.139.37, 2606:4700:3033::6815:570e, 2606:4700:3037::ac43:8b25
Response IP 172.67.139.37
Found Yes
Hash f60279c1e1d7e783a25e9ab8b53c106ab9012ce770be9c4ffd28830466765fc0
SimHash 63a4dd4a6937

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /*/*.css
Allow /*/*.js
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow */disclaimer/*
Disallow *?attachment_id=
Disallow /privacy-policy
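
To see how these rules interact, the following is a minimal sketch of a path checker for the `*` group. It assumes the precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and treats `*` as "any sequence of characters" and a trailing `$` as an end anchor. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers written for this illustration; they are not part of the scan or of any library. Python's built-in `urllib.robotparser` is deliberately not used here because it does not interpret `*` wildcards inside paths.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*/*.css"),
    ("allow", "/*/*.js"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/readme.html"),
    ("disallow", "/license.txt"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "*/disclaimer/*"),
    ("disallow", "*?attachment_id="),
    ("disallow", "/privacy-policy"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the pattern to the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins; on equal length,
    Allow beats Disallow; no matching rule means allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
              "/wp-includes/js/jquery.js", "/?attachment_id=12"]:
        print(p, is_allowed(p))
```

Under these assumptions, a path such as `/wp-includes/js/jquery.js` matches both `Allow /*/*.js` and `Disallow /wp-includes/`, and the longer Disallow pattern takes precedence. Crawlers that resolve conflicts differently (for example, first match in file order) may reach a different result.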

Other Records

Field Value
crawl-delay 5
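
The `crawl-delay` field is not part of RFC 9309, and crawlers differ in whether and how they honor it. A common reading is "wait at least this many seconds between successive requests". The sketch below illustrates that reading only; `PoliteScheduler` and `CRAWL_DELAY_SECONDS` are hypothetical names introduced for this example, and the injectable `clock` and `sleep` parameters exist purely to make the behavior easy to test.

```python
import time

# Assumption: the value 5 from the crawl-delay record is interpreted as seconds.
CRAWL_DELAY_SECONDS = 5


class PoliteScheduler:
    """Spaces out requests so consecutive fetches are at least `delay` apart."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock
        self.sleep = sleep
        self.next_allowed = 0.0  # earliest time the next request may start

    def wait_turn(self):
        """Block until the next request is permitted; return seconds waited."""
        now = self.clock()
        wait = max(0.0, self.next_allowed - now)
        if wait > 0:
            self.sleep(wait)
        # The request starts at max(now, next_allowed); schedule the one after it.
        self.next_allowed = max(now, self.next_allowed) + self.delay
        return wait


if __name__ == "__main__":
    scheduler = PoliteScheduler(CRAWL_DELAY_SECONDS)
    for url in ["https://lagrannoticia.com/", "https://lagrannoticia.com/robots.txt"]:
        waited = scheduler.wait_turn()
        print(f"waited {waited:.1f}s before fetching {url}")
```

With a 5-second spacing, a single crawler following this interpretation could issue at most 86400 / 5 = 17280 requests per day to the site.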

Comments

  • This robots.txt file was created by Better Robots.txt (Index & Rank Booster by Pagup) Plugin. https://www.better-robots.com/