elprogreso.galiciae.com
robots.txt

Robots Exclusion Standard data for elprogreso.galiciae.com

Resource Scan

Scan Details

Site Domain elprogreso.galiciae.com
Base Domain galiciae.com
Scan Status Ok
Last Scan 2024-10-22T11:14:24+00:00
Next Scan 2024-11-21T11:14:24+00:00

Last Scan

Scanned 2024-10-22T11:14:24+00:00
URL https://elprogreso.galiciae.com/robots.txt
Redirect https://www.elprogreso.es/robots.txt
Redirect Domain www.elprogreso.es
Redirect Base elprogreso.es
Domain IPs 104.21.92.68, 172.67.187.162, 2606:4700:3031::ac43:bba2, 2606:4700:3036::6815:5c44
Redirect IPs 104.26.6.201, 104.26.7.201, 172.67.70.71, 2606:4700:20::681a:6c9, 2606:4700:20::681a:7c9, 2606:4700:20::ac43:4647
Response IP 172.67.70.71
Found Yes
Hash 4c8e11e2e024b7786d199aabb0e7e0f07ce6ac07265bc2340c89ea36afda2c9a
SimHash a820c8a0e9d3
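
The report does not name the algorithm behind the Hash field. Its length of 64 hexadecimal digits is consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. That is an assumption, not something the scan states. The function names `content_digest` and `matches_report` are hypothetical helpers for illustration; the SimHash value is not covered.

```python
import hashlib

# Hash value copied from the scan record above.
REPORTED_HASH = "4c8e11e2e024b7786d199aabb0e7e0f07ce6ac07265bc2340c89ea36afda2c9a"


def content_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a fetched robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256. The report
    does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Check whether a freshly fetched body has the digest recorded in the scan."""
    return content_digest(body) == REPORTED_HASH
```

If the assumption holds, comparing the digest of a later fetch against the recorded value would be a cheap way to tell whether the file changed between scans.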

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.elprogreso.es/sitemap.news.xml.gz
sitemap https://www.elprogreso.es/sitemap.xml
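
To see how the groups and records above behave in practice, the following sketch rebuilds a robots.txt from them and evaluates it with Python's standard `urllib.robotparser`. The file text is reconstructed from this report, so the live file may differ in ordering, comments, or whitespace. The agent name `ExampleBot` and the sample paths are hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups and records listed in this report.
# Assumption: the live file may differ in ordering, comments, or whitespace.
ROBOTS_TXT = """\
User-agent: *
Disallow: /harming/humans
Disallow: /ignoring/human/orders
Disallow: /harm/to/self
Disallow: /api
Disallow: /admin

User-agent: chatgpt-user
Disallow: /

Sitemap: https://www.elprogreso.es/sitemap.news.xml.gz
Sitemap: https://www.elprogreso.es/sitemap.xml
"""

# The scanned URL redirects to this host, so rules are checked against it.
BASE = "https://www.elprogreso.es"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler that falls under the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/"))              # site root
print(parser.can_fetch("ExampleBot", BASE + "/api/v1/items"))  # under /api
print(parser.can_fetch("ChatGPT-User", BASE + "/"))            # own group
print(parser.site_maps())
```

Note that `urllib.robotparser` matches agent names case-insensitively and treats each Disallow path as a prefix, so a hypothetical path such as `/api/v1/items` falls under the `/api` rule. Other crawlers may implement matching differently.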