diariodegrancanaria.com
robots.txt

Robots Exclusion Standard data for diariodegrancanaria.com

Resource Scan

Scan Details

Site Domain diariodegrancanaria.com
Base Domain diariodegrancanaria.com
Scan Status Ok
Last Scan2024-11-02T17:43:16+00:00
Next Scan 2024-11-09T17:43:16+00:00

Last Scan

Scanned2024-11-02T17:43:16+00:00
URL https://diariodegrancanaria.com/robots.txt
Redirect https://www.diariodegrancanaria.com/robots.txt
Redirect Domain www.diariodegrancanaria.com
Redirect Base diariodegrancanaria.com
Domain IPs 104.21.81.33, 172.67.137.209, 2606:4700:3031::6815:5121, 2606:4700:3032::ac43:89d1
Redirect IPs 104.21.81.33, 172.67.137.209, 2606:4700:3031::6815:5121, 2606:4700:3032::ac43:89d1
Response IP 172.67.137.209
Found Yes
Hash 326fe3b7d4c80d73a501f8ecbc3b19e517bbdd5668659073a60213a6edc62582
SimHash 8020cc2069d3

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin

Other Records

Field Value
sitemap https://www.diariodegrancanaria.com/sitemap.news.xml.gz
sitemap https://www.diariodegrancanaria.com/sitemap.xml