noticiasgalicia.com
robots.txt

Robots Exclusion Standard data for noticiasgalicia.com

Resource Scan

Scan Details

Site Domain noticiasgalicia.com
Base Domain noticiasgalicia.com
Scan Status Ok
Last Scan2024-11-04T06:49:27+00:00
Next Scan 2024-11-11T06:49:27+00:00

Last Scan

Scanned2024-11-04T06:49:27+00:00
URL https://noticiasgalicia.com/robots.txt
Redirect https://www.noticiasgalicia.com/robots.txt
Redirect Domain www.noticiasgalicia.com
Redirect Base noticiasgalicia.com
Domain IPs 104.21.94.143, 172.67.136.219, 2606:4700:3031::ac43:88db, 2606:4700:3032::6815:5e8f
Redirect IPs 104.21.94.143, 172.67.136.219, 2606:4700:3031::ac43:88db, 2606:4700:3032::6815:5e8f
Response IP 172.67.136.219
Found Yes
Hash 4ee29398e778a006fcb9100b2cc2ac64c0dfba9ff8840c076536ef88872fdd92
SimHash 8c00ca60abd2

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin

Other Records

Field Value
sitemap https://www.noticiasgalicia.com/sitemap.news.xml.gz
sitemap https://www.noticiasgalicia.com/sitemap.xml