periodicoclm.publico.es
robots.txt

Robots Exclusion Standard data for periodicoclm.publico.es

Resource Scan

Scan Details

Site Domain periodicoclm.publico.es
Base Domain publico.es
Scan Status Ok
Last Scan2024-11-06T17:16:35+00:00
Next Scan 2024-12-06T17:16:35+00:00

Last Scan

Scanned2024-11-06T17:16:35+00:00
URL https://periodicoclm.publico.es/robots.txt
Domain IPs 104.22.4.10, 104.22.5.10, 172.67.27.228, 2606:4700:10::6816:40a, 2606:4700:10::6816:50a, 2606:4700:10::ac43:1be4
Response IP 172.67.27.228
Found Yes
Hash ebc0ff06087d60febdafe6cf7913908413d5c71f4f84dcea1997cd4a18f9decb
SimHash a920cee089d2

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin

Other Records

Field Value
sitemap https://periodicoclm.publico.es/sitemap.news.xml.gz
sitemap https://periodicoclm.publico.es/sitemap.xml