periodicoclm.es
robots.txt
Robots Exclusion Standard data for periodicoclm.es
Resource Scan
Scan Details
Site Domain | periodicoclm.es |
Base Domain | periodicoclm.es |
Scan Status | Ok |
Last Scan | 2024-11-12T05:39:34+00:00 |
Next Scan | 2024-11-19T05:39:34+00:00 |
Last Scan
Scanned | 2024-11-12T05:39:34+00:00 |
URL | http://periodicoclm.es/robots.txt |
Redirect | https://periodicoclm.publico.es/robots.txt |
Redirect Domain | periodicoclm.publico.es |
Redirect Base | publico.es |
Domain IPs | 213.186.33.5 |
Redirect IPs | 104.22.4.10, 104.22.5.10, 172.67.27.228, 2606:4700:10::6816:40a, 2606:4700:10::6816:50a, 2606:4700:10::ac43:1be4 |
Response IP | 172.67.27.228 |
Found | Yes |
Hash | ebc0ff06087d60febdafe6cf7913908413d5c71f4f84dcea1997cd4a18f9decb |
SimHash | a920cee089d2 |
Groups
*
Rule | Path |
---|---|
Disallow | /harming/humans |
Disallow | /ignoring/human/orders |
Disallow | /harm/to/self |
Disallow | /api |
Disallow | /admin |
Other Records
Field | Value |
---|---|
sitemap | https://periodicoclm.publico.es/sitemap.news.xml.gz |
sitemap | https://periodicoclm.publico.es/sitemap.xml |