somoscomarca.es
robots.txt
Robots Exclusion Standard data for somoscomarca.es
Resource Scan
Scan Details
Site Domain | somoscomarca.es |
Base Domain | somoscomarca.es |
Scan Status | Ok |
Last Scan | 2024-11-03T23:27:45+00:00 |
Next Scan | 2024-11-10T23:27:45+00:00 |
Last Scan
Scanned | 2024-11-03T23:27:45+00:00 |
URL | https://somoscomarca.es/robots.txt |
Redirect | https://www.somoscomarca.es/robots.txt |
Redirect Domain | www.somoscomarca.es |
Redirect Base | somoscomarca.es |
Domain IPs | 104.21.82.23, 172.67.151.91, 2606:4700:3034::ac43:975b, 2606:4700:3036::6815:5217 |
Redirect IPs | 104.21.82.23, 172.67.151.91, 2606:4700:3034::ac43:975b, 2606:4700:3036::6815:5217 |
Response IP | 172.67.151.91 |
Found | Yes |
Hash | 0a4e0c4d3c420a07062cb15faa561da6b0ed696d53d5e15d5adfcf9adeb18dc4 |
SimHash | 80204c44e9d2 |
Groups
*
Rule | Path |
---|---|
Disallow | /harming/humans |
Disallow | /ignoring/human/orders |
Disallow | /harm/to/self |
Disallow | /api |
Disallow | /admin |
Other Records
Field | Value |
---|---|
sitemap | https://www.somoscomarca.es/sitemap.news.xml.gz |
sitemap | https://www.somoscomarca.es/sitemap.xml |