ceutaldia.com
robots.txt
Robots Exclusion Standard data for ceutaldia.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | ceutaldia.com |
Base Domain | ceutaldia.com |
Scan Status | Ok |
Last Scan | 2024-10-31T05:26:47+00:00 |
Next Scan | 2024-11-07T05:26:47+00:00 |
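The two timestamps above are exactly seven days apart, which suggests a weekly rescan. The report does not state the schedule, so the weekly interval is an inference. A minimal Python sketch of the arithmetic follows; `scan_interval` is a hypothetical helper name.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
LAST_SCAN = "2024-10-31T05:26:47+00:00"
NEXT_SCAN = "2024-11-07T05:26:47+00:00"


def scan_interval(last_scan: str, next_scan: str) -> timedelta:
    """Return the gap between two ISO 8601 timestamps (hypothetical helper)."""
    return datetime.fromisoformat(next_scan) - datetime.fromisoformat(last_scan)


interval = scan_interval(LAST_SCAN, NEXT_SCAN)
print(interval.days)  # number of whole days between the two scans
```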
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-31T05:26:47+00:00 |
URL | https://ceutaldia.com/robots.txt |
Redirect | https://www.ceutaldia.com/robots.txt |
Redirect Domain | www.ceutaldia.com |
Redirect Base | ceutaldia.com |
Domain IPs | 104.21.67.29, 172.67.211.150, 2606:4700:3033::ac43:d396, 2606:4700:3036::6815:431d |
Redirect IPs | 104.21.67.29, 172.67.211.150, 2606:4700:3033::ac43:d396, 2606:4700:3036::6815:431d |
Response IP | 104.21.67.29 |
Found | Yes |
Hash | c48bd84886994d740557d9893c0ae636e0abeaa16db73e3435327d8c952c20de |
SimHash | e000ca60e9d2 |
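The Hash value is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names are hypothetical, and the SimHash algorithm is not covered.

```python
import hashlib
import string

# Hash value copied from the Last Scan table.
REPORTED_HASH = "c48bd84886994d740557d9893c0ae636e0abeaa16db73e3435327d8c952c20de"


def looks_like_sha256(digest: str) -> bool:
    """True if the string has the shape of a SHA-256 hex digest (64 hex chars)."""
    return len(digest) == 64 and all(c in string.hexdigits for c in digest)


def body_matches(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body to the reported hash.

    Assumption: the scanner hashed the raw body with SHA-256.
    A mismatch may only mean that this assumption is wrong.
    """
    return hashlib.sha256(body).hexdigest() == reported.lower()


print(looks_like_sha256(REPORTED_HASH))
```

If a freshly fetched body does not match, the file may have changed since the scan, or the scanner may have used a different algorithm or normalised the body first.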
Groups
*
Rule | Path |
---|---|
Disallow | /harming/humans |
Disallow | /ignoring/human/orders |
Disallow | /harm/to/self |
Disallow | /api |
Disallow | /admin |
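To see how a crawler would apply this group, the rules can be rebuilt as robots.txt lines and checked with Python's standard `urllib.robotparser`. The lines below are a reconstruction from the table, not the original file. `ExampleBot` and the sample paths other than the listed rules are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group above; original order and comments are unknown.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /harming/humans",
    "Disallow: /ignoring/human/orders",
    "Disallow: /harm/to/self",
    "Disallow: /api",
    "Disallow: /admin",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Check a path against the reconstructed rules (hypothetical helper)."""
    return parser.can_fetch(agent, "https://www.ceutaldia.com" + path)


for path in ["/", "/noticias", "/api/v1", "/admin"]:
    print(path, is_allowed(path))
```

Disallow rules match by path prefix, so `/api` also blocks `/api/v1` and any other path that starts with `/api`.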
Other Records
Field | Value |
---|---|
sitemap | https://www.ceutaldia.com/sitemap.news.xml.gz |
sitemap | https://www.ceutaldia.com/sitemap.xml |
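The sitemap records can be read with the same standard-library parser. This is a minimal sketch that assumes Python 3.8 or later, where `RobotFileParser.site_maps()` is available. The lines are reconstructed from the table, and `list_sitemaps` and `is_gzipped` are hypothetical helpers.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Other Records table.
SITEMAP_LINES = [
    "Sitemap: https://www.ceutaldia.com/sitemap.news.xml.gz",
    "Sitemap: https://www.ceutaldia.com/sitemap.xml",
]


def list_sitemaps(lines):
    """Return the sitemap URLs declared in robots.txt lines, in file order."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no sitemap records are present.
    return parser.site_maps() or []


def is_gzipped(url: str) -> bool:
    """Guess from the suffix whether a sitemap must be decompressed first."""
    return url.endswith(".gz")


for url in list_sitemaps(SITEMAP_LINES):
    print(url, "gzip" if is_gzipped(url) else "plain")
```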