ceutaldia.com
robots.txt

Robots Exclusion Standard data for ceutaldia.com

Resource Scan

Scan Details

Site Domain ceutaldia.com
Base Domain ceutaldia.com
Scan Status Ok
Last Scan2024-10-31T05:26:47+00:00
Next Scan 2024-11-07T05:26:47+00:00

Last Scan

Scanned2024-10-31T05:26:47+00:00
URL https://ceutaldia.com/robots.txt
Redirect https://www.ceutaldia.com/robots.txt
Redirect Domain www.ceutaldia.com
Redirect Base ceutaldia.com
Domain IPs 104.21.67.29, 172.67.211.150, 2606:4700:3033::ac43:d396, 2606:4700:3036::6815:431d
Redirect IPs 104.21.67.29, 172.67.211.150, 2606:4700:3033::ac43:d396, 2606:4700:3036::6815:431d
Response IP 104.21.67.29
Found Yes
Hash c48bd84886994d740557d9893c0ae636e0abeaa16db73e3435327d8c952c20de
SimHash e000ca60e9d2

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin

Other Records

Field Value
sitemap https://www.ceutaldia.com/sitemap.news.xml.gz
sitemap https://www.ceutaldia.com/sitemap.xml