noticias.madrededios.com
robots.txt

Robots Exclusion Standard data for noticias.madrededios.com

Resource Scan

Scan Details

Site Domain noticias.madrededios.com
Base Domain madrededios.com
Scan Status Ok
Last Scan2024-11-01T08:14:25+00:00
Next Scan 2024-12-01T08:14:25+00:00

Last Scan

Scanned2024-11-01T08:14:25+00:00
URL https://noticias.madrededios.com/robots.txt
Domain IPs 104.21.93.16, 172.67.202.179, 2606:4700:3034::ac43:cab3, 2606:4700:3037::6815:5d10
Response IP 104.21.93.16
Found Yes
Hash be3adc515955d17191907d40e6c3703b0f410dcc85377a052491d9f574ed7d13
SimHash e8204c6069d2

Groups

*

Rule Path
Disallow /harming/humans
Disallow /ignoring/human/orders
Disallow /harm/to/self
Disallow /api
Disallow /admin

Other Records

Field Value
sitemap https://noticias.madrededios.com/sitemap.news.xml.gz
sitemap https://noticias.madrededios.com/sitemap.xml