diazilla.com
robots.txt

Robots Exclusion Standard data for diazilla.com

Resource Scan

Scan Details

Site Domain diazilla.com
Base Domain diazilla.com
Scan Status Ok
Last Scan2025-05-22T15:29:28+00:00
Next Scan 2025-05-29T15:29:28+00:00

Last Scan

Scanned2025-05-22T15:29:28+00:00
URL https://diazilla.com/robots.txt
Domain IPs 104.21.16.67, 172.67.166.224, 2606:4700:3031::ac43:a6e0, 2606:4700:3034::6815:1043
Response IP 172.67.166.224
Found Yes
Hash ab757fc3322f060e1b6120690c053f2edc507e3a1866484a6f6356e1ea2fb8ac
SimHash 01009e50d530

Groups

*

Rule Path
Disallow /viewer_next/
Disallow /theme/
Allow /theme/*/static
Disallow /store/
Disallow /upload
Disallow /docinfo.xml
Disallow /sendmail.html
Disallow /ask/searchAjax
Disallow /cdn-cgi/
Allow /

Other Records

Field Value
sitemap https://diazilla.com/sitemap.xml

Warnings

  • `host` is not a known field.