excurzilla.com
robots.txt

Robots Exclusion Standard data for excurzilla.com

Resource Scan

Scan Details

Site Domain excurzilla.com
Base Domain excurzilla.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2026-03-06T11:15:38+00:00
Next Scan 2026-06-04T11:15:38+00:00

Last Successful Scan

Scanned 2025-07-16T19:37:26+00:00
URL https://excurzilla.com/robots.txt
Domain IPs 172.66.40.118, 172.66.43.138, 2606:4700:3108::ac42:2876, 2606:4700:3108::ac42:2b8a
Response IP 172.66.43.138
Found Yes
Hash 76f529368dbea161682f668adfbbbad22433bddafa22b7022897bebb08167bf5
SimHash 694c8200a717

Groups

*

Rule Path
Disallow /*?*
Disallow /*%3D*
Disallow /*?
Disallow /?
Disallow */tag/tours*
Disallow /*?sort=*
Disallow /*?page=*
Disallow /*?from=*
Disallow /*?utm_source=*
Disallow /*?gclid=*
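
The wildcard rules in the `*` group can be checked against sample paths with a short sketch. It assumes RFC 9309 style matching (`*` matches any run of characters, a trailing `$` anchors the end, matching starts at the beginning of the path) and skips percent-encoding normalization. The function names `pattern_to_regex` and `is_disallowed` and the sample paths are illustrative, not part of the scan data.

```python
import re

# Disallow paths from the "*" group, copied from the listing above.
DISALLOW = [
    "/*?*", "/*%3D*", "/*?", "/?", "*/tag/tours*",
    "/*?sort=*", "/*?page=*", "/*?from=*", "/*?utm_source=*", "/*?gclid=*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumption: '*' matches any run of characters, a trailing '$'
    anchors the end of the path, and everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the rules.
    for sample in ["/tours/rome", "/tours?sort=price", "/en/tag/tours-rome"]:
        print(sample, is_disallowed(sample))
```

Because the group lists only Disallow rules, the sketch does not need Allow handling or longest-match precedence. Under this reading, `/*?*` already covers every path with a query string, so the later `?sort=`, `?page=`, `?from=`, `?utm_source=` and `?gclid=` rules add no further restriction.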

yandex

Rule Path
Disallow /*?*
Disallow /*%3D*
Disallow /*?
Disallow /?
Disallow */tag/tours*
Disallow /*?sort=*
Disallow /*?page=*
Disallow /*?from=*
Disallow /sv/
Disallow /da/
Disallow /pt/
Disallow /pl/
Disallow /nl/
Disallow /it/
Disallow /no/
Disallow /sv/
Disallow /fi/
Disallow /kr/
Disallow /ko/
Disallow /ja/
Disallow /fr/
Disallow /de/
Disallow /es/
Disallow */tag/tours*
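
The yandex group as scanned lists some paths more than once. A minimal sketch for spotting the repeats and producing an order-preserving, de-duplicated list follows; the variable names are illustrative, and the rule list is copied from the listing above.

```python
from collections import Counter

# Disallow paths from the "yandex" group, in the order they were scanned.
YANDEX_DISALLOW = [
    "/*?*", "/*%3D*", "/*?", "/?", "*/tag/tours*",
    "/*?sort=*", "/*?page=*", "/*?from=*",
    "/sv/", "/da/", "/pt/", "/pl/", "/nl/", "/it/", "/no/", "/sv/",
    "/fi/", "/kr/", "/ko/", "/ja/", "/fr/", "/de/", "/es/",
    "*/tag/tours*",
]

# Paths that occur more than once in the group.
repeated = [path for path, count in Counter(YANDEX_DISALLOW).items() if count > 1]

# dict.fromkeys keeps the first occurrence of each path and preserves order.
unique_rules = list(dict.fromkeys(YANDEX_DISALLOW))

if __name__ == "__main__":
    print("repeated:", repeated)
    print("rules before:", len(YANDEX_DISALLOW), "after:", len(unique_rules))
```

Repeated Disallow lines do not change how a crawler interprets the group, so this is a tidiness check rather than a correctness fix.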

geedoproductsearch

Rule Path
Disallow *

semrushbot

Rule Path
Disallow *

claudebot

Rule Path
Disallow *

gptbot

Rule Path
Disallow *
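
Which group applies to a given crawler can be sketched as a lookup on the crawler's product token. The sketch assumes the RFC 9309 convention of a case-insensitive match on the token with a fallback to `*`; real crawlers may apply looser matching (prefixes, for example), which is not modelled here. `select_group` is a hypothetical helper name.

```python
# Group names as listed in the scan.
GROUPS = ["*", "yandex", "geedoproductsearch", "semrushbot", "claudebot", "gptbot"]


def select_group(product_token: str, groups=GROUPS) -> str:
    """Return the group a crawler would follow.

    Assumption: exact, case-insensitive match on the product token;
    if no named group matches, the catch-all "*" group applies.
    """
    token = product_token.lower()
    for name in groups:
        if name != "*" and name == token:
            return name
    return "*"


if __name__ == "__main__":
    for agent in ["GPTBot", "ClaudeBot", "Googlebot"]:
        print(agent, "->", select_group(agent))
```

Under wildcard matching, the single `Disallow *` rule in the geedoproductsearch, semrushbot, claudebot and gptbot groups matches every path, so a crawler that selects one of those groups is excluded from the whole site.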

Other Records

Field Value
sitemap https://excurzilla.com/sitemap.xml

Warnings

  • `host` is not a known field.