adventurebook.com
robots.txt

Robots Exclusion Standard data for adventurebook.com

Resource Scan

Scan Details

Site Domain adventurebook.com
Base Domain adventurebook.com
Scan Status Ok
Last Scan2025-12-30T06:48:14+00:00
Next Scan 2026-01-29T06:48:14+00:00

Last Scan

Scanned2025-12-30T06:48:14+00:00
URL https://adventurebook.com/robots.txt
Redirect https://www.adventurebook.com/robots.txt
Redirect Domain www.adventurebook.com
Redirect Base adventurebook.com
Domain IPs 104.21.51.190, 172.67.184.39, 2606:4700:3031::6815:33be, 2606:4700:3033::ac43:b827
Redirect IPs 104.21.51.190, 172.67.184.39, 2606:4700:3031::6815:33be, 2606:4700:3033::ac43:b827
Response IP 172.67.184.39
Found Yes
Hash a3e6f8f7a7eef2723939ec15674153d85f5cbcfa38ed78406eda8f9f3d81e3ef
SimHash 852c5c1027a1

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /cache/
Disallow /application/logs/
Disallow /assets/docs/Lets_Roam_printable_gift_info-2.pdf
Disallow /orders/
Disallow /*?*order=
Disallow /*?*filter=
Disallow /*?*sort=
Disallow /*?*query=

Other Records

Field Value
sitemap https://www.adventurebook.com/sitemap.xml

Comments

  • Robots.txt - Comprehensive Crawl Directives
  • Last updated: 2025-12-29
  • Allow crawling of most content
  • Disallow admin areas and sensitive content
  • Sitemaps