travelcaribou.com
robots.txt

Robots Exclusion Standard data for travelcaribou.com

Resource Scan

Scan Details

Site Domain travelcaribou.com
Base Domain travelcaribou.com
Scan Status Ok
Last Scan 2024-05-09T04:22:30+00:00
Next Scan 2024-05-16T04:22:30+00:00
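
The two timestamps above are exactly seven days apart, which suggests (but does not confirm) a weekly rescan schedule. A minimal sketch of that arithmetic follows; the helper `schedule_next` and its seven-day default are assumptions for illustration, not part of the scanner.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details rows above.
last_scan = datetime.fromisoformat("2024-05-09T04:22:30+00:00")
next_scan = datetime.fromisoformat("2024-05-16T04:22:30+00:00")

# Gap between the recorded last scan and the planned next scan.
interval = next_scan - last_scan


def schedule_next(scanned_at: datetime, every: timedelta = timedelta(days=7)) -> datetime:
    """Hypothetical helper: assumes a fixed rescan interval (weekly by default)."""
    return scanned_at + every


print(interval)
print(schedule_next(last_scan).isoformat())
```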

Last Scan

Scanned 2024-05-09T04:22:30+00:00
URL https://travelcaribou.com/robots.txt
Redirect https://www.travelcaribou.com/robots.txt
Redirect Domain www.travelcaribou.com
Redirect Base travelcaribou.com
Domain IPs 104.26.2.15, 104.26.3.15, 172.67.70.195, 2606:4700:20::681a:20f, 2606:4700:20::681a:30f, 2606:4700:20::ac43:46c3
Redirect IPs 104.26.2.15, 104.26.3.15, 172.67.70.195, 2606:4700:20::681a:20f, 2606:4700:20::681a:30f, 2606:4700:20::ac43:46c3
Response IP 104.26.3.15
Found Yes
Hash e26d03acb7d48ae962a2df5537f6aada747af5446e918bc5168931cb4be3d10b
SimHash ca10c8c0a6a9
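
The Hash value is 64 hexadecimal digits long, which is the length of a SHA-256 digest. Assuming the scanner hashes the raw response body that way (the report does not say), a change in robots.txt could be detected by comparing digests between scans. The function name `content_hash` is hypothetical.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as 64 hex digits.

    Assumption: the digest is taken over the raw response bytes.
    """
    return hashlib.sha256(body).hexdigest()


# Any byte-level difference yields a different digest.
before = content_hash(b"User-agent: *\nDisallow: /search/\n")
after = content_hash(b"User-agent: *\nDisallow: /search\n")
print(before != after)
```

An exact digest only signals that something changed; the shorter SimHash value is presumably there to judge how similar two versions are, which this sketch does not attempt.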

Groups

*

Rule Path
Allow /ads.txt
Disallow /trackback/
Disallow /comments/
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /?s=*
Disallow /stats/
Disallow /events/
Disallow /search/
Disallow /21287525/
Disallow /author
Disallow /wget/
Disallow /httpd/
Disallow /wp-content/uploads/

Other Records

Field Value
crawl-delay 30

googlebot-image

Rule Path
Disallow /

twitterbot

Rule Path
Allow /wp-content/uploads/

facebookexternalhit

Rule Path
Allow /wp-content/uploads/

ia_archiver

Rule Path
Disallow /
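
The groups above can be checked against sample URLs with Python's standard `urllib.robotparser`. The robots.txt text below is reconstructed from the listed rules, so its ordering and layout are assumptions and may not match the live file; `ExampleBot` is a made-up crawler name that falls under the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups listing; not a copy of the live file.
ROBOTS_TXT = """\
User-agent: *
Allow: /ads.txt
Disallow: /trackback/
Disallow: /comments/
Disallow: /cgi-bin
Disallow: /wp-admin/
Disallow: /?s=*
Disallow: /stats/
Disallow: /events/
Disallow: /search/
Disallow: /21287525/
Disallow: /author
Disallow: /wget/
Disallow: /httpd/
Disallow: /wp-content/uploads/
Crawl-delay: 30

User-agent: googlebot-image
Disallow: /

User-agent: twitterbot
Allow: /wp-content/uploads/

User-agent: facebookexternalhit
Allow: /wp-content/uploads/

User-agent: ia_archiver
Disallow: /
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser()
site = "https://www.travelcaribou.com"

# (user agent, path) pairs to evaluate against the parsed groups.
checks = [
    ("ExampleBot", "/ads.txt"),
    ("ExampleBot", "/wp-content/uploads/photo.jpg"),
    ("twitterbot", "/wp-content/uploads/photo.jpg"),
    ("googlebot-image", "/"),
    ("ia_archiver", "/about/"),
]
for agent, path in checks:
    print(agent, path, parser.can_fetch(agent, site + path))

# Crawl delay that applies to agents covered by the * group.
print(parser.crawl_delay("ExampleBot"))
```

`urllib.robotparser` uses simple prefix matching and does not expand `*` inside a path, so its verdict for the `/?s=*` rule may differ from crawlers that support wildcards.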