globusjourneys.ca
robots.txt

Robots Exclusion Standard data for globusjourneys.ca

Resource Scan

Scan Details

Site Domain globusjourneys.ca
Base Domain globusjourneys.ca
Scan Status Ok
Last Scan 2025-11-27T02:14:28+00:00
Next Scan 2025-12-27T02:14:28+00:00

Last Scan

Scanned 2025-11-27T02:14:28+00:00
URL https://globusjourneys.ca/robots.txt
Redirect https://www.globusjourneys.ca/robots.txt
Redirect Domain www.globusjourneys.ca
Redirect Base globusjourneys.ca
Domain IPs 20.119.16.57
Redirect IPs 104.18.35.3, 172.64.152.253
Response IP 172.64.152.253
Found Yes
Hash bb5258a4ca3137343bec3eef58dd1ab6659182e6930ac191fc04f609edeb8f86
SimHash 79d59a564d57
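
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The scan does not state which algorithm or which exact bytes were hashed, so the following is only a sketch under the assumption that the digest is SHA-256 over the raw response body. The function name body_digest is hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()
```

Comparing the digest of a freshly fetched body against the recorded Hash would show whether the file changed since the last scan, provided the assumption about the algorithm holds.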

Groups

*

Rule Path
Allow /
Disallow /APIs
Disallow /print
Disallow /common
Disallow /Common
Disallow /gaparm
Disallow /Thank-You
Disallow /thank_you
Disallow /YourWayList*
Disallow *mode%3Dprintme*
Disallow *content%3Dprint*
Disallow /WebResource.axd
Disallow /user/slideshow/
Allow /user/slideshow///.jpg
Allow /Common/Know-Before-You-Go/
Disallow /controls/currencyconverterframe.aspx
Disallow /user/htmlresources/changecountry.html
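
The * group mixes Allow and Disallow rules, so the outcome for a given path depends on rule precedence. The sketch below illustrates the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie), with * treated as a wildcard and a trailing $ as an end anchor. It uses only a subset of the rules listed above and leaves out the /user/slideshow///.jpg entry, whose pattern is unclear in this report. The names RULES, pattern_to_regex and is_allowed are hypothetical, and real crawlers also normalize percent-encoding, which this sketch does not do.

```python
import re

# Subset of the "*" group from the report, as (rule type, path pattern).
RULES = [
    ("allow", "/"),
    ("disallow", "/APIs"),
    ("disallow", "/print"),
    ("disallow", "/common"),
    ("disallow", "/Common"),
    ("disallow", "/YourWayList*"),
    ("disallow", "*mode%3Dprintme*"),
    ("disallow", "/user/slideshow/"),
    ("allow", "/Common/Know-Before-You-Go/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; Allow wins when lengths tie."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path (prefix matching).
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under this reading, /Common/styles.css is blocked by Disallow /Common, while /Common/Know-Before-You-Go/visa is permitted because the longer Allow rule takes precedence. Matching is case-sensitive, which is why both /common and /Common appear in the group.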

gptbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

google-extended

Rule Path
Allow /
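
Each named group above contains only Allow /, and under RFC 9309 a crawler follows the group that matches its own product token, falling back to * only when no named group matches. A minimal sketch of that selection follows, assuming a case-insensitive exact match on the product token. The GROUPS mapping is abbreviated (the * group is truncated to three rules) and the name select_group is hypothetical.

```python
# Abbreviated view of the groups in this report: user-agent token -> rules.
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/APIs"), ("disallow", "/print")],
    "gptbot": [("allow", "/")],
    "googlebot": [("allow", "/")],
    "chatgpt-user": [("allow", "/")],
    "oai-searchbot": [("allow", "/")],
    "perplexitybot": [("allow", "/")],
    "google-extended": [("allow", "/")],
}


def select_group(product_token, groups=GROUPS):
    """Return the rules for a crawler, falling back to the "*" group."""
    return groups.get(product_token.lower(), groups["*"])
```

One consequence of this selection is that the Disallow rules in the * group do not apply to the crawlers that have their own group here; a token such as bingbot, which has no named group, would receive the * rules.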

Other Records

Field Value
sitemap https://www.globusjourneys.ca/sitemap.xml