travismathew.ca
robots.txt

Robots Exclusion Standard data for travismathew.ca

Resource Scan

Scan Details

Site Domain	travismathew.ca
Base Domain	travismathew.ca
Scan Status	Ok
Last Scan	2024-10-30T10:17:18+00:00
Next Scan	2024-11-29T10:17:18+00:00

Last Scan

Scanned	2024-10-30T10:17:18+00:00
URL	https://travismathew.ca/robots.txt
Redirect	https://www.travismathew.com/robots.txt
Redirect Domain	www.travismathew.com
Redirect Base	travismathew.com
Domain IPs	63.166.75.109
Redirect IPs	23.215.7.13, 23.215.7.7, 2600:1413:b000:1b::17d7:707, 2600:1413:b000:1b::17d7:70d
Response IP	23.44.4.161
Found	Yes
Hash	54b9a13c76ab76b227e29eb567c50fbb4b6915368c9699bf0498de217e179fa4
SimHash	7c76df1cede8

Groups

*

Rule	Path
Disallow	/search
Disallow	/cart
Disallow	/login
Disallow	/checkout
Disallow	/tmcheckout
Disallow	/my-account

Other Records

Field	Value	Comment
crawl-delay	10	10 seconds between page requests

cazoodlebot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

dotbot/1.0

Rule	Path
Disallow	/

gigabot

Rule	Path
Disallow	/

amazonbot/1.0

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/
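
The groups above can be checked with Python's standard-library `urllib.robotparser`. This is a minimal sketch, not the site's actual file: `ROBOTS_TXT` is reconstructed from the groups, crawl-delay, and sitemap records in this report, and the agent name `ExampleBot` is a hypothetical crawler used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the records in this report (an assumption); the real
# file's layout, comments, and unrecognised fields are not reproduced here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Disallow: /cart
Disallow: /login
Disallow: /checkout
Disallow: /tmcheckout
Disallow: /my-account
Crawl-delay: 10

User-agent: cazoodlebot
Disallow: /

User-agent: mj12bot
Disallow: /

User-agent: dotbot/1.0
Disallow: /

User-agent: gigabot
Disallow: /

User-agent: amazonbot/1.0
Disallow: /

User-agent: amazonbot
Disallow: /

Sitemap: https://www.travismathew.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.travismathew.com"

# "ExampleBot" is hypothetical; it matches no named group, so the * group applies.
print(parser.can_fetch("ExampleBot", BASE + "/"))      # home page
print(parser.can_fetch("ExampleBot", BASE + "/cart"))  # blocked path
print(parser.can_fetch("MJ12bot", BASE + "/"))         # fully blocked agent
print(parser.crawl_delay("ExampleBot"))                # delay from the * group
print(parser.site_maps())                              # requires Python 3.8+
```

User-agent matching differs between parsers, so results for the version-suffixed groups (`dotbot/1.0`, `amazonbot/1.0`) may not match what a given crawler actually does.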

Other Records

Field	Value
sitemap	https://www.travismathew.com/sitemap.xml

Comments

For all robots
Block access to specific groups of pages
Allow search crawlers to discover the sitemap
Block CazoodleBot as it does not present correct accept content headers
Block MJ12bot as it is just noise
Block dotbot as it cannot parse base urls properly
Block Gigabot
Block Amazonbot to fix Reduce Bloomreach API Calls

Warnings

`request-rate` is not a known field.
`visit-time` is not a known field.
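
Warnings of this kind can be reproduced with a simple field-name check. The sketch below is not the scanner's actual logic: the `KNOWN_FIELDS` set is an assumption about which fields a scanner might recognise, and the `request-rate` and `visit-time` values in `SAMPLE` are hypothetical, since the report does not show the values used on the site.

```python
from typing import List

# Assumed set of recognised field names; a real scanner may accept more.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "crawl-delay", "sitemap"}


def unknown_fields(robots_txt: str) -> List[str]:
    """Return unrecognised field names in order of first appearance."""
    seen: List[str] = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Hypothetical sample: the field values below are illustrative only.
SAMPLE = """\
# For all robots
User-agent: *
Disallow: /search
Crawl-delay: 10
Request-rate: 1/10
Visit-time: 0400-0845
Sitemap: https://www.travismathew.com/sitemap.xml
"""

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```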
