thetalon.ca
robots.txt

Robots Exclusion Standard data for thetalon.ca

Resource Scan

Scan Details

Site Domain thetalon.ca
Base Domain thetalon.ca
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 2025-12-01T17:26:11+00:00
Next Scan 2026-03-01T17:26:11+00:00

Last Successful Scan

Scanned 2024-01-19T09:35:59+00:00
URL https://thetalon.ca/robots.txt
Domain IPs 104.21.69.69, 172.67.206.9, 2606:4700:3034::6815:4545, 2606:4700:3036::ac43:ce09
Response IP 172.67.206.9
Found Yes
Hash c3d950a79526c3e672430261fa3ec58dc73c2ec2ea313c78219c0641803cb2bc
SimHash 1119c68269a6

Groups

seo-bot

Rule Path
Disallow /

npbot

Rule Path
Disallow /

recorder

Rule Path
Disallow /

aspseek

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

linkscan

Rule Path
Disallow /

sideqik

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

xaldon

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

anarchie

Rule Path
Disallow /

jetcar

Rule Path
Disallow /

sucker

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

getsmart

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

httptrack

Rule Path
Disallow /

webhook

Rule Path
Disallow /

php

Rule Path
Disallow /

spyfu

Rule Path
Disallow /

semrushbot/1.1~bl

Rule Path
Disallow /

ahrefsbot/6.1

Rule Path
Disallow /

blackhole

Rule Path
Disallow /

webwhacker

Rule Path
Disallow /

*

Rule Path
Disallow /Payment
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
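
The groups above can be read as a lookup followed by a path check. The following is a minimal sketch, assuming the longest-match semantics described in RFC 9309 (the most specific matching rule wins, and an Allow wins a tie). It is not the scanner's own implementation. The names GROUPS, select_group and is_allowed are hypothetical, only three of the listed groups are reproduced for brevity, and "examplebot" is a made-up crawler name. Wildcards (`*`, `$`) and percent-encoding are deliberately ignored, and user-agent matching is simplified to an exact, case-insensitive token comparison.

```python
# Sketch only: a subset of the groups from the listing above.
# Each rule is (kind, path_prefix); kind is "allow" or "disallow".
GROUPS = {
    "semrushbot": [("disallow", "/")],
    "ahrefsbot": [("disallow", "/")],
    "*": [
        ("disallow", "/Payment"),
        ("disallow", "/wp-admin/"),
        ("allow", "/wp-admin/admin-ajax.php"),
    ],
}


def select_group(user_agent):
    """Return the rules for a crawler token, falling back to the "*" group.

    Assumption: matching is an exact, case-insensitive comparison of the
    product token, which is a simplification of real crawler behaviour.
    """
    return GROUPS.get(user_agent.lower(), GROUPS["*"])


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched under longest-match semantics."""
    best_len = -1
    allowed = True  # a path with no matching rule is allowed
    for kind, prefix in select_group(user_agent):
        if not path.startswith(prefix):  # paths are case-sensitive
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("SemrushBot", "/"),
        ("examplebot", "/wp-admin/options.php"),
        ("examplebot", "/wp-admin/admin-ajax.php"),
        ("examplebot", "/Payment/checkout"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a crawler with no dedicated group is kept out of /wp-admin/ but may still request /wp-admin/admin-ajax.php, because the Allow rule is the longer match. Parsers that apply the first matching rule instead of the longest one could reach a different answer for that path, since the Disallow line comes first in the listing.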