theorca.ca
robots.txt

Robots Exclusion Standard data for theorca.ca

Resource Scan

Scan Details

Site Domain theorca.ca
Base Domain theorca.ca
Scan Status Ok
Last Scan2024-09-25T22:56:05+00:00
Next Scan 2024-10-02T22:56:05+00:00

Last Scan

Scanned2024-09-25T22:56:05+00:00
URL https://theorca.ca/robots.txt
Redirect https://www.theorca.ca/robots.txt
Redirect Domain www.theorca.ca
Redirect Base theorca.ca
Domain IPs 104.21.63.233, 172.67.173.22, 2606:4700:3030::6815:3fe9, 2606:4700:3034::ac43:ad16
Redirect IPs 104.21.63.233, 172.67.173.22, 2606:4700:3030::6815:3fe9, 2606:4700:3034::ac43:ad16
Response IP 104.21.63.233
Found Yes
Hash b240b47c0c5c74c64961628aa2475377ca50bc326c0843c39586f8ad48e50c0f
SimHash 4904cba0e510

Groups

*

Rule Path
Allow /

googlebot-news

Rule Path
Allow /rss/showcase

googlebot

Rule Path
Allow /rss/showcase

semrushbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /