theblog.ca
robots.txt

Robots Exclusion Standard data for theblog.ca

Resource Scan

Scan Details

Site Domain theblog.ca
Base Domain theblog.ca
Scan Status Ok
Last Scan 2025-10-12T18:24:50+00:00
Next Scan 2025-10-19T18:24:50+00:00

Last Scan

Scanned 2025-10-12T18:24:50+00:00
URL https://theblog.ca/robots.txt
Domain IPs 104.21.5.20, 172.67.132.192, 2606:4700:3033::6815:514, 2606:4700:3034::ac43:84c0
Response IP 172.67.132.192
Found Yes
Hash 8f6335854e06bf110bc3e70449b9f7f3ed7636d11f64cb86e021e949c7cef45c
SimHash 486d5384c942

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /*?feed
Disallow /feed
Disallow /category
Disallow /comments/feed
Disallow /feed/$
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /*/feed$
Disallow /*/feed/rss$
Disallow /*/trackback$
Disallow /*/*/feed$
Disallow /*/*/feed/rss$
Disallow /*/*/trackback$
Disallow /*/*/*/feed$
Disallow /*/*/*/feed/rss$
Disallow /*/*/*/trackback$
Disallow /index.php
Disallow /wp-content/plugins
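
The rules above use the `*` wildcard and the `$` end-of-path anchor. As a rough illustration of how a crawler might evaluate them, here is a minimal sketch that assumes the common interpretation (`*` matches any run of characters, `$` anchors the end of the path, everything else is a prefix match). The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, the rule list is only a subset of the group above, and Allow rules and longest-match precedence are not handled.

```python
import re

# A subset of the Disallow paths listed for the "*" group above.
RULES = [
    "/cgi-bin/",
    "/*?feed",
    "/feed",
    "/category",
    "/*/feed/$",
    "/*/trackback$",
    "/index.php",
    "/wp-content/plugins",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; all other characters are literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for path in ["/2009/05/some-post/feed/", "/category/tech", "/?feed=rss2", "/about"]:
        print(path, is_disallowed(path, RULES))
```

Because unanchored rules are prefix matches, a rule such as `/feed` would also cover a path like `/feedback` under this interpretation.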

mediapartners-google

Rule Path
Disallow
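
An empty Disallow path is conventionally read as "nothing is disallowed" for that user agent. The sketch below checks this with Python's standard `urllib.robotparser`, using a shortened, hand-written approximation of the file (the sample text and the `SomeBot` agent name are assumptions, not the scanned file itself). The standard parser does not interpret `*` or `$` inside paths, so only plain prefix rules are included.

```python
from urllib.robotparser import RobotFileParser

# Shortened, hand-written approximation of the two groups shown above.
SAMPLE = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /feed

User-agent: mediapartners-google
Disallow:
"""

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())

# The empty Disallow leaves everything open to mediapartners-google.
adsense_allowed = parser.can_fetch("mediapartners-google", "https://theblog.ca/feed")

# Any other agent falls back to the "*" group, where /feed is disallowed.
other_allowed = parser.can_fetch("SomeBot", "https://theblog.ca/feed")

print(adsense_allowed, other_allowed)
```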