discountmags.ca
robots.txt

Robots Exclusion Standard data for discountmags.ca

Resource Scan

Scan Details

Site Domain discountmags.ca
Base Domain discountmags.ca
Scan Status Ok
Last Scan2024-09-25T08:19:30+00:00
Next Scan 2024-10-02T08:19:30+00:00

Last Scan

Scanned2024-09-25T08:19:30+00:00
URL https://discountmags.ca/robots.txt
Redirect https://www.discountmags.ca:443/robots.txt
Redirect Domain www.discountmags.ca
Redirect Base discountmags.ca
Domain IPs 34.236.46.53, 52.4.176.253
Redirect IPs 34.236.46.53, 52.4.176.253
Response IP 34.236.46.53
Found Yes
Hash e0f92f5958fa25c5d4c532656a910bde5e5f688fd7187bf00d680183206ac8b7
SimHash 3561c2a29357

Groups

mediapartners-google*

Rule Path
Disallow /cgi-bin/

mj12bot

Rule Path
Disallow

msnbot

Rule Path
Disallow /cgi-bin/

psbot

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

crescent

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

webemailextrac.*

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

zeus.*webster

Rule Path
Disallow /

microsoft.url

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

slurp

Rule Path
Disallow /cgi-bin/

twiceler-0.9

Rule Path
Disallow /

yahoo

Rule Path
Disallow /cgi-bin/

*

Rule Path
Disallow /cgi-bin/

*

Rule Path
Disallow /page/tos.html

*

Rule Path
Allow /datafeed/PowerReviews/pwr/engine/

*

Rule Path
Disallow /datafeed/
Disallow /deals/
Disallow /voucher/
Disallow *privacypopup.html
Disallow /process

Other Records

Field Value
sitemap https://www.discountmags.ca/sitemap.xml

Warnings

  • 2 invalid lines.