cancel.io
robots.txt

Robots Exclusion Standard data for cancel.io

Resource Scan

Scan Details

Site Domain cancel.io
Base Domain cancel.io
Scan Status Ok
Last Scan2024-10-07T20:29:54+00:00
Next Scan 2024-10-14T20:29:54+00:00

Last Scan

Scanned2024-10-07T20:29:54+00:00
URL https://cancel.io/robots.txt
Response IP 135.148.120.16
Found Yes
Hash 331714d4db95b0c33d72f2748490d01cbc2bebb8cb86745c5d470f3097047bfe
SimHash c83684624020

Groups

*

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sirdatabot

Rule Path
Disallow /

lcc

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /