octo.net
robots.txt

Robots Exclusion Standard data for octo.net

Resource Scan

Scan Details

Site Domain octo.net
Base Domain octo.net
Scan Status Ok
Last Scan2024-06-20T23:47:08+00:00
Next Scan 2024-06-27T23:47:08+00:00

Last Scan

Scanned2024-06-20T23:47:08+00:00
URL https://octo.net/robots.txt
Domain IPs 104.21.234.132, 104.21.234.133
Response IP 104.21.234.133
Found Yes
Hash 04cf557a689a96da3cf2f13d2f8622d2cf815f27979da665fb85ecaa234754ed
SimHash d05c8f60cf99

Groups

*

Rule Path
Disallow /-sys-/
Disallow /~sys~/
Disallow /~sys~/

googleother

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

sitecheckerbotcrawler

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /