800notes.com
robots.txt

Robots Exclusion Standard data for 800notes.com

Resource Scan

Scan Details

Site Domain 800notes.com
Base Domain 800notes.com
Scan Status Ok
Last Scan2024-11-01T05:37:53+00:00
Next Scan 2024-11-08T05:37:53+00:00

Last Scan

Scanned2024-11-01T05:37:53+00:00
URL https://800notes.com/robots.txt
Domain IPs 104.21.15.104, 172.67.162.43
Response IP 104.21.15.104
Found Yes
Hash 6bc4b136aa91e902f291cdde515e02fbd7c21a00f9bea9bd4124290be4434dfe
SimHash d95c04408ddb

Groups

*

Rule Path
Disallow /-sys-/
Disallow /~sys~/
Disallow /~sys~/
Disallow /nb/
Disallow /awl/
Disallow /forum/nb/
Disallow /news/nb/
Disallow /arts/nb/
Disallow /videos/nb/

googleother

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

owler

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

sitecheckerbotcrawler

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /