littlenotebook.uk
robots.txt

Robots Exclusion Standard data for littlenotebook.uk

Resource Scan

Scan Details

Site Domain littlenotebook.uk
Base Domain littlenotebook.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-08-11T10:27:30+00:00
Next Scan 2025-11-09T10:27:30+00:00

Last Successful Scan

Scanned2025-04-14T10:19:38+00:00
URL https://littlenotebook.uk/robots.txt
Domain IPs 77.72.1.48
Response IP 77.72.1.48
Found Yes
Hash 561bf6d89c9f67b204ced8f4dceaa3e18f6c3ae940c2ffbb73918321b02bc0e4
SimHash 71106950c2a5

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /?p=*
Disallow */feed

feedfetcher-google

Rule Path
Allow /*feed

yandexturbo/1.0

Rule Path
Allow /*feed

bingbot

Rule Path
Allow /*feed

mediapartners-google*

Rule Path
Allow /

amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
gptbot
imagesiftbot
magpie-crawler
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot

Rule Path
Disallow /