blog.supercookie.co.uk
robots.txt

Robots Exclusion Standard data for blog.supercookie.co.uk

Archived Snapshots

Resource Scan

Scan Details

Site Domain	blog.supercookie.co.uk
Base Domain	supercookie.co.uk
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-06-30T11:04:50+00:00
Next Scan	2025-09-28T11:04:50+00:00

Last Successful Scan

Scanned	2024-08-27T03:34:37+00:00
URL	https://blog.supercookie.co.uk/robots.txt
Domain IPs	74.114.154.18, 74.114.154.22
Response IP	74.114.154.18
Found	Yes
Hash	812e5becf6706b43b55056ea67e78e7e7a55f7c841661b0fffab8151c1545bb4
SimHash	6b9cd8438406

Groups

*

Rule	Path
Disallow	/random
Disallow	/day
Disallow	/sticky-ad-iframe.html
Disallow	/privacy/consent

Rule

Path

Disallow

/random

Disallow

/day

Disallow

/sticky-ad-iframe.html

Disallow

/privacy/consent

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

sentibot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

imagesiftbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://blog.supercookie.co.uk/sitemap.xml

Field

Value

sitemap

https://blog.supercookie.co.uk/sitemap.xml

Comments

Common Crawl's crawler
SentiBot's crawler
Google Bard's crawler
Facebook's crawler
webz.io's crawler
webz.io's crawler
Amazon's crawler
ClaudeBot's crawler
anthropic-ai's crawler
ImageSift's AI crawler
Apple's AI crawler
TurnitinBot crawler

blog.supercookie.co.ukrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Other Records

ccbot

sentibot

google-extended

facebookbot

omgili

omgilibot

amazonbot

claudebot

anthropic-ai

imagesiftbot

applebot-extended

turnitinbot

Other Records

Comments

blog.supercookie.co.uk
robots.txt