19centuryman-blog.tumblr.com
robots.txt

Robots Exclusion Standard data for 19centuryman-blog.tumblr.com

Resource Scan

Scan Details

Site Domain	19centuryman-blog.tumblr.com
Base Domain	tumblr.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-11-10T14:03:09+00:00
Next Scan	2025-01-09T14:03:09+00:00
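
The gap between the two timestamps above gives the rescan interval. A minimal sketch of that arithmetic in Python, using only the values from the table (the variable names are illustrative and not part of the scan data):

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-11-10T14:03:09+00:00")
next_scan = datetime.fromisoformat("2025-01-09T14:03:09+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 60
```

Whether the scanner always uses this interval or lengthens it after a failed scan is not stated in the report.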

Last Successful Scan

Scanned	2024-09-05T10:33:43+00:00
URL	https://19centuryman-blog.tumblr.com/robots.txt
Domain IPs	74.114.154.18, 74.114.154.22
Response IP	74.114.154.18
Found	Yes
Hash	7c95fe5efbc57ee781a426f20de2c18ccd5d3482d8d1b32ac605b4bbcf81c43f
SimHash	eb9cda438406
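
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Under that assumption, a fetched copy of the file could be compared with the recorded value roughly as follows. The function names `fingerprint` and `is_unchanged` are hypothetical, and the scanner may normalize the body before hashing, which this sketch does not attempt.

```python
import hashlib

# Hash recorded in the Last Successful Scan table.
RECORDED_HASH = "7c95fe5efbc57ee781a426f20de2c18ccd5d3482d8d1b32ac605b4bbcf81c43f"


def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of the raw response body.

    Assumption: the scanner hashes the unmodified bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Report whether a fetched body matches the recorded hash."""
    return fingerprint(body) == recorded
```

The SimHash field is a separate similarity fingerprint and is not covered by this sketch.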

Groups

*

Rule	Path
Disallow	/random
Disallow	/day
Disallow	/sticky-ad-iframe.html
Disallow	/privacy/consent

Other Records

Field	Value
crawl-delay	1

ccbot

Rule	Path
Disallow	/

sentibot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

facebookbot

Rule	Path
Disallow	/

omgili

Rule	Path
Disallow	/

omgilibot

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

imagesiftbot

Rule	Path
Disallow	/

applebot-extended

Rule	Path
Disallow	/

turnitinbot

Rule	Path
Disallow	/

meta-externalagent

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://19centuryman-blog.tumblr.com/sitemap.xml

Comments

Common Crawl's crawler
SentiBot's crawler
Google Bard's crawler
Facebook's crawler
webz.io's crawler
webz.io's crawler
Amazon's crawler
ClaudeBot's crawler
anthropic-ai's crawler
ImageSift's AI crawler
Apple's AI crawler
TurnitinBot crawler
Meta AI crawler
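
The groups and records listed above can be checked against Python's standard `urllib.robotparser`. The sketch below rebuilds an approximate robots.txt from the scan tables. The exact layout, ordering, and comment placement of the original file are not shown in the scan, so `ROBOTS_TXT` is a reconstruction, and `ExampleBot` is a made-up user agent used only to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

BASE = "https://19centuryman-blog.tumblr.com"

# User agents that the scan lists with a single "Disallow: /" rule.
BLOCKED_AGENTS = [
    "ccbot", "sentibot", "google-extended", "facebookbot", "omgili",
    "omgilibot", "amazonbot", "claudebot", "anthropic-ai", "imagesiftbot",
    "applebot-extended", "turnitinbot", "meta-externalagent",
]

# Reconstructed file: the wildcard group first, then one group per blocked
# agent, then the sitemap record. This ordering is an assumption.
lines = [
    "User-agent: *",
    "Disallow: /random",
    "Disallow: /day",
    "Disallow: /sticky-ad-iframe.html",
    "Disallow: /privacy/consent",
    "Crawl-delay: 1",
    "",
]
for agent in BLOCKED_AGENTS:
    lines += [f"User-agent: {agent}", "Disallow: /", ""]
lines.append(f"Sitemap: {BASE}/sitemap.xml")
ROBOTS_TXT = "\n".join(lines)

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A crawler without its own group falls back to the "*" rules.
print(parser.can_fetch("ExampleBot", f"{BASE}/post/1"))   # True
print(parser.can_fetch("ExampleBot", f"{BASE}/random"))   # False
# A crawler with its own group is blocked from every path.
print(parser.can_fetch("CCBot", f"{BASE}/post/1"))        # False
print(parser.crawl_delay("ExampleBot"))                   # 1
# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

`urllib.robotparser` matches rules by simple path prefix and user agents by case-insensitive substring, so other crawlers may interpret the same file slightly differently.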
