christophermarsh.org
robots.txt

Robots Exclusion Standard data for christophermarsh.org

Resource Scan

Scan Details

Site Domain	christophermarsh.org
Base Domain	christophermarsh.org
Scan Status	Ok
Last Scan	2024-09-13T00:24:01+00:00
Next Scan	2024-10-13T00:24:01+00:00

Last Scan

Scanned	2024-09-13T00:24:01+00:00
URL	https://christophermarsh.org/robots.txt
Domain IPs	192.0.78.150, 192.0.78.249
Response IP	192.0.78.249
Found	Yes
Hash	f8a5182d2cd7f717bb3f064b37475c7f67c7f7d8d452ee7c5e5d5b17e444f531
SimHash	39930802a734

Groups

*

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

applebot-extended

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

facebookbot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

omgili

Rule	Path
Disallow	/

omgilibot

Rule	Path
Disallow	/

perplexitybot

Rule	Path
Disallow	/

sentibot

Rule	Path
Disallow	/

sentibot

Rule	Path
Disallow	/

Comments

This file was generated on Wed, 07 Aug 2024 15:27:54 +0000
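
The groups above all carry a single "Disallow: /" rule, so every listed crawler (and, through the "*" group, every other crawler) is asked to stay off the whole site. A minimal sketch of how that could be checked with Python's standard urllib.robotparser follows. The robots.txt text is rebuilt from the scan table rather than copied from the live file, so its exact wording and casing are assumptions, and build_robots_txt and is_allowed are hypothetical helper names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# User-agent groups as listed in the scan (the scanner reports them lowercased;
# the casing in the real file is not known from the report).
AGENTS = [
    "*", "amazonbot", "anthropic-ai", "applebot-extended", "bytespider",
    "ccbot", "claudebot", "facebookbot", "google-extended", "gptbot",
    "omgili", "omgilibot", "perplexitybot", "sentibot",
]


def build_robots_txt(agents):
    """Rebuild an equivalent robots.txt: one 'Disallow: /' group per agent."""
    blocks = [f"User-agent: {agent}\nDisallow: /\n" for agent in agents]
    return "\n".join(blocks)


def is_allowed(robots_txt, user_agent, url):
    """Return True if user_agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Reconstructed rules, an assumption based on the Groups table.
ROBOTS_TXT = build_robots_txt(AGENTS)

if __name__ == "__main__":
    site = "https://christophermarsh.org/"
    for agent in ("gptbot", "claudebot", "some-other-crawler"):
        print(agent, is_allowed(ROBOTS_TXT, agent, site))
```

Because the "*" group disallows "/", a crawler that is not named in any group falls back to that rule and is refused as well, which is why the unlisted agent in the usage loop is expected to be treated the same way as the named ones.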
