stevenpemberton.net
robots.txt

Robots Exclusion Standard data for stevenpemberton.net

Resource Scan

Scan Details

Site Domain	stevenpemberton.net
Base Domain	stevenpemberton.net
Scan Status	Ok
Last Scan	2025-09-22T21:42:05+00:00
Next Scan	2025-10-22T21:42:05+00:00

Last Scan

Scanned	2025-09-22T21:42:05+00:00
URL	https://stevenpemberton.net/robots.txt
Domain IPs	104.21.75.191, 172.67.180.225, 2606:4700:3030::ac43:b4e1, 2606:4700:3033::6815:4bbf
Response IP	172.67.180.225
Found	Yes
Hash	b0a14ca1da0ea472cb494e48f17eb49814fc4c9f8df026a31f8887352e39f6fa
SimHash	2c1151526384
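
The report does not name the algorithm behind the Hash value. Its length (64 hexadecimal characters) is consistent with a SHA-256 digest, so the following sketch assumes SHA-256 over the raw response body. The helper names `sha256_hex` and `matches_reported` are hypothetical, and the exact bytes the scanner hashes (for example, before or after any normalization) are not stated in the report.

```python
import hashlib

# Hash value copied from the "Last Scan" table above.
REPORTED_HASH = "b0a14ca1da0ea472cb494e48f17eb49814fc4c9f8df026a31f8887352e39f6fa"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of `body` as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes) -> bool:
    """Compare a fetched robots.txt body against the reported hash.

    Assumption: the scanner hashes the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == REPORTED_HASH
```

To use it, pass the bytes fetched from the URL listed above to `matches_reported`. A mismatch may mean either that the file has changed since the scan or that the assumption about the algorithm is wrong.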

Groups

amazonbot

Rule	Path
Disallow	/

applebot-extended

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

meta-externalagent

Rule	Path
Disallow	/

googlebot

Rule	Path
Disallow

googlebot-image

Rule	Path
Disallow

googlebot-mobile

Rule	Path
Disallow

msnbot

Rule	Path
Disallow

slurp

Rule	Path
Disallow

teoma

Rule	Path
Disallow

yahoo-mmcrawler

Rule	Path
Disallow

yahoo-blogs/v3.9

Rule	Path
Disallow

*

Rule	Path
Disallow
Disallow	/cgi-bin/
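
How these groups combine can be sketched with a small evaluator. This is a minimal illustration, not the scanner's logic: it assumes exact, case-insensitive matching of the user-agent token, plain prefix matching without wildcards, and the longest-match rule with "allow" winning ties (the approach described in RFC 9309). An empty Disallow row is treated as matching nothing, so it places no restriction. `GROUPS` and `is_allowed` are hypothetical names, and only a few of the listed groups are transcribed.

```python
# Groups transcribed from the tables above; an empty path mirrors an
# empty "Disallow" row. Only a subset of the listed groups is included.
GROUPS = {
    "gptbot": [("disallow", "/")],
    "claudebot": [("disallow", "/")],
    "googlebot": [("disallow", "")],
    "*": [("disallow", ""), ("disallow", "/cgi-bin/")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be fetched by `user_agent`."""
    # Fall back to the "*" group when no group names this agent.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = 0
    allowed = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        # An empty pattern matches nothing, so it never restricts access.
        if not prefix or not path.startswith(prefix):
            continue
        is_allow = kind == "allow"
        # Longest prefix wins; on a tie, "allow" takes precedence.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


print(is_allowed("GPTBot", "/index.html"))          # False
print(is_allowed("Googlebot", "/cgi-bin/test"))     # True
print(is_allowed("SomeOtherBot", "/cgi-bin/test"))  # False
print(is_allowed("SomeOtherBot", "/about"))         # True
```

Under these assumptions, the crawlers listed with `Disallow /` are blocked everywhere, the named search crawlers are unrestricted, and every other agent is kept out of `/cgi-bin/` only. Real parsers differ in detail; some apply the first matching rule instead of the longest one, which would change the result for the `*` group.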

Other Records

Field	Value
sitemap	https://stevenpemberton.net/

Comments

NOTICE: The collection of content and other data on this
site through automated means, including any device, tool,
or process designed to data mine or scrape content, is
prohibited except (1) for the purpose of search engine indexing or
artificial intelligence retrieval augmented generation or (2) with express
written permission from this site’s operator.
To request permission to license our intellectual
property and/or other materials, please contact this
site’s operator directly.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content

Warnings

1 invalid line.
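
The report does not say which line is invalid or what criteria the scanner applies. As a rough sketch, the check below assumes that a line is valid when it is blank, a `#` comment, or a `field: value` record, and flags everything else. The function name `invalid_lines` and the sample text are illustrative and are not taken from the scanned file.

```python
import re

# A record is a field name, a colon, then any value (possibly empty).
RECORD = re.compile(r"^\s*[A-Za-z][A-Za-z0-9_-]*\s*:.*$")


def invalid_lines(text: str) -> list[tuple[int, str]]:
    """Return (line number, line) pairs that are not blank, comment, or record."""
    bad = []
    for number, line in enumerate(text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue  # blank lines and comments are always acceptable
        if RECORD.match(line) is None:
            bad.append((number, line))
    return bad


# Invented sample, not the real file.
sample = "User-agent: *\nDisallow: /cgi-bin/\nthis line has no colon\n# a comment\n"
print(invalid_lines(sample))  # [(3, 'this line has no colon')]
```

A stricter check might also reject unknown field names or records that appear before any `User-agent` line; whether the scanner does so is not stated.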
