diontraejackson.com
robots.txt

Robots Exclusion Standard data for diontraejackson.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	diontraejackson.com
Base Domain	diontraejackson.com
Scan Status	Ok
Last Scan	2025-12-09T06:52:48+00:00
Next Scan	2026-01-08T06:52:48+00:00

Last Scan

Scanned	2025-12-09T06:52:48+00:00
URL	https://www.diontraejackson.com/robots.txt
Domain IPs	104.18.132.62, 104.18.133.62, 104.18.134.62, 104.18.135.62, 104.18.136.62
Response IP	104.18.136.62
Found	Yes
Hash	9153796b7082d3096a82c6e15232869bc61c4fe593beed5f28e871c4bf87046d
SimHash	010c914080f2

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

/

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

/

nerdybot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

semrushbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	60

Field

Value

crawl-delay

60

bubing

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.diontraejackson.com/sitemap.xml

Field

Value

sitemap

https://www.diontraejackson.com/sitemap.xml

Back to top

Comments

Cloudflare crawl control rules

Back to top

Warnings

`content-signal` is not a known field.

Back to top

diontraejackson.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

google-extended

gptbot

meta-externalagent

nerdybot

semrushbot

Other Records

bubing

Other Records

Comments

Warnings

diontraejackson.com
robots.txt