pinejournal.com
robots.txt

Robots Exclusion Standard data for pinejournal.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	pinejournal.com
Base Domain	pinejournal.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-11-02T03:02:55+00:00
Next Scan	2025-01-31T03:02:55+00:00

Last Successful Scan

Scanned	2024-07-06T03:00:35+00:00
URL	https://pinejournal.com/robots.txt
Domain IPs	13.33.88.100, 13.33.88.30, 13.33.88.43, 13.33.88.97
Response IP	13.33.88.100
Found	Yes
Hash	d12b06b59130d4fd8fe2b6c3c1e73c208741fdecf3f7dc3e16738f15430c4c7a
SimHash	6a14d048e292

Groups

*

Rule	Path
Disallow	/search
Disallow	/cms

Rule

Path

Disallow

/search

Disallow

/cms

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

cohere-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

ia_archiver

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.pinejournal.com/sitemap.xml

Field

Value

sitemap

https://www.pinejournal.com/sitemap.xml

Comments

Sitemap

pinejournal.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Other Records

ccbot

gptbot

chatgpt-user

anthropic-ai

cohere-ai

ia_archiver

omgili

omgilibot

mj12bot

piplbot

google-extended

bytespider

petalbot

Other Records

Comments

pinejournal.com
robots.txt