blackwelljournaltribune.net
robots.txt

Robots Exclusion Standard data for blackwelljournaltribune.net

Resource Scan

Scan Details

Site Domain	blackwelljournaltribune.net
Base Domain	blackwelljournaltribune.net
Scan Status	Ok
Last Scan	2024-10-27T04:33:54+00:00
Next Scan	2024-11-03T04:33:54+00:00

Last Scan

Scanned	2024-10-27T04:33:54+00:00
URL	https://blackwelljournaltribune.net/robots.txt
Domain IPs	104.154.203.214
Response IP	104.154.203.214
Found	Yes
Hash	bccc2b8df4f581b4b9c31667ee1546f0612e215df1cd1c9f43e6860396a5992d
SimHash	a20d1a85ec55

Groups

*

Rule	Path
Disallow	/*?*page=*
Disallow	/editions/*
Disallow	/users/*
Disallow	/feed
Disallow	/feeds
Disallow	/rss
Disallow	/*?*q=*

Other Records

Field	Value
crawl-delay	5
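
The Disallow paths in the `*` group rely on `*` wildcards (for example `/*?*page=*`), so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of wildcard matching in the style described by RFC 9309, assuming `*` matches any run of characters and a trailing `$` anchors the end of the URL. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores Allow rules and longest-match precedence because this group contains none.

```python
import re

# Disallow paths from the "*" group, with wildcards as listed in the scan.
DISALLOW = [
    "/*?*page=*",
    "/editions/*",
    "/users/*",
    "/feed",
    "/feeds",
    "/rss",
    "/*?*q=*",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(p).match(path_and_query) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/news?page=2", "/?q=tornado", "/editions/2024-10-27",
                      "/feedback", "/news/local-story"]:
        print(candidate, is_disallowed(candidate))
```

Because rules match as prefixes, `/feed` also covers paths such as `/feedback`, which may or may not be what the site intends.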

semrushbot

Rule	Path
Disallow	/

petalbot

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://s3.amazonaws.com/cjp-public-access/sitemaps/bjt/sitemap.xml.gz
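
As a rough illustration of how a crawler might consume these records, the sketch below rebuilds a robots.txt from the groups and fields listed in this report and reads it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption (field order, casing, and spacing of the real file are not shown in the scan), and `ExampleBot` is a hypothetical user agent. Note that the standard-library parser compares paths as plain prefixes and does not expand `*` wildcards, so it is used here only for the crawl delay, the per-agent blocks, the non-wildcard paths, and the sitemap.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the scan data above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /*?*page=*
Disallow: /editions/*
Disallow: /users/*
Disallow: /feed
Disallow: /feeds
Disallow: /rss
Disallow: /*?*q=*
Crawl-delay: 5

User-agent: semrushbot
Disallow: /

User-agent: petalbot
Disallow: /

Sitemap: https://s3.amazonaws.com/cjp-public-access/sitemaps/bjt/sitemap.xml.gz
"""

SITE = "https://blackwelljournaltribune.net"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Crawl delay that applies to an agent without its own group.
default_delay = parser.crawl_delay("ExampleBot")

# Agents with a dedicated "Disallow: /" group are blocked everywhere.
semrush_allowed = parser.can_fetch("SemrushBot", SITE + "/")
petal_allowed = parser.can_fetch("PetalBot", SITE + "/")

# Non-wildcard rule from the "*" group, matched as a plain prefix.
feed_allowed = parser.can_fetch("ExampleBot", SITE + "/feed")

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
sitemaps = parser.site_maps()

if __name__ == "__main__":
    print(default_delay, semrush_allowed, petal_allowed, feed_allowed, sitemaps)
```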

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
