hardenexpress.com.au
robots.txt

Robots Exclusion Standard data for hardenexpress.com.au

Archived Snapshots

Resource Scan

Scan Details

Site Domain	hardenexpress.com.au
Base Domain	hardenexpress.com.au
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-11-01T00:06:49+00:00
Next Scan	2024-12-01T00:06:49+00:00

Last Successful Scan

Scanned	2024-10-02T21:56:11+00:00
URL	https://hardenexpress.com.au/robots.txt
Redirect	https://www.hardenexpress.com.au/robots.txt
Redirect Domain	www.hardenexpress.com.au
Redirect Base	hardenexpress.com.au
Domain IPs	13.225.4.11, 13.225.4.17, 13.225.4.50, 13.225.4.96
Redirect IPs	108.156.133.101, 108.156.133.40, 108.156.133.77, 108.156.133.94
Response IP	108.156.133.40
Found	Yes
Hash	a2139350919212be26341527b5ae9904ad78e04621cb932e986fc230c25479f4
SimHash	452c9484a8a5

Groups

*

Rule	Path
Disallow	/*/ajax/
Disallow	/*/ajax
Disallow	/*/internal/
Disallow	/push-worker.js
Disallow	/scores-and-draws/
Disallow	*/rss.xml
Disallow	*/rss-full.xml
Disallow	*/feed
Disallow	*/feed/

Rule

Path

Disallow

/*/ajax/

Disallow

/*/ajax

Disallow

/*/internal/

Disallow

/push-worker.js

Disallow

/scores-and-draws/

Disallow

*/rss.xml

Disallow

*/rss-full.xml

Disallow

*/feed

Disallow

*/feed/

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

grapeshot

Rule	Path
Disallow

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

/

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

/

cohere-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

/

printfriendly.com

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.hardenexpress.com.au/sitemap.xml
sitemap	https://www.hardenexpress.com.au/sitemap-news.xml

Field

Value

sitemap

https://www.hardenexpress.com.au/sitemap.xml

sitemap

https://www.hardenexpress.com.au/sitemap-news.xml

Back to top

Comments

Agent Specific Disallowed Sections
Agent Specific Disallowed Feeds
IVT recommendation
Teads recommendation
OpenAI disallowed GPT crawling
Blocking AI web crawlers from accessing content behind paywall

Back to top

hardenexpress.com.aurobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

petalbot

grapeshot

gptbot

google-extended

ccbot

chatgpt-user

anthropic-ai

cohere-ai

printfriendly.com

Other Records

Comments

hardenexpress.com.au
robots.txt