cowraguardian.com.au
robots.txt

Robots Exclusion Standard data for cowraguardian.com.au

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cowraguardian.com.au
Base Domain	cowraguardian.com.au
Scan Status	Ok
Last Scan	2024-05-15T20:07:09+00:00
Next Scan	2024-05-22T20:07:09+00:00

Last Scan

Scanned	2024-05-15T20:07:09+00:00
URL	https://cowraguardian.com.au/robots.txt
Redirect	https://www.cowraguardian.com.au/robots.txt
Redirect Domain	www.cowraguardian.com.au
Redirect Base	cowraguardian.com.au
Domain IPs	13.225.4.11, 13.225.4.17, 13.225.4.50, 13.225.4.96
Redirect IPs	13.225.4.127, 13.225.4.23, 13.225.4.6, 13.225.4.98
Response IP	13.225.4.6
Found	Yes
Hash	ab9ff7a3636da13d1dabf0fc4e5572176bef2205702fd60c3af34cbbb19b99c2
SimHash	c566b4d7ecf5

Groups

*

Rule	Path
Disallow	/*/ajax/
Disallow	/*/ajax
Disallow	/*/internal/
Disallow	/push-worker.js
Disallow	/scores-and-draws/
Disallow	*/rss.xml
Disallow	*/rss-full.xml
Disallow	*/feed
Disallow	*/feed/

Rule

Path

Disallow

/*/ajax/

Disallow

/*/ajax

Disallow

/*/internal/

Disallow

/push-worker.js

Disallow

/scores-and-draws/

Disallow

*/rss.xml

Disallow

*/rss-full.xml

Disallow

*/feed

Disallow

*/feed/

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

grapeshot

Rule	Path
Disallow

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://www.cowraguardian.com.au/sitemap.xml
sitemap	https://www.cowraguardian.com.au/sitemap-news.xml

Field

Value

sitemap

https://www.cowraguardian.com.au/sitemap.xml

sitemap

https://www.cowraguardian.com.au/sitemap-news.xml

Back to top

Comments

Agent Specific Disallowed Sections
Agent Specific Disallowed Feeds
IVT recommendation
Teads recommendation
OpenAI disallowed GPT crawling

Back to top

cowraguardian.com.aurobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

petalbot

grapeshot

gptbot

Other Records

Comments

cowraguardian.com.au
robots.txt