broadsheet.com.au
robots.txt

Robots Exclusion Standard data for broadsheet.com.au

Resource Scan

Scan Details

Site Domain	broadsheet.com.au
Base Domain	broadsheet.com.au
Scan Status	Ok
Last Scan	2024-09-25T14:02:03+00:00
Next Scan	2024-10-02T14:02:03+00:00

Last Scan

Scanned	2024-09-25T14:02:03+00:00
URL	https://broadsheet.com.au/robots.txt
Redirect	https://www.broadsheet.com.au/robots.txt
Redirect Domain	www.broadsheet.com.au
Redirect Base	broadsheet.com.au
Domain IPs	35.189.28.162
Redirect IPs	35.189.28.162
Response IP	35.189.28.162
Found	Yes
Hash	968077358e429c54ee762aa4f168c087d2cff321bb9b221a22f1247f85971c21
SimHash	51107c24e193

Groups

*

Rule	Path
Disallow	/melbourne/subscribed/
Disallow	/melbourne/admin
Disallow	/sydney/subscribed/
Disallow	/sydney/admin
Disallow	/admin
Disallow	/usersearch/
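
The rules in the table above can be checked against sample paths. What follows is a minimal sketch using Python's standard urllib.robotparser, which does simple prefix matching. The robots.txt text is reconstructed from the table and may not match the live file byte for byte; the agent name "ExampleBot", the sample paths, and the helper is_allowed are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group table; layout of the real file is assumed.
WILDCARD_GROUP = """\
User-agent: *
Disallow: /melbourne/subscribed/
Disallow: /melbourne/admin
Disallow: /sydney/subscribed/
Disallow: /sydney/admin
Disallow: /admin
Disallow: /usersearch/
"""

BASE_URL = "https://www.broadsheet.com.au"


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the "*" group rules."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(WILDCARD_GROUP.splitlines())
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    # Sample paths are illustrative only.
    for path in ("/melbourne/admin/login", "/sydney/subscribed/", "/melbourne/"):
        print(path, is_allowed(path))
```

Because matching is by prefix, a rule such as /admin also covers any longer path that begins with /admin.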

gptbot

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

claude-web

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

cohere-ai

Rule	Path
Disallow	/

perplexitybot

Rule	Path
Disallow	/

perplexity-ai

Rule	Path
Disallow	/

meltwater

Rule	Path
Disallow	/

seekr

Rule	Path
Disallow	/
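
Each named group from gptbot through seekr carries a single Disallow / rule, which blocks that agent from the whole site. A minimal sketch of how this plays out, again with urllib.robotparser: the robots.txt text is rebuilt from the group names listed in this scan, the "*" group is abbreviated to one rule, and the helper names and the "ExampleBot" agent are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Group names as listed in this scan; each has the single rule "Disallow: /".
BLOCKED_AGENTS = [
    "gptbot", "chatgpt-user", "google-extended", "claude-web", "claudebot",
    "anthropic-ai", "cohere-ai", "perplexitybot", "perplexity-ai",
    "meltwater", "seekr",
]


def build_robots_txt():
    """Rebuild an approximate robots.txt (the "*" group is abbreviated)."""
    groups = ["User-agent: *\nDisallow: /admin"]
    for agent in BLOCKED_AGENTS:
        groups.append(f"User-agent: {agent}\nDisallow: /")
    # Blank lines separate groups.
    return "\n\n".join(groups) + "\n"


def can_fetch(agent, path="/melbourne/"):
    """Return True if `agent` may fetch `path` under the rebuilt rules."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser.can_fetch(agent, "https://www.broadsheet.com.au" + path)


if __name__ == "__main__":
    for agent in BLOCKED_AGENTS + ["ExampleBot"]:
        print(agent, can_fetch(agent))
```

An agent with its own group is judged by that group alone, so the blocked agents are refused everywhere, while an unlisted agent falls back to the "*" group.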

Other Records

Field	Value
sitemap	https://www.broadsheet.com.au/sitemap/index
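
The sitemap record can be read back with the same standard-library parser. This is a minimal sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available); the robots.txt text is an abbreviated reconstruction and the helper list_sitemaps is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction; only the sitemap line is taken from the scan.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin

Sitemap: https://www.broadsheet.com.au/sitemap/index
"""


def list_sitemaps(robots_txt):
    """Return the sitemap URLs declared in a robots.txt string."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    print(list_sitemaps(ROBOTS_TXT))
```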

