environment.act.gov.au
robots.txt

Robots Exclusion Standard data for environment.act.gov.au

Resource Scan

Scan Details

Site Domain environment.act.gov.au
Base Domain act.gov.au
Scan Status Ok
Last Scan2026-01-23T14:58:15+00:00
Next Scan 2026-02-06T14:58:15+00:00

Last Scan

Scanned2026-01-23T14:58:15+00:00
URL https://environment.act.gov.au/robots.txt
Redirect https://www.environment.act.gov.au/robots.txt
Redirect Domain www.environment.act.gov.au
Redirect Base act.gov.au
Domain IPs 2.58.104.10, 2.58.104.11
Redirect IPs 2.58.104.10, 2.58.104.11
Response IP 2.58.104.10
Found Yes
Hash f68f38b5d6fd0c0015a39372e7943214a80b80ef3f4ef7ba68b467c7157fa8f6
SimHash 4d1cd860eb93

Groups

*

Rule Path
Disallow /search
Disallow /?sq_content_src=
Disallow /_recache
Disallow /_edit
Disallow /_admin
Disallow /_login
Disallow /_performance
Disallow /designs
Disallow /_designs
Disallow /_web_services
Disallow /heritage/festival-2023-onwards/heritage-festival/program/event-management-modules
Disallow /admin
Disallow /asset-listings
Disallow /schemas
Disallow /paint-layouts
Disallow /alerts
Disallow /nested-content
Disallow /asset-listings
Disallow /2022-images
Disallow /nested-content
Disallow /paint-layouts
Disallow /resources
Disallow /schemas
Disallow /r

Other Records

Field Value
crawl-delay 3

funnelback

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 0

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.environment.act.gov.au/sitemap.xml

Comments

  • www.robotstxt.org/
  • http://code.google.com/web/controlcrawlindex/