abc.net.au
robots.txt

Robots Exclusion Standard data for abc.net.au

Resource Scan

Scan Details

Site Domain abc.net.au
Base Domain abc.net.au
Scan Status Ok
Last Scan 2024-10-22T19:21:57+00:00
Next Scan 2024-11-05T19:21:57+00:00

Last Scan

Scanned 2024-10-22T19:21:57+00:00
URL https://abc.net.au/robots.txt
Redirect https://www.abc.net.au/robots.txt
Redirect Domain www.abc.net.au
Redirect Base abc.net.au
Domain IPs 184.51.96.134
Redirect IPs 104.83.196.95
Response IP 184.51.96.134
Found Yes
Hash 1c3c75188bf00c9e0714970303b65ad97f67bea89ffdc9ea579b184dd4396245
SimHash 4444b01f4d14
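
The Hash value above is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not state which algorithm the scanner uses or whether the body is normalized first, so the following is only a sketch of how a content hash can be used to detect changes between scans. The function name body_fingerprint and the sample bodies are hypothetical.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of raw robots.txt bytes.

    Assumption: SHA-256 over the unmodified body. The scanner's
    real algorithm and any normalization are not documented here.
    """
    return hashlib.sha256(body).hexdigest()


# Two hypothetical robots.txt bodies from consecutive scans.
old = body_fingerprint(b"User-agent: *\nDisallow: /res/\n")
new = body_fingerprint(b"User-agent: *\nDisallow: /res/\nDisallow: /beta/\n")

# Any byte-level difference produces a different digest.
changed = old != new
```

An exact hash only says whether the file changed at all. A similarity hash such as the SimHash field is the kind of value used to judge how much it changed.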

Groups

*

Rule Path
Disallow /classic/contact/concerts.htm
Disallow /classic/contact/default.htm
Disallow /classic/contact/eventsdiary.htm
Disallow /classic/contact/formerror.htm
Disallow /classic/contact/formthanks.htm
Disallow /classic/contact/general.htm
Disallow /classic/contact/limelight.htm
Disallow /classic/contact/mailinglist.htm
Disallow /classic/contact/music.htm
Disallow /classic/contact/presenter.htm
Disallow /classic/contact/website.htm
Disallow /classic/contact/word.htm
Disallow /xmlcontent/
Disallow /classicfm/
Disallow /iview/
Disallow /site-archive/
Disallow /corp/
Disallow /contact/
Disallow /homepage/2013/
Disallow /beta/
Disallow /abc4000/
Disallow /res/

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

googlebot-image

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

flipboardproxy

Rule Path
Disallow /news/image/

Other Records

Field Value
crawl-delay 2

isec_bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

trendkite-akashic-crawler

Rule Path
Disallow /

tineye-bot

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.abc.net.au/sitemaps/sitemap-index.xml.gz
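
The groups and records above can be checked with Python's standard urllib.robotparser module. The sketch below uses a shortened robots.txt reconstructed from this report, not the site's file verbatim. The layout of the reconstructed text, the agent name "somebot", and the sample URLs are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction based on the groups listed in this report.
# Assumption: the real file's ordering and grouping may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /iview/
Disallow: /contact/
Disallow: /res/

User-agent: googlebot
Crawl-delay: 5

User-agent: flipboardproxy
Disallow: /news/image/
Crawl-delay: 2

User-agent: gptbot
Disallow: /

Sitemap: https://www.abc.net.au/sitemaps/sitemap-index.xml.gz
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.abc.net.au"  # sample paths below are hypothetical

# An agent without its own group falls back to the "*" rules.
generic_iview = parser.can_fetch("somebot", BASE + "/iview/")
generic_news = parser.can_fetch("somebot", BASE + "/news/")

# googlebot has its own group with no rules, so the "*" rules
# do not apply to it and all paths are allowed.
googlebot_iview = parser.can_fetch("googlebot", BASE + "/iview/")
googlebot_delay = parser.crawl_delay("googlebot")

# flipboardproxy is blocked from one path and has a shorter delay.
flipboard_image = parser.can_fetch("flipboardproxy", BASE + "/news/image/a.jpg")
flipboard_delay = parser.crawl_delay("flipboardproxy")

# gptbot is disallowed everywhere.
gptbot_news = parser.can_fetch("gptbot", BASE + "/news/")

# Sitemap records are global, not tied to a group (Python 3.8+).
sitemaps = parser.site_maps()
```

Note that a crawler matching a named group uses only that group. This is why the report shows "All paths allowed" for googlebot even though the * group disallows many paths.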

Comments

  • robots.txt for https://www.abc.net.au/ -- ABC Online
  • OPSSD-340 2015/5/5
  • INNG-46: 2014-12-30
  • Added for corporate communications, as they have migrated to a new site
  • Added for Homepage Beta, prevent indexing during public beta
  • Added for WCMS Tennent testing, not a public
  • sitemaps