geelongadvertiser.com.au
robots.txt
Robots Exclusion Standard data for geelongadvertiser.com.au
Resource Scan
Scan Details
Site Domain | geelongadvertiser.com.au |
Base Domain | geelongadvertiser.com.au |
Scan Status | Ok |
Last Scan | 2024-09-25T09:39:20+00:00 |
Next Scan | 2024-10-02T09:39:20+00:00 |
Last Scan
Scanned | 2024-09-25T09:39:20+00:00 |
URL | https://geelongadvertiser.com.au/robots.txt |
Redirect | https://www.geelongadvertiser.com.au/robots.txt |
Redirect Domain | www.geelongadvertiser.com.au |
Redirect Base | geelongadvertiser.com.au |
Domain IPs | 23.54.56.122, 2600:1413:b000:386::ebe, 2600:1413:b000:387::ebe, 2600:1413:b000:391::ebe, 2600:1413:b000:394::ebe, 2600:1413:b000:39b::ebe |
Redirect IPs | 23.36.48.116, 2600:1413:b000:381::ebe, 2600:1413:b000:384::ebe, 2600:1413:b000:389::ebe, 2600:1413:b000:38b::ebe, 2600:1413:b000:39a::ebe |
Response IP | 23.54.56.122 |
Found | Yes |
Hash | fcc4c5700d8c66d481bd6cae1a80ea03c2065a5238fd3efd16c9a796218d1a02 |
SimHash | 5135cd51c9d2 |
Groups
*
Rule | Path |
---|---|
Disallow | /*/comments-* |
Disallow | /404 |
Disallow | /enewsletters/* |
Disallow | /doublerainbow/* |
Disallow | /it-test-only/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.geelongadvertiser.com.au/sitemap.xml |
sitemap | https://www.geelongadvertiser.com.au/news-sitemap.xml |
Comments