geelongadvertiser.com.au
robots.txt
Robots Exclusion Standard data for geelongadvertiser.com.au
Resource Scan
Scan Details
Site Domain | geelongadvertiser.com.au |
Base Domain | geelongadvertiser.com.au |
Scan Status | Ok |
Last Scan | 2024-06-12T00:31:34+00:00 |
Next Scan | 2024-06-19T00:31:34+00:00 |
Last Scan
Scanned | 2024-06-12T00:31:34+00:00 |
URL | https://geelongadvertiser.com.au/robots.txt |
Redirect | https://www.geelongadvertiser.com.au/robots.txt |
Redirect Domain | www.geelongadvertiser.com.au |
Redirect Base | geelongadvertiser.com.au |
Domain IPs | 23.54.56.122, 2600:1413:b000:68b::ebe, 2600:1413:b000:694::ebe, 2600:1413:b000:696::ebe, 2600:1413:b000:698::ebe, 2600:1413:b000:69b::ebe |
Redirect IPs | 23.54.56.122, 2600:1413:b000:68e::ebe, 2600:1413:b000:692::ebe, 2600:1413:b000:695::ebe, 2600:1413:b000:696::ebe, 2600:1413:b000:698::ebe |
Response IP | 23.54.56.122 |
Found | Yes |
Hash | 11415f86de21ccbb4db90e23a62ab44b8a0029a616b4c80da2d9c1a75eded2b1 |
SimHash | 50654d5169d3 |
Groups
*
Rule | Path |
---|---|
Disallow | /*/comments-* |
Disallow | /404 |
Disallow | /enewsletters/* |
Disallow | /doublerainbow/* |
Disallow | /it-test-only/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.geelongadvertiser.com.au/sitemap.xml |
sitemap | https://www.geelongadvertiser.com.au/news-sitemap.xml |
Comments