independent.ie
robots.txt
Robots Exclusion Standard data for independent.ie
Resource Scan
Scan Details
Site Domain | independent.ie |
Base Domain | independent.ie |
Scan Status | Ok |
Last Scan | 2024-11-15T23:38:04+00:00 |
Next Scan | 2024-11-22T23:38:04+00:00 |
Last Scan
Scanned | 2024-11-15T23:38:04+00:00 |
URL | https://independent.ie/robots.txt |
Redirect | https://www.independent.ie/robots.txt |
Redirect Domain | www.independent.ie |
Redirect Base | independent.ie |
Domain IPs | 104.18.30.138, 104.18.31.138, 2606:4700::6812:1e8a, 2606:4700::6812:1f8a |
Redirect IPs | 104.18.30.138, 104.18.31.138, 2606:4700::6812:1e8a, 2606:4700::6812:1f8a |
Response IP | 104.18.31.138 |
Found | Yes |
Hash | 5f3b30ad52f2aa7808a05dd83dbf0d60e25f210e48f2e92426edfc2cb96014f2 |
SimHash | 683897718c75 |
Groups
*
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /qwerty/ |
Disallow | /*.ece$ |
Disallow | /utils/ |
Disallow | /account/ |
Disallow | /LoadTest/ |
Disallow | /api/ |
Disallow | /qa/ |
Disallow | /ad-test |
Disallow | /service-archive |
Disallow | /subscribe-archive |
Disallow | /messagent/ |
Disallow | /extra/messagent/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.independent.ie/sitemap/sitemap_googlenews.xmlââââââ |
sitemap | https://www.independent.ie/sitemap/sitemap_channels.xml |
sitemap | https://www.independent.ie/sitemap/sitemap.xml |
sitemap | https://www.independent.ie/sitemap/sitemap_video.xml |
Comments