cdn-02.independent.ie
robots.txt
Robots Exclusion Standard data for cdn-02.independent.ie
Resource Scan
Scan Details
Site Domain | cdn-02.independent.ie |
Base Domain | independent.ie |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-10-23T16:26:06+00:00 |
Next Scan | 2025-01-21T16:26:06+00:00 |
Last Successful Scan
Scanned | 2023-07-08T16:10:35+00:00 |
URL | https://cdn-02.independent.ie/robots.txt |
Redirect | https://www.independent.ie/robots.txt |
Redirect Domain | www.independent.ie |
Redirect Base | independent.ie |
Domain IPs | 13.33.33.10, 13.33.33.5, 13.33.33.50, 13.33.33.9, 2600:9000:229f:1e00:12:80c4:7500:93a1, 2600:9000:229f:5800:12:80c4:7500:93a1, 2600:9000:229f:6000:12:80c4:7500:93a1, 2600:9000:229f:8800:12:80c4:7500:93a1, 2600:9000:229f:8c00:12:80c4:7500:93a1, 2600:9000:229f:9400:12:80c4:7500:93a1, 2600:9000:229f:d000:12:80c4:7500:93a1, 2600:9000:229f:f800:12:80c4:7500:93a1 |
Redirect IPs | 104.18.30.137, 104.18.31.137, 2606:4700::6812:1e89, 2606:4700::6812:1f89 |
Response IP | 104.18.30.137 |
Found | Yes |
Hash | cac435d7fcfb99fa4a02d777cd422b58b9b69c4ddb79d3bb07f46f17c1709143 |
SimHash | 403cc6e88cf1 |
Groups
*
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /qwerty/ |
Disallow | /*.ece$ |
Disallow | /utils/ |
Disallow | /account/ |
Disallow | /LoadTest/ |
Disallow | /api/ |
Disallow | /qa/ |
Disallow | /ad-test |
Disallow | /service-archive |
Disallow | /subscribe-archive |
Disallow | /messagent/ |
Disallow | /extra/messagent/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.independent.ie/sitemap/sitemap_googlenews.xmlââââââ |
sitemap | https://www.independent.ie/sitemap/sitemap_channels.xml |
sitemap | https://www.independent.ie/sitemap/sitemap.xml |
sitemap | https://www.independent.ie/sitemap/sitemap_video.xml |
Comments