wdeportes.com
robots.txt
Robots Exclusion Standard data for wdeportes.com
Resource Scan
Scan Details
Site Domain | wdeportes.com |
Base Domain | wdeportes.com |
Scan Status | Ok |
Last Scan | 2024-11-14T10:13:41+00:00 |
Next Scan | 2024-11-21T10:13:41+00:00 |
Last Scan
Scanned | 2024-11-14T10:13:41+00:00 |
URL | https://wdeportes.com/robots.txt |
Redirect | https://www.wdeportes.com:443/robots.txt |
Redirect Domain | www.wdeportes.com |
Redirect Base | wdeportes.com |
Domain IPs | 75.2.111.238, 99.83.204.47 |
Redirect IPs | 23.209.46.73, 23.209.46.74, 2600:1413:b000:13::b857:c192, 2600:1413:b000:13::b857:c19a |
Response IP | 23.45.207.165 |
Found | Yes |
Hash | 9d0543d42d3a037fc2c588792e38ed0edee455d0f61702ff057985610672c5ec |
SimHash | 39019f288893 |
Groups
*
Rule | Path |
---|---|
Disallow | /comentario/ |
Disallow | /buscar/ |
Disallow | /images/ |
Disallow | /amp/nota.aspx |
Disallow | /feed.aspx |
Disallow | /dmz/ |
Disallow | /i/ |
Disallow | /pf/api/ |
Disallow | /pf/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.wdeportes.com/arc/outboundfeeds/sitemap-index?outputType=xml |
sitemap | https://www.wdeportes.com/arc/outboundfeeds/news-sitemap-index?outputType=xml |