www2.latercera.com
robots.txt

Robots Exclusion Standard data for www2.latercera.com

Resource Scan

Scan Details

Site Domain www2.latercera.com
Base Domain latercera.com
Scan Status Ok
Last Scan2024-04-26T04:21:01+00:00
Next Scan 2024-05-26T04:21:01+00:00

Last Scan

Scanned2024-04-26T04:21:01+00:00
URL https://www2.latercera.com/robots.txt
Redirect https://www.latercera.com/robots.txt
Redirect Domain www.latercera.com
Redirect Base latercera.com
Domain IPs 13.33.88.121, 13.33.88.53, 13.33.88.55, 13.33.88.82, 2600:9000:223b:1600:16:6e8c:8180:93a1, 2600:9000:223b:200:16:6e8c:8180:93a1, 2600:9000:223b:3400:16:6e8c:8180:93a1, 2600:9000:223b:6a00:16:6e8c:8180:93a1, 2600:9000:223b:8200:16:6e8c:8180:93a1, 2600:9000:223b:a200:16:6e8c:8180:93a1, 2600:9000:223b:ea00:16:6e8c:8180:93a1, 2600:9000:223b:f800:16:6e8c:8180:93a1
Redirect IPs 23.209.46.12, 23.209.46.21, 2600:1413:b000:14::b857:c144, 2600:1413:b000:14::b857:c155
Response IP 42.99.140.161
Found Yes
Hash fa53aa273eea0b883d8e92915816ef46d44ca9860711f9ce8fe2b71ed30a9d3a
SimHash 5818c800cfd3

Groups

petalbot

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /arcio/news-sitemap/

Other Records

Field Value
sitemap https://www.latercera.com/arc/outboundfeeds/sitemap-index?outputType=xml
sitemap https://www.latercera.com/arc/outboundfeeds/news-sitemap-index?outputType=xml
sitemap https://www.latercera.com/arc/outboundfeeds/sitemap?outputType=xml

Comments

  • Huawei