publinews.gt
robots.txt

Robots Exclusion Standard data for publinews.gt

Resource Scan

Scan Details

Site Domain publinews.gt
Base Domain publinews.gt
Scan Status Ok
Last Scan2024-11-11T21:38:33+00:00
Next Scan 2024-11-18T21:38:33+00:00

Last Scan

Scanned2024-11-11T21:38:33+00:00
URL https://publinews.gt/robots.txt
Redirect https://www.publinews.gt:443/robots.txt
Redirect Domain www.publinews.gt
Redirect Base publinews.gt
Domain IPs 75.2.5.251, 99.83.208.40
Redirect IPs 125.56.219.40, 2600:1413:b000:13::b857:c189, 2600:1413:b000:13::b857:c18a, 96.17.72.74
Response IP 23.52.171.161
Found Yes
Hash 251fe16531754f18aaddce53e71dd38bb3bff6a1e04855fb95735fc424b3b0b7
SimHash f21900e08f56

Groups

*

Rule Path
Allow /
Disallow /300x250_1/
Disallow /300x250_2/
Disallow /300x250_3/
Disallow /300x250_4/
Disallow /layer/
Disallow /Skyscrapper/
Disallow /bottom/
Disallow /footer/
Disallow /skin/
Disallow /top/
Disallow /customer-preview/

grapeshot

Rule Path
Disallow

petalbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.publinews.gt/arc/outboundfeeds/sitemap-index?outputType=xml
sitemap https://www.publinews.gt/arc/outboundfeeds/sitemap-news-index?outputType=xml