publinews.gt
robots.txt
Robots Exclusion Standard data for publinews.gt
Resource Scan
Scan Details
Site Domain | publinews.gt |
Base Domain | publinews.gt |
Scan Status | Ok |
Last Scan | 2024-11-11T21:38:33+00:00 |
Next Scan | 2024-11-18T21:38:33+00:00 |
Last Scan
Scanned | 2024-11-11T21:38:33+00:00 |
URL | https://publinews.gt/robots.txt |
Redirect | https://www.publinews.gt:443/robots.txt |
Redirect Domain | www.publinews.gt |
Redirect Base | publinews.gt |
Domain IPs | 75.2.5.251, 99.83.208.40 |
Redirect IPs | 125.56.219.40, 2600:1413:b000:13::b857:c189, 2600:1413:b000:13::b857:c18a, 96.17.72.74 |
Response IP | 23.52.171.161 |
Found | Yes |
Hash | 251fe16531754f18aaddce53e71dd38bb3bff6a1e04855fb95735fc424b3b0b7 |
SimHash | f21900e08f56 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /300x250_1/ |
Disallow | /300x250_2/ |
Disallow | /300x250_3/ |
Disallow | /300x250_4/ |
Disallow | /layer/ |
Disallow | /Skyscrapper/ |
Disallow | /bottom/ |
Disallow | /footer/ |
Disallow | /skin/ |
Disallow | /top/ |
Disallow | /customer-preview/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.publinews.gt/arc/outboundfeeds/sitemap-index?outputType=xml |
sitemap | https://www.publinews.gt/arc/outboundfeeds/sitemap-news-index?outputType=xml |