pjmedia.com
robots.txt
Robots Exclusion Standard data for pjmedia.com
Resource Scan
Scan Details
Site Domain | pjmedia.com |
Base Domain | pjmedia.com |
Scan Status | Ok |
Last Scan | 2024-11-14T15:39:30+00:00 |
Next Scan | 2024-11-21T15:39:30+00:00 |
Last Scan
Scanned | 2024-11-14T15:39:30+00:00 |
URL | https://pjmedia.com/robots.txt |
Domain IPs | 104.18.18.43, 104.18.19.43, 2606:4700::6812:122b, 2606:4700::6812:132b |
Response IP | 104.18.19.43 |
Found | Yes |
Hash | 1854f849a0530a72ea903a24c660514f0eb7be185b48d7163e46ba4d3796db25 |
SimHash | e5a5cb00e2b2 |
Groups
*
Rule | Path |
---|---|
Disallow | /wordpress |
Disallow | /reset/ |
Disallow | /instapundit*?*s= |
Disallow | /cdn-cgi/ |
Other Records
Field | Value |
---|---|
sitemap | https://pjmedia.com/sitemaps/sitemapindex-pjmedia.xml |