newspim.com
robots.txt
Robots Exclusion Standard data for newspim.com
Resource Scan
Scan Details
Site Domain | newspim.com |
Base Domain | newspim.com |
Scan Status | Ok |
Last Scan | 2024-11-09T14:33:02+00:00 |
Next Scan | 2024-11-16T14:33:02+00:00 |
Last Scan
Scanned | 2024-11-09T14:33:02+00:00 |
URL | https://newspim.com/robots.txt |
Domain IPs | 175.117.146.132 |
Response IP | 175.117.146.132 |
Found | Yes |
Hash | 79fb00d241e6a0a162ccab8bcb47f60f0aff369d1c6d56e08b47bfd1dad9b4ed |
SimHash | ec4d4d2cce92 |
Groups
*
Rule | Path |
---|---|
Disallow | /event/ |
Disallow | /quicknews/ |
Disallow | /survey/ |
Disallow | /persondb/ |
Disallow | /search/ |
Disallow | /anda/ |
Disallow | /company/ |
Disallow | /customer/ |
Disallow | /section/corpnews |
Disallow | /ir/ |
Disallow | /forum/ |
Disallow | /etc/ |
Disallow | /newsletter/ |
Disallow | /news/preview/print |
Other Records
Field | Value |
---|---|
sitemap | https://www.newspim.com/sitemap.xml |
sitemap | https://www.newspim.com/section.xml |