newspaperdirect.com
robots.txt
Robots Exclusion Standard data for newspaperdirect.com
Resource Scan
Scan Details
Site Domain | newspaperdirect.com |
Base Domain | newspaperdirect.com |
Scan Status | Ok |
Last Scan | 2024-09-19T13:09:51+00:00 |
Next Scan | 2024-10-19T13:09:51+00:00 |
Last Scan
Scanned | 2024-09-19T13:09:51+00:00 |
URL | https://newspaperdirect.com/robots.txt |
Domain IPs | 207.34.140.6 |
Response IP | 207.34.140.6 |
Found | Yes |
Hash | 7b59398db297c958cbb4a8202090dfb0880f07ee3d32e9e2a5a8d007e81cc3cf |
SimHash | 8e48e8a724f6 |
Groups
*
Rule | Path |
---|---|
Disallow | / |
Disallow | /test |
Disallow | /rb |
Disallow | /oem |
Disallow | /librarypd-demo |
Disallow | /images |
Disallow | /hospitality |
Disallow | /Flunch_images |
Warnings
- 40 invalid lines.
- `xþàycsq+ûsñ²p]rð` is not a known field.
- `|` is not a known field.