manualzz.com
robots.txt
Robots Exclusion Standard data for manualzz.com
Resource Scan
Scan Details
Site Domain | manualzz.com |
Base Domain | manualzz.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-19T14:28:44+00:00 |
Next Scan | 2024-09-26T14:28:44+00:00 |
Last Successful Scan
Scanned | 2024-09-11T13:28:50+00:00 |
URL | https://manualzz.com/robots.txt |
Domain IPs | 104.26.0.78, 104.26.1.78, 172.67.72.99, 2606:4700:20::681a:14e, 2606:4700:20::681a:4e, 2606:4700:20::ac43:4863 |
Response IP | 172.67.72.99 |
Found | Yes |
Hash | 05a99491b35acb31ce8c2804483e8112c891c8f11ee5eada47959cc28dde4444 |
SimHash | 00089e9085a0 |
Groups
*
Rule | Path |
---|---|
Disallow | /viewer_next/ |
Disallow | /theme/ |
Allow | /theme/*/static |
Disallow | /store/ |
Disallow | /upload |
Disallow | /download/ |
Disallow | /docinfo.xml |
Disallow | /sendmail.html |
Disallow | /ask/searchAjax |
Disallow | /cdn-cgi/ |
Disallow | /search/ |
Disallow | /documents/ |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://manualzz.com/sitemap.xml |
Warnings
- `host` is not a known field.