paperzz.com
robots.txt
Robots Exclusion Standard data for paperzz.com
Resource Scan
Scan Details
| Site Domain | paperzz.com |
| Base Domain | paperzz.com |
| Scan Status | Ok |
| Last Scan | 2025-12-18T19:21:27+00:00 |
| Next Scan | 2025-12-25T19:21:27+00:00 |
Last Scan
| Scanned | 2025-12-18T19:21:27+00:00 |
| URL | https://paperzz.com/robots.txt |
| Domain IPs | 104.21.64.150, 172.67.152.32, 2606:4700:3032::6815:4096, 2606:4700:3037::ac43:9820 |
| Response IP | 104.21.64.150 |
| Found | Yes |
| Hash | 550759e2187651ab8157e614ab9a33059b80edeb76d580fb6325dd3c918272dd |
| SimHash | 01449e509530 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /viewer_next/ |
| Disallow | /theme/ |
| Allow | /theme/*/static |
| Disallow | /store/ |
| Disallow | /upload |
| Disallow | /docinfo.xml |
| Disallow | /sendmail.html |
| Disallow | /ask/searchAjax |
| Disallow | /cdn-cgi/ |
| Allow | / |
Other Records
| Field | Value |
|---|---|
| sitemap | https://paperzz.com/sitemap.xml |
Warnings
- `host` is not a known field.