pdf-magazines-archive.com
robots.txt
Robots Exclusion Standard data for pdf-magazines-archive.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pdf-magazines-archive.com |
Base Domain | pdf-magazines-archive.com |
Scan Status | Ok |
Last Scan | 2025-09-17T12:58:20+00:00 |
Next Scan | 2025-10-17T12:58:20+00:00 |
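The two timestamps above imply a rescan interval, which can be checked with a few lines of Python. This is only a sketch: the report does not state the scanner's schedule, so the 30-day figure is derived from these two values and nothing else.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2025-09-17T12:58:20+00:00")
next_scan = datetime.fromisoformat("2025-10-17T12:58:20+00:00")

# Difference between the scheduled and the completed scan.
interval = next_scan - last_scan
print(interval.days)  # number of whole days between the two scans
```

Both values carry an explicit `+00:00` offset, so the subtraction is done on timezone-aware datetimes and is not affected by the local timezone of the machine running it.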
Last Scan
Field | Value |
---|---|
Scanned | 2025-09-17T12:58:20+00:00 |
URL | https://pdf-magazines-archive.com/robots.txt |
Domain IPs | 104.21.5.18, 172.67.132.190, 2606:4700:3030::ac43:84be, 2606:4700:3037::6815:512 |
Response IP | 104.21.5.18 |
Found | Yes |
Hash | 4f8627947a4d441d529f947f1426f8dbb2e28946e5e0489386e320a4a210581e |
SimHash | fd09b4624133 |
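The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm or say exactly which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names `sha256_hex` and `fetch_and_hash` are hypothetical, and the SimHash value is not covered here.

```python
import hashlib
import urllib.request

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

def fetch_and_hash(url: str) -> str:
    """Download a resource and hash its raw body (assumed hashing input)."""
    with urllib.request.urlopen(url, timeout=10) as response:
        return sha256_hex(response.read())

# Offline check of the helper: the digest is always 64 hex characters.
print(len(sha256_hex(b"User-agent: *\n")))
```

Calling `fetch_and_hash("https://pdf-magazines-archive.com/robots.txt")` and comparing the result with the Hash row would confirm or refute the SHA-256 assumption, provided the file has not changed since the scan.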
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /engine/go.php |
Disallow | /engine/download.php |
Disallow | /user/ |
Disallow | /newposts/ |
Disallow | /statistics.html |
Disallow | /*subaction%3Duserinfo |
Disallow | /*subaction%3Dnewposts |
Disallow | /*do%3Dlastcomments |
Disallow | /*do%3Dfeedback |
Disallow | /*do%3Dregister |
Disallow | /*do%3Dlostpassword |
Disallow | /*do%3Daddnews |
Disallow | /*do%3Dstats |
Disallow | /*do%3Dpm |
Disallow | /*do%3Dsearch |
Disallow | /dmca.html |
Disallow | /contacts.html |
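The rules above mix plain path prefixes with `*` wildcards. A minimal sketch of how a crawler might test a path against them follows, assuming the common convention that `*` matches any run of characters and that a rule matches from the start of the path. The function names are hypothetical. The patterns are compared literally as listed, so `%3D` (the percent-encoded form of `=`) is not decoded before matching.

```python
import re

# Disallow paths copied from the table above (group "*").
DISALLOW = [
    "/engine/go.php", "/engine/download.php", "/user/", "/newposts/",
    "/statistics.html", "/*subaction%3Duserinfo", "/*subaction%3Dnewposts",
    "/*do%3Dlastcomments", "/*do%3Dfeedback", "/*do%3Dregister",
    "/*do%3Dlostpassword", "/*do%3Daddnews", "/*do%3Dstats", "/*do%3Dpm",
    "/*do%3Dsearch", "/dmca.html", "/contacts.html",
]

def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")  # a trailing '$' pins the end of the path
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)

print(is_disallowed("/user/alice/"))
print(is_disallowed("/index.php?do%3Dsearch"))
print(is_disallowed("/magazines/2025/"))
```

Because this group has no Allow rules, a single matching Disallow pattern is enough to decide; a fuller implementation would also need longest-match precedence between Allow and Disallow.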
Warnings
- `host` is not a known field.
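A warning like the one above can be produced by checking each field name in the file against a list of recognized fields. The sketch below assumes the recognized set is `user-agent`, `allow`, `disallow`, and `sitemap`; the scanner's actual list is not given in this report. The sample file content and the function name are hypothetical, since the raw robots.txt is not reproduced here.

```python
from typing import List

# Assumed set of recognized fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

def find_unknown_fields(robots_txt: str) -> List[str]:
    """Return one warning per distinct unrecognized field name."""
    warnings: List[str] = []
    seen = set()
    for raw_line in robots_txt.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and padding
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.add(field)
            warnings.append(f"`{field}` is not a known field.")
    return warnings

# Hypothetical sample input, not the site's actual file.
sample = """User-agent: *
Disallow: /user/
Host: example.com
"""
print(find_unknown_fields(sample))
```

Field names are lowercased before comparison, so `Host`, `HOST`, and `host` would all yield the same single warning.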