geteasyprint.com
robots.txt
Robots Exclusion Standard data for geteasyprint.com
Resource Scan
Scan Details
Site Domain | geteasyprint.com |
Base Domain | geteasyprint.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2025-07-17T10:21:07+00:00 |
Next Scan | 2025-10-15T10:21:07+00:00 |
Last Successful Scan
Scanned | 2025-02-25T00:42:54+00:00 |
URL | https://geteasyprint.com/robots.txt |
Domain IPs | 104.21.4.11, 172.67.223.239, 2606:4700:3032::ac43:dfef, 2606:4700:3034::6815:40b |
Response IP | 172.67.223.239 |
Found | Yes |
Hash | 468ce659f2d5dc3d7b3e91c9d66669cee638d2e7d2005bd2f0504dc3bd960a26 |
SimHash | 8514c8441351 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /recipes/ |
Disallow | /manuals/ |
Disallow | /templates/ |
Disallow | /packagetracker/ |
Disallow | /pdf/ |
Disallow | /email/ |
Disallow | /print/ |