pdfen.com
robots.txt
Robots Exclusion Standard data for pdfen.com
Resource Scan
Scan Details
| Site Domain | pdfen.com |
| Base Domain | pdfen.com |
| Scan Status | Ok |
| Last Scan | 2025-12-06T04:24:30+00:00 |
| Next Scan | 2025-12-13T04:24:30+00:00 |
Last Scan
| Scanned | 2025-12-06T04:24:30+00:00 |
| URL | https://pdfen.com/robots.txt |
| Redirect | https://www.pdfen.com/robots.txt |
| Redirect Domain | www.pdfen.com |
| Redirect Base | pdfen.com |
| Domain IPs | 87.253.157.20 |
| Redirect IPs | 87.253.157.20 |
| Response IP | 87.253.157.20 |
| Found | Yes |
| Hash | 94811bd67a024ff372ffec87214db1ed548699af95c64d1f10960bf574b5fa41 |
| SimHash | a31f155947e5 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /administrator/ |
| Disallow | /bin/ |
| Disallow | /cache/ |
| Disallow | /cli/ |
| Disallow | /components/ |
| Disallow | /includes/ |
| Disallow | /installation/ |
| Disallow | /language/ |
| Disallow | /layouts/ |
| Disallow | /libraries/ |
| Disallow | /logs/ |
| Disallow | /modules/ |
| Disallow | /plugins/ |
| Disallow | /tmp/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.pdfen.com/sitemap.xml |
Comments