pdf4test.com
robots.txt
Robots Exclusion Standard data for pdf4test.com
Resource Scan
Scan Details
Site Domain | pdf4test.com |
Base Domain | pdf4test.com |
Scan Status | Ok |
Last Scan | 2025-09-17T16:08:14+00:00 |
Next Scan | 2025-09-24T16:08:14+00:00 |
Last Scan
Scanned | 2025-09-17T16:08:14+00:00 |
URL | https://pdf4test.com/robots.txt |
Redirect | http://www.pdf4test.com/robots.txt |
Redirect Domain | www.pdf4test.com |
Redirect Base | pdf4test.com |
Domain IPs | 104.21.40.90, 172.67.183.70, 2606:4700:3031::ac43:b746, 2606:4700:3033::6815:285a |
Redirect IPs | 104.21.40.90, 172.67.183.70, 2606:4700:3031::ac43:b746, 2606:4700:3033::6815:285a |
Response IP | 104.21.40.90 |
Found | Yes |
Hash | 43203e8112baa79e7bf8aab700e8b67af9e7e52766f39c53ef4911d528cce9fd |
SimHash | f39e5c53e8db |
Groups
*
Rule | Path |
---|---|
Disallow | /act.php |
Disallow | /search.php |
Disallow | /reg.php |
Disallow | /*.ashx |
Disallow | /*.aspx |
Disallow | /TestEngine/ |
Disallow | /demo/ |
Disallow | /pay* |
Disallow | /page_* |
Disallow | /livechat.php |
Disallow | /cart.php |
Disallow | /checkout.php |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | http://www.pdf4test.com/sitemap.xml |
Warnings
- 2 invalid lines.