docpastor.com
robots.txt
Robots Exclusion Standard data for docpastor.com
Resource Scan
Scan Details
Site Domain | docpastor.com |
Base Domain | docpastor.com |
Scan Status | Ok |
Last Scan | 2024-11-13T06:53:23+00:00 |
Next Scan | 2024-11-20T06:53:23+00:00 |
Last Scan
Scanned | 2024-11-13T06:53:23+00:00 |
URL | https://docpastor.com/robots.txt |
Domain IPs | 104.21.9.137, 172.67.160.44, 2606:4700:3032::6815:989, 2606:4700:3035::ac43:a02c |
Response IP | 104.21.9.137 |
Found | Yes |
Hash | 81239fc32c0e080e71e62c62d6662a5ee03d87cad6440a11854c2312bcf9db50 |
SimHash | fb5e721ac9e7 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Allow | /*/feed/ |
Disallow | /*/trackback/$ |
Disallow | /*.sql$ |
Disallow | /*.tgz$ |
Disallow | /*.gz$ |
Disallow | /*.tar$ |
Disallow | /*.svn$ |
Allow | /ads.txt |
Warnings
- 2 invalid lines.
- `https` is not a known field.