thebooksinorder.com
robots.txt
Robots Exclusion Standard data for thebooksinorder.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thebooksinorder.com |
Base Domain | thebooksinorder.com |
Scan Status | Ok |
Last Scan | 2024-09-27T07:15:14+00:00 |
Next Scan | 2024-10-04T07:15:14+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-27T07:15:14+00:00 |
URL | https://thebooksinorder.com/robots.txt |
Domain IPs | 104.21.62.182, 172.67.138.3, 2606:4700:3037::6815:3eb6, 2606:4700:3037::ac43:8a03 |
Response IP | 172.67.138.3 |
Found | Yes |
Hash | 33e40f73cb19b8910ac1f94a99806156efd12f9e9851052a1b2be6cd6327704f |
SimHash | 4b085540e90b |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /linkout/ |
Disallow | /recommended/ |
Disallow | /comments/feed/ |
Disallow | /trackback/ |
Disallow | /index.php |
Disallow | /xmlrpc.php |
Disallow | /search |
Disallow | /page |
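The rules above can be checked against sample paths with a short script. This is a minimal sketch, not the scanner's own method: it uses Python's standard `urllib.robotparser`, the robots.txt text is reconstructed from the table (the file's literal text, including the 2 lines reported as invalid, is not shown in this report), and the `ExampleBot` agent name and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table; this is an assumption about the
# file's layout, not a copy of the scanned robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /linkout/
Disallow: /recommended/
Disallow: /comments/feed/
Disallow: /trackback/
Disallow: /index.php
Disallow: /xmlrpc.php
Disallow: /search
Disallow: /page
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the (hypothetical) agent may fetch the given path."""
    return parser.can_fetch(agent, "https://thebooksinorder.com" + path)


parser = build_parser(ROBOTS_TXT)

# Sample paths are illustrative only; they are not taken from the site.
for path in ("/", "/wp-admin/", "/page/2/", "/pages/", "/authors/"):
    verdict = "allowed" if is_allowed(parser, path) else "disallowed"
    print(f"{path}: {verdict}")
```

Because `/search` and `/page` carry no trailing slash, this parser's prefix matching also covers any path that merely begins with those strings, such as `/pages/`.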
Other Records
Field | Value |
---|---|
sitemap | https://www.thebooksinorder.com/sitemap_index.xml |
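The sitemap record points at the `www` host, while the scanned Site Domain has no `www` prefix. The sketch below, assuming Python 3.8 or later and the standard `urllib.robotparser`, reads the sitemap record and flags that host difference. The `Sitemap:` line is reconstructed from the table, and the variable names are illustrative.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# "Site Domain" value from the scan details.
SCANNED_HOST = "thebooksinorder.com"

# Reconstructed from the Other Records table (assumed literal form).
SITEMAP_RECORD = "Sitemap: https://www.thebooksinorder.com/sitemap_index.xml"

sitemap_parser = RobotFileParser()
sitemap_parser.parse([SITEMAP_RECORD])

# site_maps() returns None when no sitemap records were found.
sitemaps = sitemap_parser.site_maps() or []

# Collect sitemap hosts that differ from the scanned host.
sitemap_hosts = [urlparse(url).hostname for url in sitemaps]
host_mismatch = [host for host in sitemap_hosts if host != SCANNED_HOST]

print("sitemaps:", sitemaps)
print("hosts differing from scanned host:", host_mismatch)
```

A differing host is not necessarily an error; it only means a crawler following the sitemap will request it from `www.thebooksinorder.com` rather than from the scanned host.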
Warnings
- 2 invalid lines.