manuals.plus
robots.txt
Robots Exclusion Standard data for manuals.plus
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | manuals.plus |
Base Domain | manuals.plus |
Scan Status | Ok |
Last Scan | 2024-11-16T13:48:25+00:00 |
Next Scan | 2024-11-23T13:48:25+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-16T13:48:25+00:00 |
URL | https://manuals.plus/robots.txt |
Domain IPs | 172.66.40.99, 172.66.43.157, 2606:4700:3108::ac42:2863, 2606:4700:3108::ac42:2b9d |
Response IP | 172.66.40.99 |
Found | Yes |
Hash | 1748f345cf300f183335026ddd21bc45f885599d8273a30da66261b739d876f8 |
SimHash | 1850df07ed50 |
Groups
*
Rule | Path |
---|---|
Disallow | /search_gcse |
Disallow | /*-admin/* |
Disallow | /post_* |
Disallow | /category/0/ |
Disallow | /Author/ |
Disallow | /author/ |
Disallow | /*/*/feed |
Disallow | /redirect |
Disallow | /http |
Disallow | /https |
Disallow | /http* |
Disallow | /https* |
Disallow | /*%40*.* |
Disallow | /*%40*.* |
Disallow | /admin.php |
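
The rules above can be checked against concrete paths with a small matcher. The following is a minimal sketch, assuming the common robots.txt conventions (as described in RFC 9309): a rule matches any path that starts with it, `*` matches any run of characters, and a trailing `$` anchors the end of the path. The names `DISALLOW_RULES`, `rule_to_regex`, and `is_disallowed` are hypothetical helpers for illustration, not part of the scan data. Percent-encoding normalization is not handled, and because this group has no Allow rules, longest-match precedence is not modeled.

```python
import re

# Disallow rules copied verbatim from the "*" group in the table above.
DISALLOW_RULES = [
    "/search_gcse",
    "/*-admin/*",
    "/post_*",
    "/category/0/",
    "/Author/",
    "/author/",
    "/*/*/feed",
    "/redirect",
    "/http",
    "/https",
    "/http*",
    "/https*",
    "/*%40*.*",
    "/*%40*.*",
    "/admin.php",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end;
    every other character is matched literally and case-sensitively.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any rule matches the path from its first character."""
    # re.match anchors at the start, which gives prefix-match semantics.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for sample in ["/search_gcse?q=tv", "/wp-admin/options.php",
                   "/brand/model/feed", "/samsung/tv-manual"]:
        print(sample, is_disallowed(sample))
```

Under these prefix-match assumptions, `/http` alone already covers every path matched by `/https`, `/http*`, and `/https*`, so those three entries appear to be redundant rather than harmful.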