cis-countryman.com
robots.txt
Robots Exclusion Standard data for cis-countryman.com
Resource Scan
Scan Details
Site Domain | cis-countryman.com |
Base Domain | cis-countryman.com |
Scan Status | Ok |
Last Scan | 2024-09-30T19:50:58+00:00 |
Next Scan | 2024-10-07T19:50:58+00:00 |
Last Scan
Scanned | 2024-09-30T19:50:58+00:00 |
URL | https://cis-countryman.com/robots.txt |
Redirect | https://countrymans.info/robots.txt |
Redirect Domain | countrymans.info |
Redirect Base | countrymans.info |
Domain IPs | 104.21.79.83, 172.67.169.103, 2606:4700:3035::ac43:a967, 2606:4700:3036::6815:4f53 |
Redirect IPs | 104.21.13.187, 172.67.133.15, 2606:4700:3030::ac43:850f, 2606:4700:3031::6815:dbb |
Response IP | 104.21.13.187 |
Found | Yes |
Hash | 05e02b60ca9946c3ae1162bf5287f1142affaaf7c2f27a1ff594fc9e4c53ddb5 |
SimHash | 4a5f4ce17338 |
Groups
*
Rule | Path |
---|---|
Disallow | /out.php |
Disallow | /peoples/search |
Disallow | /homonyms/search |
Disallow | /schools/search |
Disallow | /universities/search |
Disallow | /military/search |
Disallow | /companies/search |
Disallow | /dating/search |
Disallow | /pages |
Disallow | /goto/?sn= |
Disallow | *?action=delete |
Warnings
- `clean-param` is not a known field.