Robots Exclusion Standard data for clark.com
Resource Scan
Scan Details
Site Domain | clark.com |
Base Domain | clark.com |
Scan Status | Ok |
Last Scan | 2024-11-14T17:40:57+00:00 |
Next Scan | 2024-11-21T17:40:57+00:00 |
Last Scan
Scanned | 2024-11-14T17:40:57+00:00 |
URL | https://clark.com/robots.txt |
Domain IPs | 104.24.88.18, 104.24.89.18, 172.67.67.160, 2606:4700:20::6818:5812, 2606:4700:20::6818:5912, 2606:4700:20::ac43:43a0 |
Response IP | 104.24.89.18 |
Found | Yes |
Hash | 2aebd28f36d97e5e6ea37b332808d2597e4eaa20552f6dac42cdf9ec28804572 |
SimHash | ba0568a28bf0 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /search*q%3D* |
Disallow | /wp-admin/ |
Disallow | /wp-login* |
Disallow | /page/* |
Disallow | /syndication* |
Disallow | /tag/* |
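
The Disallow paths above use `*` as a wildcard. As a rough illustration of how a crawler might evaluate them, here is a minimal Python sketch. It assumes the common convention that `*` matches any run of characters, that a trailing `$` anchors the end of the path, and that rules are otherwise prefix matches. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and not part of the scan data, and the sketch compares raw strings without normalizing percent-encoding.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW = [
    "/search*q%3D*",
    "/wp-admin/",
    "/wp-login*",
    "/page/*",
    "/syndication*",
    "/tag/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing '$'
    anchors the end of the path; everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    return any(rule.match(path) for rule in RULES)


if __name__ == "__main__":
    for sample in ["/wp-login.php", "/tag/money/", "/page/2/", "/about/"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, `/wp-admin/` is blocked but `/wp-admin` (no trailing slash) is not, because that rule has no wildcard and is a plain prefix match. A path with a literal `=` in the query string would not match the `/search*q%3D*` rule here, since the sketch does not decode `%3D`.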