clc.net
robots.txt

Robots Exclusion Standard data for clc.net

Resource Scan

Scan Details

Site Domain clc.net
Base Domain clc.net
Scan Status Ok
Last Scan2024-10-28T19:03:37+00:00
Next Scan 2024-11-27T19:03:37+00:00

Last Scan

Scanned2024-10-28T19:03:37+00:00
URL http://clc.net/robots.txt
Domain IPs 205.178.189.131
Response IP 205.178.189.131
Found Yes
Hash 0ba83dffd49aeef1b356e01d91c0b48661485b6c768b90e637428488a730cb4f
SimHash 205c71747c55

Groups

*

Rule Path
Disallow /googleresults.jsp
Disallow /results.jsp
Disallow /results-b.jsp
Disallow /ns-results.jsp
Disallow /w-results.jsp
Disallow /s-results.jsp
Disallow /results-travel.jsp
Disallow /results-medical.jsp
Disallow /tc-results.jsp
Disallow /m-results.jsp
Disallow /results-monster.jsp
Disallow /emailAdCampaign.jsp
Disallow /domainSearch.jsp