communitycomm.com
robots.txt
Robots Exclusion Standard data for communitycomm.com
Resource Scan
Scan Details
Site Domain | communitycomm.com |
Base Domain | communitycomm.com |
Scan Status | Ok |
Last Scan | 2024-09-26T13:26:32+00:00 |
Next Scan | 2024-10-26T13:26:32+00:00 |
Last Scan
Scanned | 2024-09-26T13:26:32+00:00 |
URL | https://communitycomm.com/robots.txt |
Domain IPs | 104.21.94.183, 172.67.139.122, 2606:4700:3030::ac43:8b7a, 2606:4700:3034::6815:5eb7 |
Response IP | 172.67.139.122 |
Found | Yes |
Hash | 15c269a818e250e23a5b79f83f4a306da375b5e56a1d407b01ff81df4d345466 |
SimHash | 28060a34f7d1 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /email_lib/ |
Disallow | /_email_lib/ |
Disallow | /.well-known/ |
Disallow | /js/ |
Disallow | /css/ |
Disallow | /photos/ |
Disallow | /gallery/ |
Disallow | /portfolio/ |
Disallow | /resize/ |
Disallow | /slide/ |
Disallow | /salesportal/ |
Disallow | /email_form_test/ |
Disallow | /liquor/ |
Disallow | /logos/ |
Disallow | /photo/ |
Disallow | /leads/ |
Other Records
Field | Value |
---|---|
sitemap | http://www.communitycomm.com/sitemap.xml |