Robots Exclusion Standard data for communitycomm.com

Resource Scan

Scan Details

Site Domain communitycomm.com
Base Domain communitycomm.com
Scan Status Ok
Last Scan 2024-09-26T13:26:32+00:00
Next Scan 2024-10-26T13:26:32+00:00

Last Scan

Scanned 2024-09-26T13:26:32+00:00
URL https://communitycomm.com/robots.txt
Domain IPs 104.21.94.183, 172.67.139.122, 2606:4700:3030::ac43:8b7a, 2606:4700:3034::6815:5eb7
Response IP 172.67.139.122
Found Yes
Hash 15c269a818e250e23a5b79f83f4a306da375b5e56a1d407b01ff81df4d345466
SimHash 28060a34f7d1
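
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. As a minimal sketch under that assumption, the helper below (sha256_hex is a hypothetical name, not part of the scan tool) shows how a digest of a fetched robots.txt body could be computed for comparison between scans. It may not reproduce the value listed here if the scanner normalizes the content first.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex.

    Assumption: the scanner hashes the unmodified response body.
    """
    return hashlib.sha256(body).hexdigest()


# Two bodies that differ by a single rule produce different digests,
# which is how a changed robots.txt could be detected between scans.
old_body = b"User-agent: *\nDisallow: /leads/\n"
new_body = b"User-agent: *\nDisallow: /leads/\nDisallow: /photo/\n"
print(sha256_hex(old_body) != sha256_hex(new_body))  # True
```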

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /email_lib/
Disallow /_email_lib/
Disallow /.well-known/
Disallow /js/
Disallow /css/
Disallow /photos/
Disallow /gallery/
Disallow /portfolio/
Disallow /resize/
Disallow /slide/
Disallow /salesportal/
Disallow /email_form_test/
Disallow /liquor/
Disallow /logos/
Disallow /photo/
Disallow /leads/

ninjabot

Rule Path
Allow /

mediapartners-google*

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /
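
The groups listed above can be checked with Python's standard urllib.robotparser module. This is a sketch only: the ROBOTS_TXT string is a reconstruction from the rule tables in this report, so the order, spacing, and capitalization of the live file are assumptions, and "SomeCrawler" is a made-up user agent standing in for any crawler that has no group of its own. Matching behavior also differs between parsers, so results from other crawlers may not be identical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the groups in this report.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /email_lib/
Disallow: /_email_lib/
Disallow: /.well-known/
Disallow: /js/
Disallow: /css/
Disallow: /photos/
Disallow: /gallery/
Disallow: /portfolio/
Disallow: /resize/
Disallow: /slide/
Disallow: /salesportal/
Disallow: /email_form_test/
Disallow: /liquor/
Disallow: /logos/
Disallow: /photo/
Disallow: /leads/

User-agent: ninjabot
Allow: /

User-agent: mediapartners-google*
Allow: /

User-agent: adsbot-google
Allow: /

User-agent: googlebot-mobile
Allow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content from a string, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://communitycomm.com"

# A crawler with no group of its own falls back to the "*" group.
print(parser.can_fetch("SomeCrawler", BASE + "/leads/report.html"))  # False
print(parser.can_fetch("SomeCrawler", BASE + "/"))                   # True

# Named groups carry a single "Allow: /" rule, so nothing is blocked for them.
print(parser.can_fetch("ninjabot", BASE + "/leads/report.html"))     # True
print(parser.can_fetch("adsbot-google", BASE + "/photos/a.jpg"))     # True
```

A crawler that matches a named group uses only that group's rules, which is why the Disallow entries under "*" do not apply to ninjabot or adsbot-google in this sketch.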

Other Records

Field Value
sitemap http://www.communitycomm.com/sitemap.xml