communitycollegereview.com
robots.txt

Robots Exclusion Standard data for communitycollegereview.com

Resource Scan

Scan Details

Site Domain communitycollegereview.com
Base Domain communitycollegereview.com
Scan Status Ok
Last Scan 2024-11-16T18:54:35+00:00
Next Scan 2024-11-23T18:54:35+00:00

Last Scan

Scanned 2024-11-16T18:54:35+00:00
URL https://communitycollegereview.com/robots.txt
Redirect https://www.communitycollegereview.com/robots.txt
Redirect Domain www.communitycollegereview.com
Redirect Base communitycollegereview.com
Domain IPs 23.29.112.242
Redirect IPs 23.29.112.242
Response IP 23.29.112.242
Found Yes
Hash 84095f21a4ae1b31f8ba28bf0ee55666d2939c11f58dddec9539febdff92aa3d
SimHash 280d8e14ad91

Groups

*

Rule Path
Allow /include/*/*.jpg$
Allow /include/*/*.gif$
Allow /include/*/*.png*$
Allow /include/*/*.webp$
Allow /include/*/*.css*$
Allow /include/*/*.js*$
Allow /*/html_fragments/*.php*$
Disallow /include/*/
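
The rules above combine a broad Disallow on /include/*/ with narrower Allow patterns for static assets. The following is a minimal sketch of how a crawler might resolve them, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names RULES, pattern_to_regex and is_allowed are hypothetical, and real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("allow", "/include/*/*.jpg$"),
    ("allow", "/include/*/*.gif$"),
    ("allow", "/include/*/*.png*$"),
    ("allow", "/include/*/*.webp$"),
    ("allow", "/include/*/*.css*$"),
    ("allow", "/include/*/*.js*$"),
    ("allow", "/*/html_fragments/*.php*$"),
    ("disallow", "/include/*/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


print(is_allowed("/include/img/logo.jpg"))    # asset under /include/
print(is_allowed("/include/lib/config.php"))  # non-asset under /include/
print(is_allowed("/alabama"))                 # path outside /include/
```

Under these assumptions, the image path is allowed because its Allow pattern is longer than the Disallow pattern, while the PHP file under /include/ matches only the Disallow rule. Note that a pattern such as /include/*/*.js*$ also matches longer extensions like .json, since the trailing * accepts any suffix.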

Other Records

Field Value
sitemap https://www.communitycollegereview.com/sitemap_https/sitemap-index.xml
sitemap https://www.communitycollegereview.com/sitemap_https/imagesitemap-imageindex.xml
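
As an illustration of how the sitemap records might be read programmatically, the sketch below rebuilds a robots.txt body from the fields in this scan and passes it to Python's standard urllib.robotparser. The ROBOTS_TXT text is a reconstruction for illustration only; the live file's exact ordering, spacing and comments are not shown in this report.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan data above (an assumption, not the live file).
ROBOTS_TXT = """\
User-agent: *
Allow: /include/*/*.jpg$
Allow: /include/*/*.gif$
Allow: /include/*/*.png*$
Allow: /include/*/*.webp$
Allow: /include/*/*.css*$
Allow: /include/*/*.js*$
Allow: /*/html_fragments/*.php*$
Disallow: /include/*/

Sitemap: https://www.communitycollegereview.com/sitemap_https/sitemap-index.xml
Sitemap: https://www.communitycollegereview.com/sitemap_https/imagesitemap-imageindex.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns the Sitemap URLs, or None if there are none.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```

The parser is used here only to extract the sitemap records. urllib.robotparser does not interpret the * and $ wildcards, so its can_fetch() results should not be relied on for the Allow and Disallow rules in this file.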