communitycollegereview.com
robots.txt
Robots Exclusion Standard data for communitycollegereview.com
Resource Scan
Scan Details
Site Domain | communitycollegereview.com |
Base Domain | communitycollegereview.com |
Scan Status | Ok |
Last Scan | 2024-11-16T18:54:35+00:00 |
Next Scan | 2024-11-23T18:54:35+00:00 |
Last Scan
Scanned | 2024-11-16T18:54:35+00:00 |
URL | https://communitycollegereview.com/robots.txt |
Redirect | https://www.communitycollegereview.com/robots.txt |
Redirect Domain | www.communitycollegereview.com |
Redirect Base | communitycollegereview.com |
Domain IPs | 23.29.112.242 |
Redirect IPs | 23.29.112.242 |
Response IP | 23.29.112.242 |
Found | Yes |
Hash | 84095f21a4ae1b31f8ba28bf0ee55666d2939c11f58dddec9539febdff92aa3d |
SimHash | 280d8e14ad91 |
Groups
*
Rule | Path |
---|---|
Allow | /include/*/*.jpg$ |
Allow | /include/*/*.gif$ |
Allow | /include/*/*.png*$ |
Allow | /include/*/*.webp$ |
Allow | /include/*/*.css*$ |
Allow | /include/*/*.js*$ |
Allow | /*/html_fragments/*.php*$ |
Disallow | /include/*/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.communitycollegereview.com/sitemap_https/sitemap-index.xml |
sitemap | https://www.communitycollegereview.com/sitemap_https/imagesitemap-imageindex.xml |