collegeconfidential.com
robots.txt

Robots Exclusion Standard data for collegeconfidential.com

Resource Scan

Scan Details

Site Domain collegeconfidential.com
Base Domain collegeconfidential.com
Scan Status Ok
Last Scan2024-11-14T14:02:26+00:00
Next Scan 2024-11-21T14:02:26+00:00

Last Scan

Scanned2024-11-14T14:02:26+00:00
URL https://collegeconfidential.com/robots.txt
Redirect https://www.collegeconfidential.com/robots.txt
Redirect Domain www.collegeconfidential.com
Redirect Base collegeconfidential.com
Domain IPs 34.199.111.173, 52.45.139.48
Redirect IPs 34.199.111.173, 52.45.139.48
Response IP 34.199.111.173
Found Yes
Hash 6572ad39430a5ba08bc729defa2467bb32f03f7e9ab7ac61dfd898575922ced4
SimHash 8b36628664b5

Groups

*

Rule Path
Disallow /admin/
Disallow /api/
Disallow /cms/
Disallow /?csrfmiddlewaretoken=
Disallow /search_results.htm?
Disallow /wp-login.php?
Disallow /dean/
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /vibe/
Disallow /admit/
Disallow /tags/
Disallow /tag/
Disallow /topics/
Disallow /*.html
Disallow /*.htm
Disallow /%5Dother
Disallow /index-2/
Disallow /editorial/

Other Records

Field Value
sitemap https://www.collegeconfidential.com/sitemap.xml
sitemap https://www.collegeconfidential.com/sitemap-main.xml
sitemap https://www.collegeconfidential.com/sitemap-colleges.xml
sitemap https://www.collegeconfidential.com/sitemap-articles.xml

Warnings

  • 1 invalid line.