Robots Exclusion Standard data for charleston.edu

Resource Scan

Scan Details

Site Domain charleston.edu
Base Domain charleston.edu
Scan Status Ok
Last Scan 2024-11-12T04:06:30+00:00
Next Scan 2024-12-12T04:06:30+00:00

Last Scan

Scanned 2024-11-12T04:06:30+00:00
URL https://charleston.edu/robots.txt
Domain IPs 66.228.54.157
Response IP 45.56.113.205
Found Yes
Hash 0fd5957e2f854c84fede6005acb3f5a421f3ae099ede2887e5d2ec52ee2d06e6
SimHash 0f10395b6111
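
The Hash value above is 64 hexadecimal digits, which matches the length of a SHA-256 digest. The scan does not name the algorithm, so the following is only a sketch under the assumption that the hash is SHA-256 over the raw response body. The function names `content_hash` and `has_changed` are hypothetical and exist only for illustration.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash stored by an earlier scan."""
    return content_hash(body) != previous_hash.lower()


if __name__ == "__main__":
    sample = b"User-agent: *\nDisallow: /news/\n"
    stored = content_hash(sample)
    print(len(stored))                       # digest length in hex characters
    print(has_changed(sample, stored))       # same bytes, so no change
    print(has_changed(sample + b"#", stored))  # one extra byte changes the digest
```

An exact hash only signals that some byte changed. The shorter SimHash value is presumably a similarity fingerprint for judging how much changed, but its computation is not described in the scan and is not reproduced here.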

Groups

gptbot

Rule Path
Allow /about/
Allow /academics/
Allow /cost-aid/
Allow /student-life/
Allow /admission/

blexbot

Rule Path
Disallow /

*

Rule Path
Disallow /twilio-a2p-assets/
Disallow /static/
Disallow /news/
Disallow /events/
Disallow /_testing/
Disallow /test.php
Disallow /test.jpg
Disallow /test/txt
Disallow /view-source.php
Disallow /test-music/
Disallow /index-test.php
Disallow /index-bak.php
Disallow /index-multiple-img.php
Disallow /_barkley-test.php
Disallow /_test-news-2.php
Disallow /index-st-manual.php
Disallow /*?print=*
Disallow index2.php
Disallow *.xml
Disallow /test
Disallow /mycharleston
Disallow /inc
Disallow /uploads
Disallow /giving-old
Disallow /generaldocuments
Disallow /phishing.php
Disallow /down.php
Disallow /index2.php
Disallow /crossdomain.xml
Disallow /robots.txt
Disallow /images
Disallow /install
Disallow /webprojects
Disallow /XpressConnect
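
The groups above can be checked programmatically. The sketch below rewrites a subset of them in robots.txt syntax (the exact layout of the live file is an assumption) and queries it with Python's standard-library `urllib.robotparser`. The helper names `build_parser` and `is_allowed` and the agent name `examplebot` are hypothetical. The standard-library parser matches paths by simple prefix, so wildcard rules such as `/*?print=*` and `*.xml` are left out of the sample.

```python
from urllib.robotparser import RobotFileParser

# Subset of the scanned groups, rewritten in robots.txt syntax (assumed layout).
ROBOTS_TXT = """\
User-agent: gptbot
Allow: /about/
Allow: /academics/
Allow: /cost-aid/
Allow: /student-life/
Allow: /admission/

User-agent: blexbot
Disallow: /

User-agent: *
Disallow: /news/
Disallow: /events/
Disallow: /uploads
Disallow: /images
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Ask whether `agent` may fetch `path` on charleston.edu."""
    return parser.can_fetch(agent, "https://charleston.edu" + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    checks = [
        ("blexbot", "/about/"),
        ("gptbot", "/about/"),
        ("examplebot", "/news/today"),
        ("examplebot", "/admission/"),
    ]
    for agent, path in checks:
        print(agent, path, is_allowed(parser, agent, path))
```

A crawler that matches a named group is evaluated against that group only, so `gptbot` is not bound by the `*` rules in this sketch. Since its group contains only Allow lines, paths outside that list are also permitted by default.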