collegesimply.com
robots.txt

Robots Exclusion Standard data for collegesimply.com

Resource Scan

Scan Details

Site Domain collegesimply.com
Base Domain collegesimply.com
Scan Status Ok
Last Scan2024-05-26T19:25:57+00:00
Next Scan 2024-06-02T19:25:57+00:00

Last Scan

Scanned2024-05-26T19:25:57+00:00
URL https://collegesimply.com/robots.txt
Redirect https://www.collegesimply.com/robots.txt
Redirect Domain www.collegesimply.com
Redirect Base collegesimply.com
Domain IPs 104.26.0.197, 104.26.1.197, 172.67.72.139, 2606:4700:20::681a:1c5, 2606:4700:20::681a:c5, 2606:4700:20::ac43:488b
Redirect IPs 104.26.0.197, 104.26.1.197, 172.67.72.139, 2606:4700:20::681a:1c5, 2606:4700:20::681a:c5, 2606:4700:20::ac43:488b
Response IP 172.67.72.139
Found Yes
Hash 907c3eb032a610d46b700e9e1fb4777f9b4cd0866079cc9e91c63df882bdba1c
SimHash 7d0fca428a11

Groups

*

Rule Path
Disallow /*sort*

mj12bot

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

ntentbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

yandexbot
yandex

Rule Path
Disallow /

bleriot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

adbeat_bot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

accompanybot

Rule Path
Disallow /