www.student.thinkib.net
robots.txt

Robots Exclusion Standard data for www.student.thinkib.net

Resource Scan

Scan Details

Site Domain www.student.thinkib.net
Base Domain thinkib.net
Scan Status Ok
Last Scan2024-10-11T01:37:47+00:00
Next Scan 2024-11-10T01:37:47+00:00

Last Scan

Scanned2024-10-11T01:37:47+00:00
URL https://www.student.thinkib.net/robots.txt
Redirect https://student.thinkib.net/robots.txt
Redirect Domain student.thinkib.net
Redirect Base thinkib.net
Domain IPs 54.229.218.51
Redirect IPs 104.26.0.38, 104.26.1.38, 172.67.70.39, 2606:4700:20::681a:126, 2606:4700:20::681a:26, 2606:4700:20::ac43:4627
Response IP 172.67.70.39
Found Yes
Hash 6202ceac94ba6e436b870f678b444e394e2a88be21f0fc3f937c4029c74d4861
SimHash f01559d44216

Groups

dataforseobot
petalbot

Rule Path
Disallow /

msnbot
bingbot
yandex
googlebot
googlebot-news
googlebot-image
mediapartners-google

Rule Path
Disallow /css/
Disallow /js/
Disallow /pages/
Disallow /assets/
Allow /

Other Records

Field Value
crawl-delay 20.0

baidu
baiduspider
baiduspider-video
baiduspider-image
rogerbot
ezooms
w00t
zmeu

Rule Path
Disallow /

*

Rule Path
Disallow /css/
Disallow /js/
Disallow /pages/
Disallow /assets/
Allow /

Other Records

Field Value
crawl-delay 20.0

Other Records

Field Value
sitemap https://thinkib.net/sitemap.xml
sitemap https://thinkib.net/sitemap.xml