studylight.org
robots.txt

Robots Exclusion Standard data for studylight.org

Resource Scan

Scan Details

Site Domain studylight.org
Base Domain studylight.org
Scan Status Ok
Last Scan2024-11-12T12:03:57+00:00
Next Scan 2024-11-19T12:03:57+00:00

Last Scan

Scanned2024-11-12T12:03:57+00:00
URL https://studylight.org/robots.txt
Redirect https://www.studylight.org/robots.txt
Redirect Domain www.studylight.org
Redirect Base studylight.org
Domain IPs 74.63.248.118
Redirect IPs 104.26.0.234, 104.26.1.234, 172.67.68.251, 2606:4700:20::681a:1ea, 2606:4700:20::681a:ea, 2606:4700:20::ac43:44fb
Response IP 172.67.68.251
Found Yes
Hash 971ca78069c32e549ffa7639b9514bd2371efa6eefd9249688fca4aaee013477
SimHash 12577b69efda

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /ajax/
Disallow /scratchpad/
Disallow /jscripts/
Disallow /*?print=yes
Disallow /*?search=
Disallow /*?pn=
Disallow /*?st=
Disallow /*?q=
Disallow /*?q1=
Disallow /*?tr1=
Disallow /*?tr2=
Disallow /*?tr3=

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

purebot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

Comments

  • robots.txt file for StudyLight.org
  • last modified October 1, 2024
  • jgarrison@studylight.org