liveasif.org
robots.txt

Robots Exclusion Standard data for liveasif.org

Resource Scan

Scan Details

Site Domain liveasif.org
Base Domain liveasif.org
Scan Status Ok
Last Scan 2024-11-10T17:36:15+00:00
Next Scan 2024-11-17T17:36:15+00:00

Last Scan

Scanned 2024-11-10T17:36:15+00:00
URL https://www.liveasif.org/robots.txt
Domain IPs 74.63.248.118
Response IP 74.63.248.118
Found Yes
Hash 971ca78069c32e549ffa7639b9514bd2371efa6eefd9249688fca4aaee013477
SimHash 12577b69efda
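
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Under that assumption, a scanner could detect changes between scans roughly as sketched below. The function names are illustrative and are not taken from the scanning tool.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash stored by the last scan.

    Assumption: the stored hash is a SHA-256 hex digest of the raw bytes.
    """
    return sha256_hex(body) != previous_hash.lower()
```

Whether the real scanner hashes the raw bytes or a normalized form of the file is not stated, so a digest computed this way may not reproduce the value shown in the report.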

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /ajax/
Disallow /scratchpad/
Disallow /jscripts/
Disallow /*?print=yes
Disallow /*?search=
Disallow /*?pn=
Disallow /*?st=
Disallow /*?q=
Disallow /*?q1=
Disallow /*?tr1=
Disallow /*?tr2=
Disallow /*?tr3=
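
Several rules in the * group rely on the * wildcard. The following is a minimal sketch of how such patterns might be evaluated, assuming the commonly used semantics in which * matches any run of characters, a trailing $ anchors the end of the path, and everything else is a literal prefix match. The names rule_to_regex, is_disallowed, and STAR_GROUP_RULES are illustrative only.

```python
import re
from typing import Iterable

# Rules copied from the "*" group in the report above.
STAR_GROUP_RULES = [
    "/cgi-bin/", "/ajax/", "/scratchpad/", "/jscripts/",
    "/*?print=yes", "/*?search=", "/*?pn=", "/*?st=",
    "/*?q=", "/*?q1=", "/*?tr1=", "/*?tr2=", "/*?tr3=",
]


def rule_to_regex(path_pattern: str) -> "re.Pattern[str]":
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces and let each "*" match any characters.
    regex = ".*".join(re.escape(piece) for piece in path_pattern.split("*"))
    if anchored:
        regex += "$"
    return re.compile(regex)


def is_disallowed(url_path: str, rules: Iterable[str]) -> bool:
    """Return True if any rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(url_path) for rule in rules)
```

Under these assumptions, a path such as /page?q=term is blocked, while /page?foo=1&q=term is not, because each query-string rule requires the parameter to follow the ? directly.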

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

purebot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /
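
The named groups above each block one crawler from the whole site, while any other crawler falls back to the * group. A rough way to check this behaviour is with Python's standard urllib.robotparser module. The ROBOTS_TXT text below is a partial reconstruction from this report, not the original file, and ExampleBot is a hypothetical crawler name. Note that urllib.robotparser does not interpret * wildcards inside paths, so only plain prefix rules are included here.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the report (assumption: original casing and
# ordering of the real file may differ).
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /ajax/

User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the given user agent may fetch the given path."""
    return parser.can_fetch(agent, "https://www.liveasif.org" + path)
```

With this setup, allowed("AhrefsBot", "/") would be expected to return False, while allowed("ExampleBot", "/") returns True and allowed("ExampleBot", "/cgi-bin/x") returns False.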

Comments

  • robots.txt file for StudyLight.org
  • last modified October 1, 2024
  • jgarrison@studylight.org