scielo.org
robots.txt

Robots Exclusion Standard data for scielo.org

Resource Scan

Scan Details

Site Domain scielo.org
Base Domain scielo.org
Scan Status Ok
Last Scan2025-11-27T10:32:46+00:00
Next Scan 2025-12-27T10:32:46+00:00

Last Scan

Scanned2025-11-27T10:32:46+00:00
URL https://scielo.org/robots.txt
Domain IPs 189.201.207.11, 200.136.72.41
Response IP 200.136.72.41
Found Yes
Hash ddb1989640ed416298b7c31cb43054cc1b60bc50add636eca29f2642000eee97
SimHash 615183915791

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

bingbot

Rule Path
Disallow

slurp

Rule Path
Disallow

yahoo! slurp

Rule Path
Disallow

ia_archiver

Rule Path
Allow /

archive.org_bot

Rule Path
Allow /

lockss

Rule Path
Allow /

lockss cache

Rule Path
Allow /

doab_check_bot

Rule Path
Allow /

*

Rule Path
Disallow /

Comments

  • Allow only major search spiders
  • Block all other spiders