books.scielo.org
robots.txt

Robots Exclusion Standard data for books.scielo.org

Resource Scan

Scan Details

Site Domain books.scielo.org
Base Domain scielo.org
Scan Status Ok
Last Scan2025-11-15T13:26:22+00:00
Next Scan 2025-12-15T13:26:22+00:00

Last Scan

Scanned2025-11-15T13:26:22+00:00
URL https://books.scielo.org/robots.txt
Domain IPs 2400:52e0:1500::1179:1, 84.17.38.251
Response IP 138.199.46.66
Found Yes
Hash ddb1989640ed416298b7c31cb43054cc1b60bc50add636eca29f2642000eee97
SimHash 615183915791

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

bingbot

Rule Path
Disallow

slurp

Rule Path
Disallow

yahoo! slurp

Rule Path
Disallow

ia_archiver

Rule Path
Allow /

archive.org_bot

Rule Path
Allow /

lockss

Rule Path
Allow /

lockss cache

Rule Path
Allow /

doab_check_bot

Rule Path
Allow /

*

Rule Path
Disallow /

Comments

  • Allow only major search spiders
  • Block all other spiders