research.org
robots.txt

Robots Exclusion Standard data for research.org

Resource Scan

Scan Details

Site Domain research.org
Base Domain research.org
Scan Status Ok
Last Scan2025-09-10T14:52:59+00:00
Next Scan 2025-09-17T14:52:59+00:00

Last Scan

Scanned2025-09-10T14:52:59+00:00
URL http://research.org/robots.txt
Domain IPs 198.55.101.21
Response IP 198.55.101.21
Found Yes
Hash 97968c7172d590ce72f696d89e49afd4ca1c5e8e8e1004516ad3c48344601c36
SimHash 0040d7d34511

Groups

architextspider

Rule Path
Disallow

baiduspider

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

msnbot

Rule Path
Disallow

msnbot-media

Rule Path
Disallow

msnbot-news

Rule Path
Disallow

msnbot-products

Rule Path
Disallow

msnptc

Rule Path
Disallow

naverbot

Rule Path
Disallow

robozilla

Rule Path
Disallow

scooter

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

turnitinbot

Rule Path
Disallow

yandex

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

yahooysmcm

Rule Path
Disallow

ia_archiver

Rule Path
Disallow /

*

Rule Path
Disallow /

Comments

  • robots.txt for research.org !