romanes.org
robots.txt

Robots Exclusion Standard data for romanes.org

Resource Scan

Scan Details

Site Domain romanes.org
Base Domain romanes.org
Scan Status Ok
Last Scan2024-09-29T21:56:45+00:00
Next Scan 2024-10-06T21:56:45+00:00

Last Scan

Scanned2024-09-29T21:56:45+00:00
URL http://romanes.org/robots.txt
Redirect http://romanes.com/robots.txt
Redirect Domain romanes.com
Redirect Base romanes.com
Domain IPs 217.70.184.38
Redirect IPs 46.105.204.23
Response IP 46.105.204.23
Found Yes
Hash 803644e182444efa8db9c0ba9dcaa04c48354cb5a8051345e364a9efd97b226f
SimHash 1a2edd148232

Groups

yandexbot
ezooms

Rule Path
Disallow /

blekkobot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

ccbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

slurp

Rule Path
Disallow /cgi-bin
Disallow /cgibin
Disallow /css/
Disallow /img/
Disallow /html/

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap http://www.romanes.com/sitemap_index