romanes.com
robots.txt

Robots Exclusion Standard data for romanes.com

Resource Scan

Scan Details

Site Domain romanes.com
Base Domain romanes.com
Scan Status Ok
Last Scan 2024-09-28T21:05:34+00:00
Next Scan 2024-10-05T21:05:34+00:00

Last Scan

Scanned 2024-09-28T21:05:34+00:00
URL https://romanes.com/robots.txt
Domain IPs 46.105.204.23
Response IP 46.105.204.23
Found Yes
Hash 803644e182444efa8db9c0ba9dcaa04c48354cb5a8051345e364a9efd97b226f
SimHash 1a2edd148232

Groups

yandexbot
ezooms

Rule Path
Disallow /

blekkobot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

ccbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

slurp

Rule Path
Disallow /cgi-bin
Disallow /cgibin
Disallow /css/
Disallow /img/
Disallow /html/

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap http://www.romanes.com/sitemap_index
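
The groups listed above can be checked programmatically. The following is a minimal sketch, assuming the records are rewritten as robots.txt text; the original file's exact layout, casing, and comments are not shown in this report, so the `ROBOTS_TXT` string is a reconstruction and not the file as served. It uses Python's standard `urllib.robotparser` (the `site_maps()` call needs Python 3.8 or later), and the test URLs such as `/css/site.css` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups in this report, not the
# verbatim file served at https://romanes.com/robots.txt.
ROBOTS_TXT = """\
User-agent: yandexbot
User-agent: ezooms
Disallow: /

User-agent: blekkobot
Crawl-delay: 3

User-agent: msnbot
Crawl-delay: 3

User-agent: ccbot
Crawl-delay: 3

User-agent: baiduspider
Crawl-delay: 3

User-agent: slurp
Disallow: /cgi-bin
Disallow: /cgibin
Disallow: /css/
Disallow: /img/
Disallow: /html/
Crawl-delay: 3

Sitemap: http://www.romanes.com/sitemap_index
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

# yandexbot and ezooms share one group that disallows every path.
yandex_root = parser.can_fetch("yandexbot", "https://romanes.com/")

# slurp is blocked from the listed directories only (hypothetical URLs).
slurp_css = parser.can_fetch("slurp", "https://romanes.com/css/site.css")
slurp_home = parser.can_fetch("slurp", "https://romanes.com/index.html")

# msnbot has no Disallow rules, only a crawl delay.
msnbot_css = parser.can_fetch("msnbot", "https://romanes.com/css/site.css")
msnbot_delay = parser.crawl_delay("msnbot")

print(yandex_root, slurp_css, slurp_home, msnbot_css, msnbot_delay)
print(parser.site_maps())
```

Because the report lists no `*` group, a user agent that matches none of the groups (for example a hypothetical `examplebot`) is treated as allowed everywhere by this parser, and `crawl_delay` returns `None` for it.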