ravelry.com
robots.txt

Robots Exclusion Standard data for ravelry.com

Resource Scan

Scan Details

Site Domain	ravelry.com
Base Domain	ravelry.com
Scan Status	Ok
Last Scan	2024-10-29T21:04:46+00:00
Next Scan	2024-11-28T21:04:46+00:00

Last Scan

Scanned	2024-10-29T21:04:46+00:00
URL	https://ravelry.com/robots.txt
Redirect	https://www.ravelry.com/robots.txt
Redirect Domain	www.ravelry.com
Redirect Base	ravelry.com
Domain IPs	192.34.84.3
Redirect IPs	192.34.84.3
Response IP	192.34.84.3
Found	Yes
Hash	dc1a48e9628d82b79677076c59c9adfbf6df846b0f69c14eb88b90f6e57655e2
SimHash	97d859f9cef7

Groups

*

Rule	Path
Disallow	/carts/
Disallow	/dl/
Disallow	/deliveries/
Disallow	/purchase/
Disallow	/receipts/
Disallow	*/topics/*.rss$
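
The last rule in this group uses the two common pattern extensions: "*" for any run of characters and a trailing "$" to anchor the end of the path. The following is a minimal sketch of how such patterns could be evaluated, assuming the usual wildcard semantics. The helper names and the sample paths are hypothetical and not taken from the scan, and the sketch ignores Allow rules and longest-match precedence because this group contains only Disallow lines.

```python
import re

# Disallow patterns from the "*" group listed above.
DISALLOW = [
    "/carts/",
    "/dl/",
    "/deliveries/",
    "/purchase/",
    "/receipts/",
    "*/topics/*.rss$",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    "*" matches any run of characters and a trailing "$" anchors the end
    of the path. Everything else is matched literally from the start.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" where "*" stood.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Sample paths are made up for illustration only.
    for sample in ("/carts/42", "/discuss/some-group/topics/123.rss",
                   "/discuss/some-group/topics/123"):
        print(sample, is_disallowed(sample))
```

Under these assumptions a plain prefix such as /carts/ blocks everything beneath it, while the wildcard rule blocks only paths that contain /topics/ and end in .rss.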

sitecheck.internetseer.com

Rule	Path
Disallow	/

zealbot

Rule	Path
Disallow	/

msiecrawler

Rule	Path
Disallow	/

sitesnagger

Rule	Path
Disallow	/

webstripper

Rule	Path
Disallow	/

webcopier

Rule	Path
Disallow	/

fetch

Rule	Path
Disallow	/

offline explorer

Rule	Path
Disallow	/

teleport

Rule	Path
Disallow	/

teleportpro

Rule	Path
Disallow	/

webzip

Rule	Path
Disallow	/

linko

Rule	Path
Disallow	/

httrack

Rule	Path
Disallow	/

microsoft.url.control

Rule	Path
Disallow	/

xenu

Rule	Path
Disallow	/

larbin

Rule	Path
Disallow	/

libwww

Rule	Path
Disallow	/

zyborg

Rule	Path
Disallow	/

download ninja

Rule	Path
Disallow	/

wget

Rule	Path
Disallow	/

grub-client

Rule	Path
Disallow	/

k2spider

Rule	Path
Disallow	/

npbot

Rule	Path
Disallow	/

webreaper

Rule	Path
Disallow	/

Comments

Below entries are from Wikipedia's robots.txt :)
recursive wget
The 'grub' distributed client has been *very* poorly behaved.
Doesn't follow robots.txt anyway, but...
Hits many times per second, not acceptable
http://www.nameprotect.com/botinfo.html
A capture bot, downloads gazillions of pages with no public benefit
http://www.webreaper.net/
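
To illustrate how the per-agent groups above take effect, here is a small sketch using Python's standard urllib.robotparser. The robots.txt text is a partial reconstruction from the groups reported in this scan, not the file as served, and the crawler name ExampleBot and the /patterns/ path are hypothetical. The standard-library parser does plain prefix matching only, so the wildcard rule from the "*" group is left out here.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the scan's groups; not the file as served.
ROBOTS_TXT = """\
User-agent: *
Disallow: /carts/
Disallow: /dl/
Disallow: /deliveries/
Disallow: /purchase/
Disallow: /receipts/

User-agent: wget
Disallow: /

User-agent: httrack
Disallow: /
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.ravelry.com"

# An agent with its own group and "Disallow: /" is blocked everywhere.
wget_allowed = parser.can_fetch("Wget/1.21", BASE + "/patterns/")

# An agent without its own group falls back to the "*" group.
other_allowed = parser.can_fetch("ExampleBot", BASE + "/patterns/")
other_cart = parser.can_fetch("ExampleBot", BASE + "/carts/1")

print(wget_allowed, other_allowed, other_cart)
```

The agents listed with "Disallow: /" are shut out of the whole site, while any other crawler is only kept away from the cart, download, delivery, purchase and receipt paths.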
