healing-arts.org
robots.txt
Robots Exclusion Standard data for healing-arts.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | healing-arts.org |
Base Domain | healing-arts.org |
Scan Status | Ok |
Last Scan | 2024-10-24T21:36:29+00:00 |
Next Scan | 2024-11-23T21:36:29+00:00 |
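The gap between the two timestamps above can be checked directly. This is a minimal sketch using only the values shown in the scan details; it says nothing about how the scanner itself schedules rescans.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset included).
last_scan = datetime.fromisoformat("2024-10-24T21:36:29+00:00")
next_scan = datetime.fromisoformat("2024-11-23T21:36:29+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```

For this record, the next scan falls exactly 30 days after the last one.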
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-24T21:36:29+00:00 |
URL | http://healing-arts.org/robots.txt |
Domain IPs | 216.92.201.137 |
Response IP | 216.92.201.137 |
Found | Yes |
Hash | 5912b2cdc6c037c218b756db1c6c84185f69eb92dd1ee85493bf69a0f8842cce |
SimHash | f08cdbdb1bc6 |
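The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below assumes SHA-256 over the raw response body; `robots_digest` and `matches_reported` are hypothetical helper names, and the comparison may not match if the scanner hashes something else (for example, a normalized copy of the file).

```python
import hashlib

# Hash value copied from the Last Scan table.
REPORTED_HASH = "5912b2cdc6c037c218b756db1c6c84185f69eb92dd1ee85493bf69a0f8842cce"

def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def matches_reported(body: bytes) -> bool:
    """Compare a freshly fetched body against the digest in the report."""
    return robots_digest(body) == REPORTED_HASH
```

A caller would pass the bytes fetched from the URL in the table to `matches_reported` to see whether the file has changed since the scan.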
Groups
curl/7.10.x (i386-redhat-linux-gnu) libcurl/7.10.x openssl/0.9.7a ipv6 zlib/1.1.4
Rule | Path |
---|---|
Disallow | / |
full_breadth_crawler (zoidberg.ucr.edu; linux i686; http://ivia.ucr.edu/user_agents.html)
Rule | Path |
---|---|
Disallow | / |
nextgensearchbot 1 (for information visit http://www.eliyon.com/nextgensearchbot)
Rule | Path |
---|---|
Disallow | / |
shim-crawler(mozilla-compatible;+http://www.logos.ic.iu-tokyo.ac.jp/crawler/;+crawl@logos.ic.iu-tokyo.ac.jp)
Rule | Path |
---|---|
Disallow | / |
yacy (www.yacy.net; v20040602; i386 linux 2.4.31; java 1.5.0_05; europe/de) yacy.net
Rule | Path |
---|---|
Disallow | / |
yacy (www.yacy.net; v20040602; i386 linux 2.4.31; java 1.4.2 03; europe/en) yacy.net
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /cgi |
Disallow | /cgi-bin |
Disallow | /cgipro |
Disallow | /form.cgi |
Disallow | /counter |
Disallow | /Flash |
Disallow | /images |
Disallow | /imanager |
Disallow | /112750 |
Disallow | /land |
Disallow | /logwebsdata |
Disallow | /Miller-deCoux-Art |
Disallow | /Journal |
Disallow | /look |
Disallow | /sandrabrown |
Disallow | /test |
Disallow | /urchin |
Disallow | /farm.htm |
Disallow | /search |
Disallow | /search.htm |
Disallow | /messages |
Disallow | /dbasics/wwwboard/board/messages |
Disallow | /dbasics/wwwboard/general/messages |
Disallow | /dbasics/wwwboard/ADHD/messages |
Disallow | /dbasics/wwwboard/cp/messages |
Disallow | /dbasics/wwwboard/spiritual/messages |
Disallow | /dbasics/wwwboard/error.txt |
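To see how the `*` group applies to concrete URLs, the rules can be written back out as robots.txt lines and fed to Python's standard `urllib.robotparser`. This is a sketch under stated assumptions: only a handful of the rules above are included, the `ExampleBot` agent name and the request paths are hypothetical, and the result reflects how Python's parser interprets the rules rather than how any particular crawler does.

```python
from urllib.robotparser import RobotFileParser

# A subset of the "*" group from the table above, as robots.txt lines.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /cgi",
    "Disallow: /Flash",
    "Disallow: /images",
    "Disallow: /search",
    "Disallow: /farm.htm",
]

def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse from memory, no network access
    return parser.can_fetch(agent, "http://healing-arts.org" + path)

# Hypothetical request paths, chosen to show prefix and case behavior.
for path in ["/index.html", "/cgi-bin/form.cgi", "/Flash/intro.swf",
             "/flash/intro.swf", "/searching"]:
    print(path, is_allowed(path))
```

Each Disallow path acts as a case-sensitive prefix here: `/cgi` also covers `/cgi-bin/...`, `/search` also covers `/searching`, and `/Flash` does not cover `/flash/...`.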
Warnings
- 6 invalid lines.