janleow.com
robots.txt

Robots Exclusion Standard data for janleow.com

Resource Scan

Scan Details

Site Domain janleow.com
Base Domain janleow.com
Scan Status Ok
Last Scan2025-12-01T13:59:31+00:00
Next Scan 2025-12-08T13:59:31+00:00

Last Scan

Scanned2025-12-01T13:59:31+00:00
URL https://janleow.com/robots.txt
Redirect https://www.janleow.com/robots.txt
Redirect Domain www.janleow.com
Redirect Base janleow.com
Domain IPs 2a01:4f8:c012:8427::1, 91.107.211.163
Redirect IPs 2a01:4f8:c012:8427::1, 91.107.211.163
Response IP 91.107.211.163
Found Yes
Hash 2566173af11373a40cd385640c698bd5db735e6afbc10b28e26d7f803004aef3
SimHash ae989919c774

Groups

*

Rule Path
Disallow /archive/
Disallow /cache/
Disallow /support-files/
Disallow /test/
Disallow /ch/
Disallow /CH/
Disallow /favicon.ico
Disallow /LICENSE.txt
Disallow /UPGRADE.txt
Disallow /web/admin.shtml
Disallow /life/favicon.ico
Disallow /life/wp-admin
Disallow /life/wp-includes
Disallow *?replytocom

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap http://www.janleow.com/sitemap_index.xml

Comments

  • $Id: robots.txt,v 1.7.2.1 2007/03/23 18:57:07 drumm Exp $
  • robots.txt
  • WWW.JANLEOW.COM
  • sitemap is to tell search engine robots to find where are the urls
  • sitemap: http://cdn.attracta.com/sitemap/266607.xml.gz
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html
  • Directories
  • Disallow: /imgs/
  • Directories & 301 used for redirects to external sites
  • Files
  • Other sections and installations
  • Wordpress