nzdaisuki.com
robots.txt

Robots Exclusion Standard data for nzdaisuki.com

Resource Scan

Scan Details

Site Domain nzdaisuki.com
Base Domain nzdaisuki.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-12T06:46:21+00:00
Next Scan 2024-11-26T06:46:21+00:00

Last Successful Scan

Scanned2024-10-27T04:24:09+00:00
URL https://nzdaisuki.com/robots.txt
Domain IPs 183.90.182.164
Response IP 183.90.182.164
Found Yes
Hash 2455348e2922443bb0433e491b330bc2907673546bc67e211995c55b3530dbf9
SimHash a23c15784dd5

Groups

*

Rule Path
Disallow /administrator/
Disallow /bbs/category.php
Disallow /bbs/detail.php
Disallow /bbs/detail2.php
Disallow /bbs/detail_id.php
Disallow /bbs/latest.php
Disallow /bbs/search.php
Disallow /bin/
Disallow /cache/
Disallow /cgi-bin/news/news.cgi
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /news/news.php
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
Disallow /wp-admin.php
Disallow /wp-signuo.php
Disallow /wp-content/
Disallow /shell*/
Disallow /desk*/
Disallow /mysql*/
Disallow /php*/
Disallow /.git/
Disallow /.env

icc-crawler

Rule Path
Disallow /ad_banner/
Disallow /bbs/
Disallow /cf_event/
Disallow /cf_flat/
Disallow /cf_job/
Disallow /cf_mate/
Disallow /cf_trade/
Disallow /column_insurance/
Disallow /column_kenchiku/
Disallow /column_medicare/
Disallow /column_michael/
Disallow /investment/
Disallow /links/
Disallow /living/info/
Disallow /living/study/
Disallow /migrant/
Disallow /nature/
Disallow /restaurant/
Disallow /ryugaku/
Disallow /tour/
Disallow /yellowpage/

googlebot

Rule Path
Allow /components/
Allow /modules/

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml
  • 2015-11-25
  • added /bbs/*.php
  • added /cgi-bin/news/news.cgi
  • added /news/news.php
  • 2015-11-25
  • 2020-11-02
  • added /wp-admin.php
  • added /wp-signuo.php
  • added /wp-content/
  • 2020-11-02
  • 2020-11-20
  • added /shell*/
  • added /desk*/
  • added /mysql*/
  • added /php*/
  • added /.git/
  • added /.env
  • 2020-11-02
  • 2021-01-29
  • 2021-01-29
  • 2015-07-29
  • 2015-07-29