ncstar.com
robots.txt

Robots Exclusion Standard data for ncstar.com

Resource Scan

Scan Details

Site Domain ncstar.com
Base Domain ncstar.com
Scan Status Ok
Last Scan2024-09-21T13:17:33+00:00
Next Scan 2024-10-21T13:17:33+00:00

Last Scan

Scanned2024-09-21T13:17:33+00:00
URL https://ncstar.com/robots.txt
Redirect https://www.ncstar.com/robots.txt
Redirect Domain www.ncstar.com
Redirect Base ncstar.com
Domain IPs 35.227.135.234
Redirect IPs 35.227.135.234
Response IP 35.227.135.234
Found Yes
Hash 81a484a9135bec56b8df8ef1ecfaffff3ccda5de7908efe4ca26cb90a8d9108f
SimHash 6eb69b03c6f3

Groups

yandex
moget
ichiro
naverbot
yeti
baiduspider
baiduspider-video
baiduspider-image
sogou spider
youdaobot
*

Rule Path
Disallow
Allow /ajax

Other Records

Field Value
crawl-delay 5

ninjabot

Rule Path
Disallow /
Disallow /cron.php
Disallow /install.php
Disallow /update.php
Disallow /xmlrpc.php
Disallow /admin/
Disallow /search/
Disallow /user/register/
Disallow /user/login/
Disallow /user/logout/

*

Rule Path
Disallow /*?*filter%5B
Disallow /*?*filter%5B

Other Records

Field Value
crawl-delay 10

Comments

  • XM Symphony robots.txt file
  • The following spiders are considered aggressive and /or non-desirable.
  • The following rules allow all other user-agents, but with a crawl delay of 10 sec
  • NOTE by default this is set to NOT allow indexing of the site
  • when site goes live: Change "Disallow: /" to "Disallow:"
  • Files
  • Paths (clean URLs)

Warnings

  • 1 invalid line.