novotec-edm.com
robots.txt

Robots Exclusion Standard data for novotec-edm.com

Resource Scan

Scan Details

Site Domain novotec-edm.com
Base Domain novotec-edm.com
Scan Status Ok
Last Scan 2024-09-29T21:29:15+00:00
Next Scan 2024-10-29T21:29:15+00:00

Last Scan

Scanned 2024-09-29T21:29:15+00:00
URL https://novotec-edm.com/robots.txt
Redirect https://www.novotec-edm.com/robots.txt
Redirect Domain www.novotec-edm.com
Redirect Base novotec-edm.com
Domain IPs 104.26.14.140, 104.26.15.140, 172.67.74.94, 2606:4700:20::681a:e8c, 2606:4700:20::681a:f8c, 2606:4700:20::ac43:4a5e
Redirect IPs 104.26.14.140, 104.26.15.140, 172.67.74.94, 2606:4700:20::681a:e8c, 2606:4700:20::681a:f8c, 2606:4700:20::ac43:4a5e
Response IP 172.67.74.94
Found Yes
Hash 31c87e9dc659d08c85a6e50a1fd0f020e7a3e45a2d3f5b747c0fa11b191b2a49
SimHash aa19f85143f1

Groups

*

Rule Path
Disallow

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

leguideimgserver

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

ingrid

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

jyxobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

catchbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

zeus 32297 webster pro v2.9 win32

Rule Path
Disallow /

amznkassocbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

wbot

Rule Path
Disallow /

jobdiggerspider

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

yahoo! slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

rogerbot

Rule Path Comment
Allow /*?p= -
Allow /index.php/blog/ -
Allow /catalog/seo_sitemap/category/ -
Allow /media/catalog/ -
Disallow /404/ -
Disallow /app/ -
Disallow /cgi-bin/ -
Disallow /downloader/ -
Disallow /errors/ -
Disallow /includes/ -
Disallow /js/ -
Disallow /lib/ -
Disallow /magento/ -
Disallow /media/captcha/ -
Disallow /media/css_secure/ -
Disallow /media/customer/ -
Disallow /media/dhl/ -
Disallow /media/downloadable/ -
Disallow /media/import/ -
Disallow /media/pdf/ -
Disallow /media/sales/ -
Disallow /media/tmp/ -
Disallow /media/wysiwyg/ -
Disallow /media/xmlconnect/ -
Disallow /pkginfo/ -
Disallow /report/ -
Disallow /scripts/ -
Disallow /shell/ -
Disallow /stats/ -
Disallow /var/ -
Disallow /index.php/ -
Disallow /catalog/product_compare/ -
Disallow /catalog/category/view/ -
Disallow /catalog/product/view/ -
Disallow /catalog/product/gallery/ -
Disallow /catalogsearch/ -
Disallow /checkout/ -
Disallow /control/ -
Disallow /customer/ -
Disallow /customize/ -
Disallow /newsletter/ -
Disallow /poll/ -
Disallow /review/ -
Disallow /sendfriend/ -
Disallow /tag/ -
Disallow /wishlist/ -
Disallow /cron.php -
Disallow /cron.sh -
Disallow /error_log -
Disallow /install.php -
Disallow /LICENSE.html -
Disallow /LICENSE.txt -
Disallow /LICENSE_AFL.txt -
Disallow /STATUS.txt -
Disallow /get.php Magento 1.5+
Disallow /*.js$ -
Disallow /*.css$ -
Disallow /*.php$ -
Disallow /*?SID= -
Disallow /rss* -
Disallow /*PHPSESSID -
Disallow /*/filter/* -
Disallow /catalogsearch/ -

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.novotec-edm.com/sitemap/worldwide/sitemap.xml
sitemap https://www.novotec-edm.com/sitemap/germany/sitemap.xml
sitemap https://www.novotec-edm.com/sitemap/usa/sitemap.xml
sitemap https://www.novotec-edm.com/sitemap/russia/sitemap.xml
sitemap https://www.novotec-edm.com/sitemap/china/sitemap.xml
sitemap https://www.novotec-edm.com/sitemap/france/sitemap.xml
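
The groups and records listed above can be checked programmatically. The following is a minimal sketch, not the scanner's own method: it feeds a small excerpt to Python's standard-library urllib.robotparser and queries it. The excerpt text is an assumption rebuilt from the report (the raw robots.txt is not reproduced here), and the names ROBOTS_EXCERPT, BASE and build_parser are hypothetical. Wildcard rules such as /*.js$ are left out on purpose, since this parser matches rules by plain path prefix in file order. site_maps() needs Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt rebuilt from the report above; the raw file may differ.
ROBOTS_EXCERPT = """\
User-agent: ccbot
Disallow: /

User-agent: msnbot
Crawl-delay: 20

User-agent: rogerbot
Allow: /media/catalog/
Disallow: /media/customer/
Disallow: /checkout/
Crawl-delay: 5

Sitemap: https://www.novotec-edm.com/sitemap/worldwide/sitemap.xml
"""

# Assumed base URL, taken from the redirect target in the scan details.
BASE = "https://www.novotec-edm.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)

if __name__ == "__main__":
    checks = [
        ("ccbot", "/"),
        ("msnbot", "/"),
        ("rogerbot", "/media/catalog/item.jpg"),
        ("rogerbot", "/media/customer/file.pdf"),
        ("rogerbot", "/checkout/cart"),
    ]
    for agent, path in checks:
        allowed = parser.can_fetch(agent, BASE + path)
        print(f"{agent:10} {path:28} allowed={allowed}")

    # Crawl-delay values are reported per matching group.
    print("msnbot delay:", parser.crawl_delay("msnbot"))
    print("rogerbot delay:", parser.crawl_delay("rogerbot"))

    # Sitemap records are global and not tied to any group.
    print("sitemaps:", parser.site_maps())
```

A group with no rules, such as msnbot, allows every path, which matches the "No rules defined. All paths allowed." entries in the report.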

Comments

  • Website Sitemap
  • Bots
  • Allowable Index
  • Mind that Allow is not an official standard
  • Allow: /catalogsearch/result/
  • Directories
  • Disallow: /media/
  • Disallow: /media/catalog/
  • Disallow: /media/css/
  • Disallow: /media/js/
  • Disallow: /skin/
  • Paths (clean URLs)
  • Files
  • Paths (no clean URLs)
  • Filters and Search

Warnings

  • 2 invalid lines.