machinetoolhelp.com
robots.txt

Robots Exclusion Standard data for machinetoolhelp.com

Resource Scan

Scan Details

Site Domain machinetoolhelp.com
Base Domain machinetoolhelp.com
Scan Status Ok
Last Scan2025-11-04T18:31:40+00:00
Next Scan 2025-11-11T18:31:40+00:00

Last Scan

Scanned2025-11-04T18:31:40+00:00
URL https://machinetoolhelp.com/robots.txt
Domain IPs 108.179.232.91
Response IP 108.179.232.91
Found Yes
Hash 42919f2d7700835b25d2536cf2831b024609b00872173ee6171cb33d8a4e3f60
SimHash fd149d194570

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /1.html
Disallow /cnc.html
Disallow /cgi-bin/trap.cgi
Disallow /zz/
Disallow /cnclock.htm
Disallow /Troubleshooting/1.html
Disallow /Repairing/1.html
Disallow /Applications/1.html
Disallow /cnc-blog-community
Disallow /node/newsletters
Disallow /newsletters/
Disallow /RSS/
Disallow /lists/
Disallow /php/
Disallow /cnc-information/database/
Disallow /cnc-information/includes/
Disallow /cnc-information/misc/
Disallow /cnc-information/modules/
Disallow /cnc-information/sites/
Disallow /cnc-information/themes/
Disallow /cnc-information/scripts/
Disallow /cnc-information/updates/
Disallow /cnc-information/profiles/
Disallow /cnc-information/xmlrpc.php
Disallow /cnc-information/cron.php
Disallow /cnc-information/update.php
Disallow /cnc-information/install.php
Disallow /cnc-information/INSTALL.txt
Disallow /cnc-information/INSTALL.mysql.txt
Disallow /cnc-information/INSTALL.pgsql.txt
Disallow /cnc-information/CHANGELOG.txt
Disallow /cnc-information/MAINTAINERS.txt
Disallow /cnc-information/LICENSE.txt
Disallow /cnc-information/UPGRADE.txt
Disallow /cnc-information/admin/
Disallow /cnc-information/comment/reply/
Disallow /cnc-information/contact/
Disallow /cnc-information/logout/
Disallow /cnc-information/node/add/
Disallow /cnc-information/search/
Disallow /cnc-information/user/register/
Disallow /cnc-information/user/password/
Disallow /cnc-information/user/login/
Disallow /cnc-information/?q=admin%2F
Disallow /cnc-information/?q=comment%2Freply%2F
Disallow /cnc-information/?q=contact%2F
Disallow /cnc-information/?q=logout%2F
Disallow /cnc-information/?q=node%2Fadd%2F
Disallow /cnc-information/?q=search%2F
Disallow /cnc-information/?q=user%2Fpassword%2F
Disallow /cnc-information/?q=user%2Fregister%2F
Disallow /cnc-information/?q=user%2Flogin%2F

Comments

  • $Id: robots.txt,v 1.9 2007/06/27 22:37:44 goba Exp $
  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)