cualhost.com
robots.txt

Robots Exclusion Standard data for cualhost.com

Resource Scan

Scan Details

Site Domain cualhost.com
Base Domain cualhost.com
Scan Status Ok
Last Scan2026-02-10T22:27:03+00:00
Next Scan 2026-02-17T22:27:03+00:00

Last Scan

Scanned2026-02-10T22:27:03+00:00
URL https://cualhost.com/robots.txt
Domain IPs 34.174.40.233
Response IP 34.174.40.233
Found Yes
Hash fc42b01e81cb408827fdede0dfb02a69d6ddd4a35816ecfa6d151a061891c6e4
SimHash 28184b12c0a1

Groups

googlebot

Rule Path
Allow /sitemap.xml
Allow /sitemap.xml.gz

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Disallow /cgi-bin/
Disallow /about
Disallow /contact
Disallow /wp-
Disallow /feed/
Disallow /trackback
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.cgi$
Disallow /*.wmv$
Disallow /*.png$
Disallow /*.gif$
Disallow /*.jpg$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*.php*
Disallow /wp-*
Allow /wp-content/uploads/

googlebot-image

Rule Path
Allow /*

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

Comments

  • disallow all files in these directories
  • allow Google ImageBot to search all images
  • disallow archiving site
  • disable duggmirror