theworldgadget.com
robots.txt

Robots Exclusion Standard data for theworldgadget.com

Resource Scan

Scan Details

Site Domain theworldgadget.com
Base Domain theworldgadget.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-10-15T00:40:51+00:00
Next Scan 2024-12-14T00:40:51+00:00

Last Successful Scan

Scanned2024-08-17T00:14:52+00:00
URL https://theworldgadget.com/robots.txt
Domain IPs 107.6.172.84
Response IP 107.6.172.84
Found Yes
Hash f306627a48e1180d3951ae376c2a8354b5a99073214dbd9a8fb0a06c24370ff9
SimHash aab8c8186418

Groups

*

Rule Path
Disallow /wp-
Disallow /?s=*
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Disallow /author/
Allow /*.js$
Allow /*.css$
Disallow /page/*/
Disallow /categoria/*/page/*/
Disallow /*/feed/
Disallow /*/feed/rss/
Disallow /*/trackback/
Disallow /tracback/
Disallow /*/attachment/
Disallow /tag/*/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /*/*/*/feed.xml
Disallow /?attachment_id%2F

Other Records

Field Value
sitemap https://www.theworldgadget.com/sitemap.xml

Comments

  • Disallow: /tienda/
  • Disallow: /productos/*/
  • Disallow: /contacto/

Warnings

  • 2 invalid lines.