stevesgoods.com
robots.txt

Robots Exclusion Standard data for stevesgoods.com

Resource Scan

Scan Details

Site Domain stevesgoods.com
Base Domain stevesgoods.com
Scan Status Ok
Last Scan2024-08-27T21:09:28+00:00
Next Scan 2024-09-26T21:09:28+00:00

Last Scan

Scanned2024-08-27T21:09:28+00:00
URL https://stevesgoods.com/robots.txt
Domain IPs 104.16.159.43, 104.17.9.99
Response IP 104.16.159.43
Found Yes
Hash 57a551372fa002d348f2a73095b26423b94e864c13870825da2326b82aeb85cb
SimHash a8155a01cde2

Groups

*

Rule Path Comment
Allow /wp-admin/admin-ajax.php -
Allow /wp-content/uploads/* -
Allow /wp-content/*.js -
Allow /wp-content/*.css -
Allow /wp-includes/*.js -
Allow /wp-includes/*.css -
Disallow /strippedCss/ -
Disallow /*?* -
Disallow /?lang=es -
Disallow /cgi-bin -
Disallow /cdn-cgi/ -
Disallow /*/attachment/ -
Disallow /tag/ -
Allow /page/ -
Allow /*/page/ -
Allow /*/*/page/ -
Allow /*/*/*/page/ -
Disallow /*/*/?add-to-cart=* -
Disallow /*/*/*/?add-to-cart=* -
Disallow /comments/ -
Disallow /xmlrpc.php -
Disallow /?attachment_id* -
Disallow /author/ -
Allow /category/ -
Disallow /uncategorized/ -
Disallow /product-tag/ -
Disallow /checkout/ -
Disallow /privacy-policy/ -
Disallow /terms-and-conditions/ -
Disallow /my-account/ -
Disallow /etiqueta-producto/ -
Disallow /?page_id=* -
Disallow /*/?add-to-cart=* -
Disallow /*/image.svg -
Disallow /wholesale-registration-page/ -
Disallow /wholesale-log-in-page/ -
Allow /hemp-industry-blog/ -
Disallow / /
Disallow /?page_id=10748 -
Disallow /*/recommendation_icon.png -

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*trackback*
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

semrushbot

Rule Path
Allow /*.js$
Allow /*.css$

bingbot

Rule Path
Allow /*.css$
Allow /*.js$

msnbot

Rule Path
Allow /*.css$
Allow /*.js$

slurp

Rule Path
Allow /*.css$
Allow /*.js$

duckduckbot

Rule Path
Allow /*.css$
Allow /*.js$

baidu

Rule Path
Allow /*.css$
Allow /*.js$

applebot

Rule Path
Allow /*.css$
Allow /*.js$

archive.org_bot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap https://stevesgoods.com/sitemap_index.xml

Comments

  • robots.txt file for https://stevesgoods.com/
  • Search blocking
  • Trackback blocking
  • Rss Feed Blocking
  • Blocking Bad bots and search crawlers
  • Prevents resource problems blocked in Google Webmaster Tools