smart-energy.com
robots.txt

Robots Exclusion Standard data for smart-energy.com

Resource Scan

Scan Details

Site Domain smart-energy.com
Base Domain smart-energy.com
Scan Status Ok
Last Scan 2024-11-14T20:05:42+00:00
Next Scan 2024-12-14T20:05:42+00:00

Last Scan

Scanned 2024-11-14T20:05:42+00:00
URL https://www.smart-energy.com/robots.txt
Domain IPs 13.33.28.109, 13.33.28.113, 13.33.28.16, 13.33.28.30
Response IP 13.33.28.109
Found Yes
Hash 120f33983f01556e0217278bf41cc9350b33c8274f83c487296d430289d3be65
SimHash 4260484a37ba

Groups

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /

*

Rule Path
Allow /wp-includes/js/

*

Rule Path
Disallow /wp-admin/

*

Rule Path
Disallow /wp-includes/

*

Rule Path
Disallow /xmlrpc.php

*

Rule Path
Disallow /profile

*

Rule Path
Disallow /cgi-bin/

*

Rule Path
Disallow /wp-content/cache/

*

Rule Path
Disallow /trackback/

*

Rule Path
Disallow /comments/

*

Rule Path
Disallow /administrator/

*

Rule Path
Disallow */trackback/

*

Rule Path
Disallow */comments/

*

Rule Path
Disallow /license.txt

*

Rule Path
Disallow /*.php$

*

Rule Path
Disallow *?filter

*

Rule Path
Disallow /wp-content/themes/

*

Rule Path
Disallow /readme.html

*

Rule Path
Disallow /wp-content/cache/
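
Several of the paths listed above (`/*.php$`, `*?filter`, `*/trackback/`) rely on the `*` and `$` wildcards rather than plain prefix matching. The following is a minimal sketch of how such patterns are commonly interpreted, assuming the usual convention that `*` matches any run of characters and a trailing `$` anchors the end of the URL. The helper names `rule_to_regex` and `rule_matches` and the sample URLs are hypothetical and are not part of the scan data.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the match at the end of the URL path/query.
    Without '$', the pattern behaves as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if the rule pattern applies to the given path."""
    # re.match anchors at the start, which gives prefix semantics.
    return rule_to_regex(pattern).match(path) is not None


if __name__ == "__main__":
    samples = [
        ("/*.php$", "/index.php"),
        ("/*.php$", "/index.php?x=1"),
        ("*?filter", "/products?filter=a"),
        ("*/trackback/", "/some-post/trackback/"),
        ("/wp-admin/", "/wp-admin/options"),
    ]
    for pattern, path in samples:
        print(f"{pattern!r:16} {path!r:28} {rule_matches(pattern, path)}")
```

Note that this only decides whether a single rule applies to a URL. Choosing between a matching Allow and a matching Disallow (for example by longest match) is a separate step and is not shown here.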

Other Records

Field Value
sitemap https://www.smart-energy.com/sitemap_index.xml
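
The groups and sitemap record above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a small excerpt of the file from the scan data; the exact layout of the original robots.txt is an assumption, only two of the blocked crawlers are included, and `ExampleBot` is a hypothetical user agent. The standard-library parser does plain prefix matching and applies the first matching rule, so the wildcard rules (`/*.php$`, `*?filter`, `*/trackback/`) are left out of the excerpt.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the scan above; the real file's layout is assumed.
ROBOTS_TXT = """\
User-agent: ahrefsbot
User-agent: mj12bot
Disallow: /

User-agent: *
Allow: /wp-includes/js/
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /xmlrpc.php
Disallow: /profile

Sitemap: https://www.smart-energy.com/sitemap_index.xml
"""

BASE = "https://www.smart-energy.com"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser()

if __name__ == "__main__":
    checks = [
        ("AhrefsBot", "/"),                          # named group: fully blocked
        ("ExampleBot", "/wp-admin/"),                # falls back to the * group
        ("ExampleBot", "/wp-includes/js/jquery.js"),
        ("ExampleBot", "/wp-includes/css/x.css"),
        ("ExampleBot", "/news/"),
    ]
    for agent, path in checks:
        print(agent, path, parser.can_fetch(agent, BASE + path))
    # site_maps() is available in Python 3.8 and later.
    print(parser.site_maps())
```

Because the `Allow /wp-includes/js/` line precedes `Disallow /wp-includes/` in this excerpt, the first-match behaviour of `urllib.robotparser` permits the JavaScript path while still blocking the rest of `/wp-includes/`.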

Comments

  • Horrible bandwidth eating robots
  • BEGIN W3TC ROBOTS
  • END W3TC ROBOTS