digivate.com
robots.txt

Robots Exclusion Standard data for digivate.com

Resource Scan

Scan Details

Site Domain digivate.com
Base Domain digivate.com
Scan Status Ok
Last Scan 2025-08-28T12:10:20+00:00
Next Scan 2025-09-27T12:10:20+00:00

Last Scan

Scanned 2025-08-28T12:10:20+00:00
URL https://digivate.com/robots.txt
Redirect https://www.digivate.com/robots.txt
Redirect Domain www.digivate.com
Redirect Base digivate.com
Domain IPs 85.92.81.130
Redirect IPs 85.92.81.130
Response IP 85.92.81.130
Found Yes
Hash 7881ab62e3d1f88bd14dc657a5d492c71519170cbfb37d8e6cd980b6aa8fdbd9
SimHash 4a00494679b8

Groups

*

Rule Path
Disallow /profile
Disallow /cgi-bin/
Disallow /trackback/
Disallow /comments/
Disallow /administrator/
Disallow */trackback/
Disallow */comments/
Disallow /license.txt
Disallow /*.php$
Disallow *?filter
Disallow /readme.html
Disallow /text.php
Disallow /blog/2011/03/
Disallow /blog/2011/11/
Disallow /blog/2011/12/
Disallow /blog/2011/02/
Disallow /blog/2011/07/
Disallow /blog/author/admin/
Disallow /blog/page/16/
Disallow /blog/page/17/
Disallow /blog/page/18/
Disallow /blog/page/19/
Disallow /blog/page/20/
Disallow /blog/page/21/
Disallow /blog/page/22/
Disallow /blog/page/23/
Disallow /blog/page/24/
Disallow /blog/page/25/
Disallow /blog/category/uncategorized/
Disallow /blog/tag/principles/
Disallow /sitemap.xml.gz

Other Records

Field Value
crawl-delay 5
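
The `*` group above mixes plain path prefixes (`/profile`, `/cgi-bin/`) with wildcard paths (`*/trackback/`, `/*.php$`, `*?filter`). As a rough illustration of how such paths are commonly interpreted, the sketch below assumes the wildcard semantics described in RFC 9309, where `*` matches any sequence of characters and a trailing `$` anchors the end of the URL path. The helper names `pattern_to_regex` and `matches` and the sample request paths are hypothetical and are not part of the scan data; individual crawlers may interpret these rules differently.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics (RFC 9309 style): '*' matches any run of
    characters, a trailing '$' anchors the end of the path, and
    everything else is a literal prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def matches(pattern: str, path: str) -> bool:
    """Return True if the rule pattern applies to the request path."""
    # re.match anchors at the start, which gives prefix matching.
    return pattern_to_regex(pattern).match(path) is not None


# Rule paths are taken from the '*' group; request paths are made up.
for rule, path in [
    ("/*.php$", "/index.php"),
    ("/*.php$", "/index.php?x=1"),
    ("*?filter", "/shop?filter=red"),
    ("*/comments/", "/blog/post/comments/"),
    ("/profile", "/profiles/jane"),
]:
    print(f"{rule!r} vs {path!r}: {matches(rule, path)}")
```

Note that a plain path such as `/profile` acts as a prefix under these assumptions, so it would also cover `/profiles/jane`.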

scrapy

Rule Path
Allow /

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
semrush
semrushbot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /

siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
splitsignalbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.digivate.com/sitemap_index.xml

Comments

  • Sitemap for search engines
  • Global crawl settings for all user-agents
  • Blocking specific blog sections
  • Blocking compressed sitemap
  • Allow Scrapy
  • Blocking unwanted bots that consume bandwidth
  • Additional bots to be blocked
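
To show how the groups listed above could be evaluated by a client, here is a minimal sketch using Python's standard `urllib.robotparser`. The excerpt is reassembled from a few of the scanned rules; the order of the groups in the live file, the `ExampleBot` agent name, and the sample URLs are assumptions made for illustration. `urllib.robotparser` does plain prefix matching and does not interpret `*` or `$` inside paths, so the wildcard rules are left out of the excerpt.

```python
from urllib.robotparser import RobotFileParser

# Small excerpt rebuilt from the scanned groups. The group order in
# the real robots.txt is an assumption, and wildcard rules are omitted.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /blog/2011/03/
Disallow: /blog/author/admin/
Crawl-delay: 5

User-agent: scrapy
Allow: /

User-agent: ahrefsbot
User-agent: mj12bot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)

# "ExampleBot" is a hypothetical agent that falls back to the '*' group.
checks = [
    ("ExampleBot", "https://www.digivate.com/blog/2011/03/post"),
    ("ExampleBot", "https://www.digivate.com/services/"),
    ("AhrefsBot", "https://www.digivate.com/"),
    ("Scrapy", "https://www.digivate.com/blog/2011/03/post"),
]
for agent, url in checks:
    print(agent, url, parser.can_fetch(agent, url))

# Crawl delay that applies to agents covered only by the '*' group.
print("crawl-delay:", parser.crawl_delay("ExampleBot"))
```

In this excerpt the hypothetical `ExampleBot` is refused the 2011 blog archive but allowed elsewhere, `AhrefsBot` is refused everything, and `Scrapy` is allowed by its own group.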