businessgreen.com
robots.txt

Robots Exclusion Standard data for businessgreen.com

Resource Scan

Scan Details

Site Domain businessgreen.com
Base Domain businessgreen.com
Scan Status Ok
Last Scan 2024-09-20T20:27:46+00:00
Next Scan 2024-09-27T20:27:46+00:00

Last Scan

Scanned 2024-09-20T20:27:46+00:00
URL https://businessgreen.com/robots.txt
Redirect https://www.businessgreen.com/robots.txt
Redirect Domain www.businessgreen.com
Redirect Base businessgreen.com
Domain IPs 104.16.126.118, 104.16.127.118
Redirect IPs 104.16.126.118, 104.16.127.118
Response IP 104.16.127.118
Found Yes
Hash 63edba68016a6cec9ef38d8bb986e308624fe85746dacdadf834cd132920f0cf
SimHash 2756bad648b1

Groups

*

Rule Path
Disallow /search
Disallow /feeds/rss/search
Disallow /4818/
Disallow /print-article/
Disallow /digital_assets/
Disallow /v3_ie8_meta.html
Disallow /send-to-friend/
Disallow /tag/page/
Disallow /pdf/
Disallow /blog?month
Disallow /home/show_comment_page/

Other Records

Field Value
crawl-delay 20

googlebot

Rule Path
Disallow /search
Disallow /feeds/rss/search
Disallow /4818/
Disallow /print-article/
Disallow /digital_assets/
Disallow /v3_ie8_meta.html
Disallow /send-to-friend/
Disallow /tag/page/
Disallow /pdf/
Disallow /blog?month
Disallow /home/show_comment_page/

bingbot

Rule Path
Disallow /search
Disallow /feeds/rss/search
Disallow /4818/
Disallow /print-article/
Disallow /digital_assets/
Disallow /v3_ie8_meta.html
Disallow /send-to-friend/
Disallow /tag/page/
Disallow /pdf/
Disallow /blog?month
Disallow /home/show_comment_page/

Other Records

Field Value
crawl-delay 1

yahoo slurp!

Rule Path
Disallow /search
Disallow /feeds/rss/search
Disallow /4818/
Disallow /print-article/
Disallow /digital_assets/
Disallow /v3_ie8_meta.html
Disallow /send-to-friend/
Disallow /tag/page/
Disallow /pdf/
Disallow /blog?month
Disallow /home/show_comment_page/

Other Records

Field Value
crawl-delay 1

mediapartners-google

Rule Path
Disallow
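The groups listed so far can be checked programmatically. The sketch below uses Python's standard urllib.robotparser on a reduced reconstruction of three of them. The raw robots.txt is not shown in this report, so the ROBOTS_TXT text is an assumption rebuilt from the parsed fields, only two of the Disallow rules are kept, and the agent name ExampleBot and the path /pdf/report.pdf are hypothetical. It is meant to illustrate how prefix rules, the empty Disallow for mediapartners-google, and the crawl-delay record might be read, not to reproduce any crawler's actual behaviour.

```python
from urllib.robotparser import RobotFileParser

# Assumption: rebuilt from the parsed report, not the site's actual file.
# Only two of the eleven Disallow rules are kept for brevity.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Disallow: /pdf/
Crawl-delay: 20

User-agent: googlebot
Disallow: /search
Disallow: /pdf/

User-agent: mediapartners-google
Disallow:
"""

BASE = "https://www.businessgreen.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent in ("Googlebot", "Mediapartners-Google", "ExampleBot"):
        for path in ("/search", "/pdf/report.pdf", "/news/"):
            print(agent, path, allowed(agent, path))
        print(agent, "crawl-delay:", parser.crawl_delay(agent))
```

An empty Disallow value places no restriction on the agent, which matches the "allow adsense access" note in the file's comments. Agents without a group of their own fall back to the * group, including its crawl-delay of 20.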

ahrefsbot
compspybot
crystalsemanticsbot
curious george
cybeye.com
daumoa
docomo
exb language crawler
ezooms
flamingo_searchengine
genieo
genio
gsa-crawler
lexxebot
libcrawl
linkdex
lwnutch
magpie-crawler
meltwater
mnogosearch
omgilibot/0.3
openwebindex
psbot
rediffnewsbot
repparser
scanmine
screaming frog seo spider
seoengworldbot
shopwiki
showyoubot
sindice-site-manager
sogou
sogou spider
sosospider
webvac
wocbot
woriobot
yacybot
yeti
yolinkbot_text
youdaobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.businessgreen.com/sitemap_index.xml
sitemap https://www.businessgreen.com/news-sitemap.xml
sitemap https://www.businessgreen.com/videositemap.xml

Comments

  • Robots.txt for https://www.businessgreen.com
  • Updated 12th July 2017 by JH
  • Version 1.10 - https declarations
  • Sitemap declarations
  • Agent specific disallowed sections
  • Googlebot
  • Bingbot
  • Yahoo
  • Ad serving - allow adsense access
  • Fully exclude these robots from crawling anything
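As a rough illustration of the "fully exclude" group and the sitemap declarations, the sketch below feeds a shortened, reconstructed fragment to Python's urllib.robotparser. The fragment is an assumption: it keeps only two of the listed agents, and it repeats one Sitemap line to show how a consumer might collapse repeated declarations. RobotFileParser.site_maps() needs Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: shortened reconstruction, not the site's actual file.
# Several User-agent lines in a row share the rules that follow them.
ROBOTS_TXT = """\
User-agent: ahrefsbot
User-agent: yeti
Disallow: /

Sitemap: https://www.businessgreen.com/sitemap_index.xml
Sitemap: https://www.businessgreen.com/news-sitemap.xml
Sitemap: https://www.businessgreen.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "Disallow: /" matches every path, so these agents may fetch nothing.
blocked = {
    agent: parser.can_fetch(agent, "https://www.businessgreen.com/")
    for agent in ("AhrefsBot", "Yeti")
}

# site_maps() returns the Sitemap records in file order (or None if absent);
# dict.fromkeys drops repeats while preserving that order.
unique_sitemaps = list(dict.fromkeys(parser.site_maps() or []))

if __name__ == "__main__":
    print(blocked)
    for url in unique_sitemaps:
        print(url)
```

Sitemap records are not tied to any user-agent group, so declaring them once has the same effect as repeating them.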