broadbandtvnews.com
robots.txt

Robots Exclusion Standard data for broadbandtvnews.com

Resource Scan

Scan Details

Site Domain broadbandtvnews.com
Base Domain broadbandtvnews.com
Scan Status Ok
Last Scan 2024-08-31T20:55:26+00:00
Next Scan 2024-09-30T20:55:26+00:00

Last Scan

Scanned 2024-08-31T20:55:26+00:00
URL https://broadbandtvnews.com/robots.txt
Redirect https://www.broadbandtvnews.com/robots.txt
Redirect Domain www.broadbandtvnews.com
Redirect Base broadbandtvnews.com
Domain IPs 192.124.249.38
Redirect IPs 192.124.249.38
Response IP 192.124.249.38
Found Yes
Hash e61431f8b520b9ccd39f1f206703f9d1792ae5538296a1bf77a5f9dccbba6f2a
SimHash 6ae74941f6a1
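
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check between two scans might look like the sketch below. The function name `robots_digest` and the sample bodies are hypothetical, and the 12-character SimHash is not reproduced here because its scheme is not documented in the report.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical bodies from two consecutive scans.
previous_body = b"User-agent: *\nDisallow: /wp-admin/\n"
current_body = b"User-agent: *\nDisallow: /wp-admin/\nDisallow: /search/\n"

# Differing digests indicate the file changed between scans.
changed = robots_digest(previous_body) != robots_digest(current_body)
print("robots.txt changed:", changed)
```

An exact digest only says whether the bytes differ at all; a similarity hash such as the SimHash field is the kind of value that would indicate how much they differ.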

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /cgi-bin/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-signup.php
Disallow /?s=*
Disallow /search/
Disallow /search/*
Disallow /wp-json/
Disallow /*/trackback/
Disallow /*/comments/
Disallow /*?add-to-cart=*
Disallow /*?orderby=*
Disallow /*?filter_*
Disallow /cdn-cgi/bm/cv/
Disallow /cdn-cgi/challenge-platform/
Allow /wp-content/uploads/
Allow /wp-content/cache/

ahrefsbot
semrushbot
mj12bot
dotbot
nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule Path
Disallow /

googlebot

Rule Path
Disallow /*/feed/

bingbot

Rule Path
Disallow /*/feed/

slurp

Rule Path
Disallow /*/feed/

*

Rule Path
Allow /*/feed/

feedly

Rule Path
Allow /*/feed/

newsblur

Rule Path
Allow /*/feed/

inoreader

Rule Path
Allow /*/feed/

feedspot

Rule Path
Allow /*/feed/

theoldreader

Rule Path
Allow /*/feed/

feedburner

Rule Path
Allow /*/feed/

feedreader

Rule Path
Allow /*/feed/

newsfire

Rule Path
Allow /*/feed/

newsgator

Rule Path
Allow /*/feed/

my yahoo

Rule Path
Allow /*/feed/
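
The groups above rely on `*` wildcards, which Python's standard `urllib.robotparser` does not interpret. The sketch below is a minimal, hand-written evaluator that follows the matching rules of RFC 9309 as I understand them: a crawler uses the group naming its product token, falls back to `*` only if no group names it, and the longest matching pattern wins, with Allow winning ties. The `GROUPS` table is a hand-copied subset of the rules listed above, with the two `*` groups merged. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are illustrative, and the user-agent argument is assumed to be a bare product token rather than a full User-Agent header.

```python
import re

# Subset of the groups listed above, keyed by lowercase product token.
GROUPS = {
    "*": [
        ("disallow", "/wp-admin/"),
        ("disallow", "/*?orderby=*"),
        ("allow", "/wp-content/uploads/"),
        ("allow", "/*/feed/"),  # from the second "*" group, merged here
    ],
    "ahrefsbot": [("disallow", "/")],
    "googlebot": [("disallow", "/*/feed/")],
    "feedly": [("allow", "/*/feed/")],
}


def pattern_to_regex(pattern):
    """Translate a path pattern: '*' matches any run, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(product_token, path):
    """Return True if the path may be crawled under the rules in GROUPS."""
    # A crawler with its own group ignores the "*" group entirely.
    rules = GROUPS.get(product_token.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed.
    return True if best is None else best[1]


print(is_allowed("googlebot", "/news/feed/"))
print(is_allowed("feedly", "/news/feed/"))
print(is_allowed("ahrefsbot", "/news/"))
```

One consequence of this reading is that crawlers with their own group (googlebot, bingbot, slurp) are bound only by the feed rule in that group, so the `/wp-admin/` and other Disallow lines in the `*` group would not apply to them.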

Other Records

Field Value
sitemap https://www.broadbandtvnews.com/sitemap.xml
sitemap https://www.broadbandtvnews.com/news-sitemap.xml
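
The sitemap records can be read back out of a robots.txt body with the standard library. The sketch below uses `urllib.robotparser.RobotFileParser.site_maps()` (available from Python 3.8). The `ROBOTS_EXCERPT` text is a reconstruction for illustration only, not the site's actual file layout.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt; the real file's ordering and comments are not shown here.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /wp-admin/

Sitemap: https://www.broadbandtvnews.com/sitemap.xml
Sitemap: https://www.broadbandtvnews.com/news-sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# site_maps() returns None when no Sitemap lines are present.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```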

Comments

  • Global rules
  • -----------------
  • Prevent crawling CF challenge URLs
  • Allow access to necessary assets
  • Sitemaps
  • -----------------
  • Ban bots that don't benefit us.
  • --------------------------------
  • Block feeds for search engines to reduce server load
  • -------------------------------
  • Allow feeds for other user agents and specific RSS readers
  • -----------------------------------
  • Ensure Feedly and other popular RSS readers are allowed
  • Ensure FeedBurner is allowed
  • Additional popular RSS readers