thebusinessconcept.com
robots.txt

Robots Exclusion Standard data for thebusinessconcept.com

Resource Scan

Scan Details

Site Domain thebusinessconcept.com
Base Domain thebusinessconcept.com
Scan Status Ok
Last Scan2025-11-02T16:46:59+00:00
Next Scan 2025-11-09T16:46:59+00:00

Last Scan

Scanned2025-11-02T16:46:59+00:00
URL https://thebusinessconcept.com/robots.txt
Redirect https://www.thebusinessconcept.com/robots.txt
Redirect Domain www.thebusinessconcept.com
Redirect Base thebusinessconcept.com
Domain IPs 104.21.40.249, 172.67.158.144, 2606:4700:3036::6815:28f9, 2606:4700:3036::ac43:9e90
Redirect IPs 104.21.40.249, 172.67.158.144, 2606:4700:3036::6815:28f9, 2606:4700:3036::ac43:9e90
Response IP 104.21.40.249
Found Yes
Hash bcdd7b8b70a6eaf643a9b04e824e6850d458e7cdb3c613e06af7e44337f2f488
SimHash f6814c12b5e0

Groups

*

Rule Path
Disallow /wp-json/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /?s=*
Disallow /search/*
Disallow /cdn-cgi/bm/cv/
Disallow /cdn-cgi/challenge-platform/

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thebusinessconcept.com/sitemaps/sitemap_index.xml

Comments

  • Our sitemap index file
  • Block certain WordPress pages and endpoints
  • Whitelist certain WordPress endpoints
  • Block search result pages
  • Block Cloudflare endpoints
  • Ban certain bots