bel-bo.be
robots.txt

Robots Exclusion Standard data for bel-bo.be

Resource Scan

Scan Details

Site Domain bel-bo.be
Base Domain bel-bo.be
Scan Status Ok
Last Scan 2024-06-04T20:33:25+00:00
Next Scan 2024-07-04T20:33:25+00:00

Last Scan

Scanned 2024-06-04T20:33:25+00:00
URL https://bel-bo.be/robots.txt
Redirect https://www.bel-bo.be/robots.txt
Redirect Domain www.bel-bo.be
Redirect Base bel-bo.be
Domain IPs 2001:4860:4802:32::15, 2001:4860:4802:34::15, 2001:4860:4802:36::15, 2001:4860:4802:38::15, 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 142.251.12.121, 2404:6800:4003:c05::79
Response IP 74.125.130.121
Found Yes
Hash e9998a292dc241510b5e93d5a151b2c050386ef60b4e00103ad5331db98655e3
SimHash 64327d53c7f3

Groups

baiduspider

Rule Path
Disallow /

jobdiggerspider

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Allow /*?page=
Allow /*?p=
Allow /*html?$
Disallow /index.php/
Disallow /*?
Disallow /*?page=*&
Disallow /*?p=&
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /sales/
Disallow /mydata/
Disallow /graphql
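
The Allow and Disallow rules in this group overlap, so the outcome for a given URL depends on which rule takes precedence. Here is a minimal sketch of one common interpretation, assuming the longest-match convention from RFC 9309: the matching rule with the longest pattern wins, and Allow wins a tie. `*` is treated as "any sequence of characters" and a trailing `$` as "end of URL". The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, percent-encoding is not normalized, and individual crawlers may evaluate these rules differently.

```python
import re

# Rules copied from the "*" group listed above, in file order.
RULES = [
    ("allow", "/*?page="),
    ("allow", "/*?p="),
    ("allow", "/*html?$"),
    ("disallow", "/index.php/"),
    ("disallow", "/*?"),
    ("disallow", "/*?page=*&"),
    ("disallow", "/*?p=&"),
    ("disallow", "/checkout/"),
    ("disallow", "/app/"),
    ("disallow", "/lib/"),
    ("disallow", "/*.php$"),
    ("disallow", "/pkginfo/"),
    ("disallow", "/report/"),
    ("disallow", "/var/"),
    ("disallow", "/catalog/"),
    ("disallow", "/customer/"),
    ("disallow", "/sendfriend/"),
    ("disallow", "/review/"),
    ("disallow", "/*SID%3D"),
    ("disallow", "/sales/"),
    ("disallow", "/mydata/"),
    ("disallow", "/graphql"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the URL.
    Every other character, including '?', is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = verdict == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and is_allow
            if longer or tie_for_allow:
                best_len = len(pattern)
                allowed = is_allow
    return allowed


print(is_allowed("/dames?page=2"))            # True: "/*?page=" outranks "/*?"
print(is_allowed("/dames?page=2&color=red"))  # False: "/*?page=*&" is longer
print(is_allowed("/checkout/cart"))           # False
```

Under this reading, plain pagination URLs stay crawlable while any URL carrying additional query parameters is blocked. Note that `/*html?$` only matches URLs that literally end in `html?`, because `?` is not a wildcard in robots.txt patterns.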

Other Records

Field Value
sitemap https://www.bel-bo.be/media/sitemap/sitemap_fr_be_index.xml
sitemap https://www.bel-bo.be/media/sitemap/sitemap_nl_be_index.xml
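
The crawl-delay and sitemap records can be read programmatically with Python's standard `urllib.robotparser`. The sketch below parses a small robots.txt fragment that is reconstructed from this report, not copied from the live file, so its exact layout is an assumption. Only one blocked agent is included for brevity, and the wildcard rules are left out because `urllib.robotparser` does not interpret `*` or `$` inside paths.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): one blocked agent, the crawl-delay
# record for "*", and the two sitemap records listed above.
ROBOTS_TXT = """\
User-agent: baiduspider
Disallow: /

User-agent: *
Crawl-delay: 10

Sitemap: https://www.bel-bo.be/media/sitemap/sitemap_fr_be_index.xml
Sitemap: https://www.bel-bo.be/media/sitemap/sitemap_nl_be_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "examplebot" is a hypothetical agent name used only for illustration.
blocked = parser.can_fetch("baiduspider", "https://www.bel-bo.be/")
delay = parser.crawl_delay("examplebot")
sitemaps = parser.site_maps()

print(blocked)   # False
print(delay)     # 10
print(sitemaps)
```

`site_maps()` requires Python 3.8 or later. A crawler that honors the delay would wait `delay` seconds between successive requests to the host.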

Comments

  • Baiduspider
  • Too much requests and not in target area
  • Disallow jobdigger
  • Too much requests and not in target area
  • Crawlers Setup