jongeneel.nl
robots.txt

Robots Exclusion Standard data for jongeneel.nl

Resource Scan

Scan Details

Site Domain jongeneel.nl
Base Domain jongeneel.nl
Scan Status Ok
Last Scan2024-09-07T06:46:57+00:00
Next Scan 2024-10-07T06:46:57+00:00

Last Scan

Scanned2024-09-07T06:46:57+00:00
URL https://jongeneel.nl/robots.txt
Redirect https://www.jongeneel.nl/robots.txt
Redirect Domain www.jongeneel.nl
Redirect Base jongeneel.nl
Domain IPs 2a02:26f0:1180:33::210:657, 2a02:26f0:1180:33::210:65e, 95.100.96.2, 95.100.96.40
Redirect IPs 2600:1413:b000:6::17d5:2bcf, 2600:1413:b000:6::17d5:2bd9, 96.17.96.16, 96.17.96.27
Response IP 104.81.138.26
Found Yes
Hash a19ff336b4c3255001561d02c575d3c47e72f96843a68bacab3093ed45918de8
SimHash 3df7475c4ff3

Groups

*

Rule Path
Disallow /cart
Disallow /checkout
Disallow /my-account
Disallow /my-company
Disallow /wishlists
Disallow /xxmm
Disallow /0x0x0mm
Disallow /search*
Allow /*page%3D
Disallow /*?q=*
Disallow /?pagesize
Disallow /?sort=name
Disallow /incapsula
Disallow /producten/actie-van-de-maand/actie-van-de-maand/c/actie-artikelenpage%3D

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.jongeneel.nl/sitemap.xml

Comments

  • For all robots
  • Block access to specific groups of pages
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Block Proximic