jeanscentre.nl
robots.txt

Robots Exclusion Standard data for jeanscentre.nl

Resource Scan

Scan Details

Site Domain jeanscentre.nl
Base Domain jeanscentre.nl
Scan Status Ok
Last Scan2024-09-07T23:20:08+00:00
Next Scan 2024-10-07T23:20:08+00:00

Last Scan

Scanned2024-09-07T23:20:08+00:00
URL https://jeanscentre.nl/robots.txt
Redirect https://www.jeanscentre.nl/robots.txt
Redirect Domain www.jeanscentre.nl
Redirect Base jeanscentre.nl
Domain IPs 76.76.21.21
Redirect IPs 76.76.21.22, 76.76.21.9
Response IP 76.76.21.241
Found Yes
Hash 1a2686200ce8e40532110873466985588eae3fde27cd6c778a72163dec487566
SimHash 2f02ff3140b1

Groups

*

Rule Path
Allow /
Disallow */jeansmaat_*
Disallow */fit_*
Disallow */prijs_*
Disallow */undefined_*
Disallow */categorie_*
Disallow *pretty-path%3D*
Disallow *sort%3D*
Disallow */cart/
Disallow */checkout/
Disallow */login/
Disallow */search/

ingrid

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

yahoo! slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

cazoodlebot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.jeanscentre.nl/sitemap.xml

Warnings

  • `host` is not a known field.