wlw.de
robots.txt

Robots Exclusion Standard data for wlw.de

Resource Scan

Scan Details

Site Domain wlw.de
Base Domain wlw.de
Scan Status Ok
Last Scan 2024-06-16T16:24:28+00:00
Next Scan 2024-06-30T16:24:28+00:00
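
The two timestamps above imply a rescan interval, which can be checked with a few lines of Python. This is only a sketch: the interval is inferred from the two listed values and is not stated by the scanner, and the helper name `following_scan` is hypothetical.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-06-16T16:24:28+00:00")
next_scan = datetime.fromisoformat("2024-06-30T16:24:28+00:00")

# Interval inferred from the two values (an assumption, not a documented setting).
interval = next_scan - last_scan
print(interval.days)  # 14


def following_scan(scanned_at: datetime, every: timedelta = interval) -> datetime:
    """Return the time of the scan that would follow `scanned_at`."""
    return scanned_at + every


print(following_scan(last_scan).isoformat())  # 2024-06-30T16:24:28+00:00
```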

Last Scan

Scanned 2024-06-16T16:24:28+00:00
URL https://wlw.de/robots.txt
Redirect https://www.wlw.de:443/robots.txt
Redirect Domain www.wlw.de
Redirect Base wlw.de
Domain IPs 18.194.150.160, 18.195.220.98, 52.28.9.41
Redirect IPs 3.73.149.246, 52.28.69.102, 52.57.27.186
Response IP 52.57.27.186
Found Yes
Hash 9dae4a97f43c9aacc858939511f3f557e8d6b36a63d7b5176d9a16dda32166b4
SimHash 08118c54e855
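
A content hash like the one above lets a scanner tell whether robots.txt changed between scans. The sketch below assumes the 64-hex-digit Hash value is a SHA-256 digest of the response body, which the report does not confirm; the function names `content_hash` and `has_changed` and the sample bodies are illustrative only.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Hex SHA-256 digest of a robots.txt body (assumed hash algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """True if the body no longer matches the hash stored at the last scan."""
    return content_hash(body) != previous_hash.lower()


# Illustrative bodies, not the real wlw.de file.
old_body = b"User-agent: *\nDisallow: /sse/\n"
new_body = b"User-agent: *\nDisallow: /sse/\nDisallow: /en/\n"

stored = content_hash(old_body)
print(len(stored))                    # 64
print(has_changed(old_body, stored))  # False
print(has_changed(new_body, stored))  # True
```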

Groups

*

Rule Path
Disallow /extern/adm-images/
Disallow /sse/
Disallow /internal_api/
Disallow /picture500/
Disallow /en/
Disallow /fr/
Disallow /de/nachrichten/anfrage*
Disallow */produkte*?*
Disallow */firma/p-1*
Disallow */firma/p-2*
Disallow */firma/p-3*
Disallow */firma/p-4*
Disallow */firma/p-5*
Disallow */firma/p-6*
Disallow */firma/p-7*
Disallow */firma/p-8*
Disallow */firma/p-9*
Disallow */firma/p-0*
Disallow *cp_print*
Disallow */suche/ui5-*
Disallow */inside-business/checkip
Disallow */unternehmen/*
Disallow */vergleichen?*
Disallow *login?*
Disallow *oauth2/*
Disallow *?category=null*
Disallow *?q=*
Disallow *?*&q=*
Disallow *?previewAsCustomer=*
Disallow *?*&previewAsCustomer=*
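
Many of the rules above rely on the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns could be evaluated, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and a rule matches from the start of the path plus query string. The helper names and the sample paths are hypothetical, and only a subset of the listed rules is included.

```python
import re
from typing import Iterable


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal parts and turn each '*' into '.*'.
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules: Iterable[str]) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# A subset of the Disallow rules listed for the '*' group.
RULES = [
    "/sse/",
    "/de/nachrichten/anfrage*",
    "*/produkte*?*",
    "*/firma/p-1*",
    "*?q=*",
    "*?*&q=*",
]

# Sample paths are made up for illustration.
print(is_disallowed("/sse/stream", RULES))                    # True
print(is_disallowed("/de/produkte/schrauben?page=2", RULES))  # True
print(is_disallowed("/de/produkte/schrauben", RULES))         # False
print(is_disallowed("/suche?lang=de&q=pumpe", RULES))         # True
```

This sketch ignores Allow rules and longest-match precedence, since the group above contains only Disallow entries.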