real.de
robots.txt

Robots Exclusion Standard data for real.de

Resource Scan

Scan Details

Site Domain real.de
Base Domain real.de
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-06-11T08:29:57+00:00
Next Scan 2025-08-10T08:29:57+00:00

Last Successful Scan

Scanned2024-10-15T08:13:59+00:00
URL https://real.de/robots.txt
Redirect https://www.kaufland.de/robots.txt
Redirect Domain www.kaufland.de
Redirect Base kaufland.de
Domain IPs 31.204.116.31
Redirect IPs 141.101.90.104, 141.101.90.105, 141.101.90.106, 141.101.90.107
Response IP 141.101.90.104
Found Yes
Hash 1c9fc87e8c3598f18573ab56746c80a8996b3da3eced3b090ca2c0615e27b598
SimHash 40c680e8a730

Groups

*

Rule Path
Disallow /cart/*
Disallow /cart-new/*
Disallow /checkout/*
Disallow /item/search/*page
Disallow /s/
Allow /shops/*/$
Disallow /shops/*/*
Disallow /account/
Disallow *autosuggest*
Disallow /backend/account/v1/widget
Disallow /optmzly/df-json/
Disallow /backend/iam/
Disallow /backend/search/
Disallow /backend/navigation/
Disallow /backend/tracking/
Disallow /backend/home-page/
Disallow /backend/recommendations/
Disallow /backend/product-detail-page/
Disallow *%26st%3D*
Disallow *%26ac%3D*
Disallow *id_unit%3D*

mediapartners-google

Rule Path
Allow /item/search/
Allow /s/

pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

petalbot

Rule Path
Disallow /

gptbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1