sauerlandkurier.de
robots.txt

Robots Exclusion Standard data for sauerlandkurier.de

Archived Snapshots

Resource Scan

Scan Details

Site Domain	sauerlandkurier.de
Base Domain	sauerlandkurier.de
Scan Status	Ok
Last Scan	2024-06-28T07:11:41+00:00
Next Scan	2024-07-05T07:11:41+00:00

Last Scan

Scanned	2024-06-28T07:11:41+00:00
URL	https://sauerlandkurier.de/robots.txt
Domain IPs	193.30.29.210
Response IP	193.30.29.210
Found	Yes
Hash	3a50015a5e176a8301529dc2a1524845906431a907a18af33d1babe40087fc6b
SimHash	2b2313180f71

Groups

*

Rule	Path
Disallow	/lightweight-ajax
Disallow	/*?trafficsource
Disallow	/suche/
Disallow	/*?cmp=defrss
Disallow	/test/
Disallow	/fdn/bootstrap/
Disallow	/bi/bootstrap/
Disallow	/bi/doop/
Disallow	/sso/

Rule

Path

Disallow

/lightweight-ajax

Disallow

/*?trafficsource

Disallow

/suche/

Disallow

/*?cmp=defrss

Disallow

/test/

Disallow

/fdn/bootstrap/

Disallow

/bi/bootstrap/

Disallow

/bi/doop/

Disallow

/sso/

xovi

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix

Rule	Path
Disallow	/

Rule

Path

Disallow

searchmetricsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bingbot

Rule	Path
Disallow	/test/

Rule

Path

Disallow

/test/

gptbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

Rule

Path

Allow

/ueber-uns/

Disallow

ccbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

Rule

Path

Allow

/ueber-uns/

Disallow

msnbot

Rule	Path
Disallow	/test/

Rule

Path

Disallow

/test/

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

Comments

robots.txt www.sauerlandkurier.de
Legal notice: www.sauerlandkurier.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
The use of robots or other automated means to access www.sauerlandkurier.de or collect or mine data without the express permission of www.sauerlandkurier.de is strictly prohibited.

sauerlandkurier.derobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

xovi

sistrix

searchmetricsbot

bingbot

gptbot

ccbot

msnbot

Other Records

Comments

sauerlandkurier.de
robots.txt