giessener-allgemeine.de
robots.txt

Robots Exclusion Standard data for giessener-allgemeine.de

Resource Scan

Scan Details

Site Domain	giessener-allgemeine.de
Base Domain	giessener-allgemeine.de
Scan Status	Ok
Last Scan	2024-06-27T03:39:50+00:00
Next Scan	2024-07-04T03:39:50+00:00

Last Scan

Scanned	2024-06-27T03:39:50+00:00
URL	https://giessener-allgemeine.de/robots.txt
Domain IPs	91.234.30.209
Response IP	91.234.30.209
Found	Yes
Hash	04f37f11418a9cb2e3162346b0983893b24f5168c093aa7bf3087463a01307c0
SimHash	2b2113180f33
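
The 64-hex-character Hash value is consistent with a SHA-256 digest, although the page does not name the algorithm or say which bytes were hashed. As a minimal sketch under that assumption, a fetched robots.txt body could be fingerprinted as follows; `robots_fingerprint` is a hypothetical helper name, not part of the scanner.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt body.

    Assumption: the scanner hashes the unmodified response bytes.
    The report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Illustrative body only; this is not the real file content.
    sample = b"User-agent: *\nDisallow: /test/\n"
    print(robots_fingerprint(sample))
```

Comparing the digest from two scans is a cheap way to detect that the file changed at all; a similarity hash such as the SimHash value above is meant for judging how much it changed.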

Groups

*

Rule	Path
Disallow	/lightweight-ajax
Disallow	/*?trafficsource
Disallow	/suche/
Disallow	/*?cmp=defrss
Disallow	/test/
Disallow	/fdn/bootstrap/
Disallow	/bi/bootstrap/
Disallow	/bi/doop/
Disallow	/sso/

xovi

Rule	Path
Disallow	/

sistrix

Rule	Path
Disallow	/

searchmetricsbot

Rule	Path
Disallow	/

bingbot

Rule	Path
Disallow	/test/

gptbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

ccbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

msnbot

Rule	Path
Disallow	/test/
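
To see how the groups above interact, the following sketch rebuilds a robots.txt from the group tables and queries it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption: the original file's group order, comments, and crawl-delay placement are not shown in this report. The `allowed` helper, the `ExampleBot` agent, and the `/lokales/` path are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from the group tables in this report;
# the real file may order or format its records differently.
ROBOTS_TXT = """\
User-agent: *
Disallow: /lightweight-ajax
Disallow: /*?trafficsource
Disallow: /suche/
Disallow: /*?cmp=defrss
Disallow: /test/
Disallow: /fdn/bootstrap/
Disallow: /bi/bootstrap/
Disallow: /bi/doop/
Disallow: /sso/

User-agent: xovi
Disallow: /

User-agent: sistrix
Disallow: /

User-agent: searchmetricsbot
Disallow: /

User-agent: bingbot
Disallow: /test/

User-agent: gptbot
Allow: /ueber-uns/
Disallow: /

User-agent: ccbot
Allow: /ueber-uns/
Disallow: /

User-agent: msnbot
Disallow: /test/
"""

BASE = "https://giessener-allgemeine.de"


def allowed(agent: str, path: str) -> bool:
    """Check whether `agent` may fetch `path` under the reconstructed rules."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent, path in [
        ("gptbot", "/ueber-uns/"),
        ("gptbot", "/lokales/"),
        ("bingbot", "/suche/"),
        ("ExampleBot", "/suche/"),
    ]:
        print(agent, path, allowed(agent, path))
```

Two points are worth noting. A crawler that has its own group, such as bingbot, follows only that group, so the `/suche/` rule from the `*` group does not apply to it. The standard-library parser also matches paths by plain prefix, so the wildcard rules (`/*?trafficsource`, `/*?cmp=defrss`) are not evaluated as patterns here; a parser with wildcard support would be needed to test those.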

Other Records

Field	Value
crawl-delay	5

Comments

robots.txt www.giessener-allgemeine.de
Legal notice: www.giessener-allgemeine.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
The use of robots or other automated means to access www.giessener-allgemeine.de or collect or mine data without the express permission of www.giessener-allgemeine.de is strictly prohibited.
