ippen.media
robots.txt

Robots Exclusion Standard data for ippen.media

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ippen.media
Base Domain	ippen.media
Scan Status	Ok
Last Scan	2024-06-13T23:58:47+00:00
Next Scan	2024-07-13T23:58:47+00:00

Last Scan

Scanned	2024-06-13T23:58:47+00:00
URL	https://ippen.media/robots.txt
Domain IPs	91.234.30.234
Response IP	91.234.30.234
Found	Yes
Hash	4ca2a51f9d3fe202da468521f6fe720f2c0cabc9017f16de9a9bb1afef25446f
SimHash	2b2113584bb1

Groups

*

Rule	Path
Disallow	/lightweight-ajax
Disallow	/*?trafficsource
Disallow	/suche/
Disallow	/*?cmp=defrss
Disallow	/test/
Disallow	/netzwerk/
Disallow	/bi/bootstrap/
Disallow	/bi/doop/
Disallow	/sso/

Rule

Path

Disallow

/lightweight-ajax

Disallow

/*?trafficsource

Disallow

/suche/

Disallow

/*?cmp=defrss

Disallow

/test/

Disallow

/netzwerk/

Disallow

/bi/bootstrap/

Disallow

/bi/doop/

Disallow

/sso/

xovi

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix

Rule	Path
Disallow	/

Rule

Path

Disallow

searchmetricsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bingbot

Rule	Path
Disallow	/test/

Rule

Path

Disallow

/test/

gptbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

Rule

Path

Allow

/ueber-uns/

Disallow

ccbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

Rule

Path

Allow

/ueber-uns/

Disallow

msnbot

Rule	Path
Disallow	/test/

Rule

Path

Disallow

/test/

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

Comments

robots.txt www.ippen.media
Legal notice: www.ippen.media expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
The use of robots or other automated means to access www.ippen.media or collect or mine data without the express permission of www.ippen.media is strictly prohibited.

ippen.mediarobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

xovi

sistrix

searchmetricsbot

bingbot

gptbot

ccbot

msnbot

Other Records

Comments

ippen.media
robots.txt