ippen.media
robots.txt

Robots Exclusion Standard data for ippen.media

Resource Scan

Scan Details

Site Domain ippen.media
Base Domain ippen.media
Scan Status Ok
Last Scan2024-06-13T23:58:47+00:00
Next Scan 2024-07-13T23:58:47+00:00

Last Scan

Scanned2024-06-13T23:58:47+00:00
URL https://ippen.media/robots.txt
Domain IPs 91.234.30.234
Response IP 91.234.30.234
Found Yes
Hash 4ca2a51f9d3fe202da468521f6fe720f2c0cabc9017f16de9a9bb1afef25446f
SimHash 2b2113584bb1

Groups

*

Rule Path
Disallow /lightweight-ajax
Disallow /*?trafficsource
Disallow /suche/
Disallow /*?cmp=defrss
Disallow /test/
Disallow /netzwerk/
Disallow /bi/bootstrap/
Disallow /bi/doop/
Disallow /sso/

xovi

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /test/

gptbot

Rule Path
Allow /ueber-uns/
Disallow /

ccbot

Rule Path
Allow /ueber-uns/
Disallow /

msnbot

Rule Path
Disallow /test/

Other Records

Field Value
crawl-delay 5

Comments

  • robots.txt www.ippen.media
  • Legal notice: www.ippen.media expressly reserves the right to use its content for commercial text and data mining (ยง 44b UrhG).
  • The use of robots or other automated means to access www.ippen.media or collect or mine data without the express permission of www.ippen.media is strictly prohibited.