mysj.de
robots.txt

Robots Exclusion Standard data for mysj.de

Resource Scan

Scan Details

Site Domain mysj.de
Base Domain mysj.de
Scan Status Ok
Last Scan2024-05-25T12:51:46+00:00
Next Scan 2024-06-01T12:51:46+00:00

Last Scan

Scanned2024-05-25T12:51:46+00:00
URL https://mysj.de/robots.txt
Redirect https://www.serienjunkies.de/robots.txt
Redirect Domain www.serienjunkies.de
Redirect Base serienjunkies.de
Domain IPs 172.104.231.59, 2a01:7e01:1::ac68:e73b
Redirect IPs 91.234.30.137
Response IP 91.234.30.137
Found Yes
Hash 7b0abb4840dd5f6b9f1f24a86ec232bc36413d12cdb67691192d45e543085317
SimHash 2b2117584f73

Groups

*

Rule Path
Disallow /lightweight-ajax
Disallow /*?trafficsource
Disallow /suche/
Disallow /*?cmp=defrss
Disallow /test/
Disallow /bi/bootstrap/
Disallow /bi/doop/
Disallow /sso/

xovi

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /test/

gptbot

Rule Path
Allow /ueber-uns/
Disallow /

ccbot

Rule Path
Allow /ueber-uns/
Disallow /

msnbot

Rule Path
Disallow /test/

Other Records

Field Value
crawl-delay 5

Comments

  • robots.txt www.serienjunkies.de
  • Legal notice: www.serienjunkies.de expressly reserves the right to use its content for commercial text and data mining (ยง 44b UrhG).
  • The use of robots or other automated means to access www.serienjunkies.de or collect or mine data without the express permission of www.serienjunkies.de is strictly prohibited.