die-tagespost.de
robots.txt

Robots Exclusion Standard data for die-tagespost.de

Resource Scan

Scan Details

Site Domain die-tagespost.de
Base Domain die-tagespost.de
Scan Status Ok
Last Scan 2024-11-17T06:02:31+00:00
Next Scan 2024-11-24T06:02:31+00:00

Last Scan

Scanned 2024-11-17T06:02:31+00:00
URL https://die-tagespost.de/robots.txt
Redirect https://www.die-tagespost.de/robots.txt
Redirect Domain www.die-tagespost.de
Redirect Base die-tagespost.de
Domain IPs 82.211.32.82
Redirect IPs 82.211.32.209
Response IP 82.211.32.209
Found Yes
Hash 0b27bf4867a5161577bf4eca3285a6d438e1d1a93288f2b19e5e256f48601f75
SimHash b0222c28edb6

Groups

*

Rule Path
Disallow /_/ecards.html
Disallow /_/ecards.html*
Disallow /_/epaper/
Disallow /_/sendmail.html*
Disallow /_/tools/bb_redirect.html*
Disallow /_/tools/diaview.html?prev=true*
Disallow /_/tools/pdfpage.html
Disallow /_/tools/pdfpage.html?arid=*
Disallow /_/tools/pdfpage.html*
Disallow /?_FCLIST=*
Disallow /?voteerg=*
Disallow /*_FORMAT%3DPRINT*
Disallow /*?_CMFUNC*
Disallow /*?_FRAME=*
Disallow /*?_UNLOCK=nocache
Disallow /*?fcms=*
Disallow /*?fcms=*
Disallow /*?ID=*
Disallow /*?link2=*
Disallow /*?pin_type=*
Disallow /*?po_id=*
Disallow /*?select1=*
Disallow /*?SID*
Disallow /*?sid*
Disallow /*.asp$
Disallow /*.exec$
Disallow /*%26list%3D1$
Disallow /*ID%3D*
Disallow /*index.php?option=*&task=*&id=*&Itemid=*
Disallow /*Z$
Disallow /2008/
Disallow /admin/*
Disallow /abfall/
Disallow /au/
Disallow /backup/
Disallow /bcw_rightbox/
Disallow /chat/
Disallow /cme*
Disallow /cmspic/
Disallow /Dokumente/
Disallow /dpa/
Disallow /kiosk/epa*
Disallow /fehler/
Disallow /flash/
Disallow /hugo/
Disallow /import/
Disallow /kna-nachrichten/
Disallow /linkatory/
Disallow /mobile/
Disallow /msuup/
Disallow /online_test/
Disallow /ordner/
Disallow /suche/
Disallow /tgs-videos/
Disallow /tscontent/
Disallow /wetterImages/
Disallow /_/pics/spacer.gif
Disallow /leserbefragung2022
Disallow /apa-epaper/
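
Several of the paths in the `*` group use the `*` wildcard and the `$` end anchor. The following is a minimal sketch of how such patterns might be evaluated, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, only a few rules from the list above are included, and percent-encoding normalization and Allow precedence are ignored.

```python
import re

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path, patterns):
    """Return True if any pattern matches the path from its start."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

# A small subset of the Disallow paths listed above.
RULES = ["/*.asp$", "/*?SID*", "/suche/", "/_/tools/pdfpage.html*"]

# Hypothetical request paths, used only for illustration.
asp_blocked = is_disallowed("/old/page.asp", RULES)
aspx_blocked = is_disallowed("/old/page.aspx", RULES)
sid_blocked = is_disallowed("/artikel?SID=abc", RULES)
section_blocked = is_disallowed("/kultur/", RULES)
```

Matching here is case-sensitive, which is consistent with the list carrying both `/*?SID*` and `/*?sid*` as separate rules.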

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /
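
As a rough illustration of how the per-agent groups above behave, the sketch below feeds a shortened reconstruction of the file to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is an assumption rebuilt from the listing rather than the raw file, and `ExampleBot` is a hypothetical agent name. Only literal-prefix rules are included because the standard-library parser does plain prefix matching and does not interpret `*` or `$` inside paths.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of the groups listed above (assumption:
# the raw file uses standard "User-agent" / "Disallow" lines).
ROBOTS_TXT = """\
User-agent: *
Disallow: /_/epaper/
Disallow: /suche/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.die-tagespost.de"

# Agents with their own group are blocked from the whole site.
gptbot_home = parser.can_fetch("GPTBot", BASE + "/")

# Agents without their own group fall back to the "*" group.
other_search = parser.can_fetch("ExampleBot", BASE + "/suche/")
other_home = parser.can_fetch("ExampleBot", BASE + "/")
```

Agent names are compared case-insensitively by this parser, so `GPTBot` matches the `gptbot` group.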

Other Records

Field Value
sitemap https://www.die-tagespost.de/sitemap.sitemap.xml
sitemap https://www.die-tagespost.de/online-Artikel.sitemap.xml
sitemap https://www.die-tagespost.de/News-XML.sitemap.xml
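
The sitemap records can be read with the same standard-library parser. This is a minimal sketch, assuming the raw file declares them as ordinary `Sitemap:` lines and that Python 3.8 or later is available for `RobotFileParser.site_maps()`.

```python
from urllib.robotparser import RobotFileParser

# Assumed raw form of the three sitemap records listed above.
SITEMAP_LINES = [
    "Sitemap: https://www.die-tagespost.de/sitemap.sitemap.xml",
    "Sitemap: https://www.die-tagespost.de/online-Artikel.sitemap.xml",
    "Sitemap: https://www.die-tagespost.de/News-XML.sitemap.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns the declared URLs, or None if there are none.
sitemaps = sitemap_parser.site_maps() or []
for url in sitemaps:
    print(url)
```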

Comments

  • ID: 1
  • Legal notice: die-tagespost.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access faz.net or collect or mine data without the express permission of die-tagespost.de is strictly prohibited.
  • die-tagespost.de may, in its discretion, permit certain automated access to certain die-tagespost.de pages,
  • If you would like to apply for permission to crawl die-tagespost.de, collect or use data, please email info@die-tagespost.de.