thurgauerzeitung.ch
robots.txt

Robots Exclusion Standard data for thurgauerzeitung.ch

Resource Scan

Scan Details

Site Domain thurgauerzeitung.ch
Base Domain thurgauerzeitung.ch
Scan Status Ok
Last Scan 2024-11-08T06:29:27+00:00
Next Scan 2024-11-15T06:29:27+00:00

Last Scan

Scanned 2024-11-08T06:29:27+00:00
URL https://thurgauerzeitung.ch/robots.txt
Redirect https://www.thurgauerzeitung.ch/robots.txt
Redirect Domain www.thurgauerzeitung.ch
Redirect Base thurgauerzeitung.ch
Domain IPs 194.40.216.15
Redirect IPs 194.40.216.18
Response IP 194.40.216.18
Found Yes
Hash ef5800f795a9edb9b2673749567fda08fd3e15fcf3e936b757f1a2e57e6419ba
SimHash 4b9e9cc2fd32

Groups

*

Rule Path
Disallow /suche$
Disallow /suche?
Disallow /marktplaetze
Disallow /test/
Disallow /_fragment?
Disallow /fragments/render/
Disallow /api/
Disallow /digitaldata
Disallow /dynamic-partials/
Disallow /statistic
Disallow /thurgauerzeitung/webview2/
Disallow /sport/live-resultate
Disallow /blaize/
Disallow /zephr/
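
The rules in the * group mix plain prefixes (such as /api/) with an end-anchored pattern (/suche$). A minimal sketch of how such rule paths are commonly matched, assuming RFC 9309 conventions (a "*" wildcard and a trailing "$" end anchor), is shown below. The helper name rule_matches and the sample paths /suche/archiv, /suche?q=thurgau and /api/v1/items are hypothetical and used only for illustration; this is not the scanner's or any particular crawler's implementation.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt rule path matches a URL path (plus query).

    Hypothetical helper: '*' matches any run of characters, a trailing '$'
    anchors the end of the URL, and everything else is a literal prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*' wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += r"\Z"
    # re.match anchors at the start, which gives prefix semantics.
    return re.match(regex, path) is not None


# Rule paths taken from the '*' group; the tested URLs are made up.
print(rule_matches("/suche$", "/suche"))            # exact path
print(rule_matches("/suche$", "/suche/archiv"))     # longer path
print(rule_matches("/suche?", "/suche?q=thurgau"))  # literal '?' prefix
print(rule_matches("/api/", "/api/v1/items"))       # plain prefix
```

Under these assumptions, /suche$ blocks only the bare search page, while /suche? separately covers search URLs that carry a query string.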

googlebot-news

Rule Path
Disallow /brandedcontent
Disallow /services
Disallow /sponsoredcontent
Disallow /wettbewerbe
Disallow /paidcontent
Disallow /aboplus

google-extended

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

blp_bbot/0.1

Rule Path
Disallow /

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1
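
The slurp group has no path rules but carries a crawl-delay record. As a rough illustration of what that means for a crawler, the sketch below feeds a small excerpt to Python's standard urllib.robotparser. The excerpt is reconstructed from the parsed records in this report, so its exact layout is an assumption, and SomeOtherBot is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the groups listed in this report (assumed
# layout; only the parsed records are shown here, not the raw file).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /api/
Disallow: /test/

User-agent: Slurp
Crawl-delay: 1
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

base = "https://www.thurgauerzeitung.ch"

# A crawler that has its own group uses that group instead of '*'.
slurp_delay = parser.crawl_delay("Slurp")
slurp_api = parser.can_fetch("Slurp", base + "/api/")

# 'SomeOtherBot' is hypothetical; it falls back to the '*' group.
other_api = parser.can_fetch("SomeOtherBot", base + "/api/")

print(slurp_delay, slurp_api, other_api)
```

Because a matching group replaces the * group rather than adding to it, this parser treats /api/ as allowed for slurp even though it is disallowed for crawlers without their own group.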

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thurgauerzeitung.ch/sitemap.xml
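
The gptbot, chatgpt-user and ccbot groups each disallow the whole site, and the sitemap record applies independently of any user agent. The following sketch, again using Python's standard urllib.robotparser on an excerpt reconstructed from this report (assumed layout), shows how a client might check both. ExampleBot is a hypothetical user agent, and site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the parsed groups in this report (assumed layout).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /api/

User-agent: GPTBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: CCBot
Disallow: /

Sitemap: https://www.thurgauerzeitung.ch/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

home = "https://www.thurgauerzeitung.ch/"

# Crawlers with a 'Disallow: /' group may not fetch any path.
ai_access = {
    bot: parser.can_fetch(bot, home)
    for bot in ("GPTBot", "ChatGPT-User", "CCBot")
}

# 'ExampleBot' is hypothetical and only subject to the '*' group.
generic_access = parser.can_fetch("ExampleBot", home)

# Sitemap records are collected regardless of the surrounding group.
sitemaps = parser.site_maps()

print(ai_access, generic_access, sitemaps)
```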

Comments

  • robots.txt for https://www.thurgauerzeitung.ch/